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The Fourth Edition 


Scientific understanding of the molecular nature of life is growing at an astounding 
rate. Significantly, society is the prime beneficiary of this increased understanding. 
Cures for diseases, better public health, remedies for environmental pollution, and 
the development of cheaper and safer natural products are just a few practical ben- 
efits of this knowledge. 

In addition, this expansion of information fuels, in the words of Thomas Jeffer- 
son, “the illimitable freedom of the human mind.” Scientists can use the tools of bio- 
chemistry and molecular biology to explore all aspects of an organism—from basic 
questions about its chemical composition, through inquiries into the complexities 
of its metabolism, its differentiation and development, to analysis of its evolution 
and even its behavior. New procedures based on the results of these explorations lie at the 
heart of the many modern medical miracles. Biochemistry is a science whose boundaries 
now encompass all aspects of biology, from molecules to cells, to organisms, to ecol- 
ogy, and to all aspects of health care. This fourth edition of Biochemistry embodies and 
reflects the expanse of this knowledge. We hope that this new edition will encour- 
age students to ask questions of their own and to push the boundaries of their cu- 
riosity about science. 


Making Connections 


As the explication of natural phenomena rests more and more on biochemistry, 
its inclusion in undergraduate and graduate curricula in biology, chemistry, and 
the health sciences becomes imperative. The challenge to authors and instructors 
is a formidable one: how to familiarize students with the essential features of mod- 
ern biochemistry in an introductory course or textbook. Fortunately, the in- 
creased scope of knowledge allows scientists to make generalizations connecting 
the biochemical properties of living systems with the character of their con- 
stituent molecules. As a consequence, these generalizations, validated by repeti- 
tive examples, emerge in time as principles of biochemistry, principles that are 
useful in discerning and describing new relationships between diverse biomolec- 
ular functions and in predicting the mechanisms underlying newly discovered 
biomolecular processes. Nevertheless, it is increasingly apparent that students 
must develop skills in inquiry-based learning, so that, beyond this first encounter 
with biochemical principles and concepts, students are equipped to explore sci- 
ence on their own. Much of the design of this new edition is meant to foster the 
development of such skills. 

We are both biochemists, but one of us is in a biology department, and the other 
is in a chemistry department. Undoubtedly, we each view biochemistry through the 
lens of our respective disciplines. We believe, however, that our collaboration on 
this textbook represents a melding of our perspectives that will provide new di- 
mensions of appreciation and understanding for all students. 
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Preface 


Our Audience 


This biochemistry textbook is designed to communicate the fundamental principles 
governing the structure, function, and interactions of biological molecules to stu- 
dents encountering biochemistry for the first time. We aim to bring an appreciation 
of biochemistry to a broad audience that includes undergraduates majoring in the 
life sciences, physical sciences, or premedical programs, as well as medical students 
and graduate students in the various health sciences for whom biochemistry is an 
important route to understanding human physiology. To make this subject matter 
more relevant and interesting to all readers, we emphasize, where appropriate, the 
biochemistry of humans. 


Objectives and Building on Previous Editions 


We carry forward the clarity of purpose found in previous editions; namely, to illu- 
minate for students the principles governing the structure, function, and interac- 
tions of biological molecules. At the same time, this new edition has been revised to 
reflect tremendous developments in biochemistry. Significantly, emphasis is placed 
on the interrelationships of ideas so that students can begin to appreciate the over- 
arching questions of biochemistry. 


Features 


e Clarity of Instruction This edition was streamlined for increased clarity and read- 
ability. Many of the lengthier figure legends were shortened and more informa- 
tion was included directly within illustrations. These changes will help the more 
visual reader. 

e Visual Instruction The richness of the Protein Data Bank (www.pdb.org) and 
availability of molecular graphics software has been exploited to enliven this text. 
Over 330 images of prominent proteins and nucleic acids involved with essential 
biological functions illustrate and inform the subject matter and were prepared 
especially for this book. 

e New End-of-Chapter Problems More than 600 end-of-chapter problems are pro- 
vided, about 15 percent of which are new. They serve as meaningful exercises 
that help students develop problem-solving skills useful in achieving their learn- 
ing goals. Some problems require students to employ calculations to find math- 
ematical answers to relevant structural or functional questions. Other questions 
address conceptual problems whose answers require application and integration 
of ideas and concepts introduced in the chapter. Each set of problems concludes 
with MCAT practice questions to aid students in their preparation for standard- 
ized examinations such as the MCAT or GRE. 

e Human Biochemistry essays emphasize the central role of basic biochemistry in 
medicine and the health sciences. These essays often present clinically important 
issues such as diet, diabetes, and cardiovascular health. 

e A Deeper Look essays expand on the text, highlighting selected topics or exper- 
imental observations. 

e Critical Developments in Biochemistry essays emphasize recent and historical 
advances in the field. 

e Up-to-Date References at the end of each chapter make it easy for students to 
find additional information about each topic. 

¢ Laboratory Techniques The experimental nature of biochemistry is highlighted, 
and a list of laboratory techniques found in this book can be seen on page xxvi. 

e Essential Questions Each chapter in this book is framed around an Essential 
Question that invites students to become actively engaged in their learning, and 
encourages curiosity and imagination about the subject matter. For example, the 
Essential Question of Chapter 3 asks, “What are the laws and principles of ther- 
modynamics that allow us to describe the flows and interchange of heat, energy, 


and matter in systems of interest?” The section heads then pose key questions 
such as, “What Is the Daily Human Requirement for ATP?” The end-of-chapter 
summary then brings the question and a synopsis of the answer together for the 
student. In addition, the CengageNOW site at www.cengage.com/login expands 
on this Essential Question theme by asking students to explore their knowledge 
of key concepts. 

e Key Questions The section headings within chapters are phrased as important 
questions that serve as organizing principles for a lecture. The subheadings are 
designed to be concept statements that respond to the section headings. 

e Text-to-Web Instruction Through icons in the margins, in figure legends, and 
within boxes, students are encouraged to further test their mastery of the Essen- 
tial and Key Questions and to explore interactive tutorials and animations at 
CengageNOW at www.cengage.com/login. 

e Linking Key Questions to Chapter Summaries The end-of-chapter summaries recite 
the key questions posed as section heads and then briefly summarize the important 
concepts and facts to aid students in organizing and understanding the material. 

e Active and Animated Figures at CengageNOW Many text figures, labeled Active 
(Figure 3.1) or Animated (Figure 3.5), can be found at www.cengage.com/login. 
Active Figures have corresponding test questions where students can quiz them- 
selves on the concepts of the figures. Animated Figures give life to the art by al- 
lowing students to watch the progress of an animation. This site also includes 
“Essential Questions” for Biochemistry. These questions are open-ended and can 
be assigned as student projects by instructors. This website also includes instruc- 
tor PowerPoint slides with embedded animations/simulations as well as molecu- 
lar movies for the classroom. 


New to This Edition 


Biochemistry is an ever-expanding discipline and new research leads to expanding 
our knowledge. This edition highlights the newest developments in the field. 


Chapter 5 Analysis of amino acid sequence information from genomic databases re- 
veals functional relationships between proteins, as well as their evolutionary history. 


Chapter 6 The discussion of protein structure now includes protein structure clas- 
sification databases (SCOP and CATH); the flexible, marginally stable nature of 
proteins; expanded coverage of intrinsically unstructured proteins; and special fea- 
tures, such as the molecular mousetrap (Q)-antitrypsin). 


Chapter 7 Glycomics and the structural code of carbohydrates; galectins as media- 
tors of inflammation, immunity, and cancer; and C-reactive protein, a lectin that 
limits inflammation damage, highlight this chapter. 


Chapter 8 Discussions of healthy dietary oils and fats, including canola oil and 
Benecol, the novel lipids in archaea, lipids as signals, and lipidomics as a framework 
for understanding the many roles of lipids are now included. 


Chapter 9 New concepts of membrane structure, function, and dynamics, and the 
recently solved structures of membrane channel proteins, active transport proteins, 
and ABC transporters are featured. 


Chapter 10 The exciting prospects for DNA nanodevices and the applied science of 
nanotechnology are reviewed, and the evolution of contemporary life from an RNA- 
based world is presented. 


Chapter 11 Nucleic acid sequencing by automated, fluorescence-based or light- 
emitting techniques has made possible sequencing the DNA of individuals. The 
structure of DNA multiplexes composed of 3 or 4 polynucleotide strands and the 
higher orders of structure in RNA molecules are new topics in this chapter. 


Chapter 12 The use of RNA interference (RNAi) as a tool to discover gene function 
and various analytical methods for probing protein-protein interaction are two new 
methodologies pertinent to this chapter. 
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Chapter 13 The possibility of creating enzymes designed to catalyze any desired re- 
action is introduced. 


Chapter 14 Enzyme mechanisms (Chapter 14 Mechanisms of Enzyme Action) are pre- 
sented before enzyme regulation (Chapter 15 Enzyme Regulation), allowing students 
to appreciate the catalytic power of enzymes immediately after learning about their 
kinetic properties (Chapter 13 Enzymes—Kinetics and Specificity). The coverage of en- 
zyme mechanisms has been reorganized, shortened, and simplified in this chapter. 
New topics added in this chapter include medical and commercial applications of 
enzyme transition state analogs, a primer on how to read and write enzyme mecha- 
nisms, the roles of near-attack complexes and protein motion in enzyme catalysis, 
and a new feature on chorismate mutase. 


Chapter 15 The regulation of enzyme activity through allosteric mechanisms is pre- 
sented in a simplified and integrated form, and the different covalent modifications 
that alter protein function are characterized. 


Chapter 16 The chapter provides substantially revised discussions of myosin, ki- 
nesin, and dynein motors; an updated discussion of helicases, including the papil- 
loma virus El helicase spiral staircase; and new information on the flagellar rotor 
structure and mechanism. 


Chapter 17 The emerging science of metabolomics and systems biology highlight 
this chapter. 


Chapter 18 The significance of glycolysis to overall metabolism is illustrated 
through a discussion of hypoxia inducible factor (HIF), a protein that acts in the 
absence of oxygen to activate transcription of genes for glycolytic enzymes. 


Chapter 19 Discussion of the TCA cycle has been updated and a new “A Deeper 
Look” box on the coenzymes of the TCA cycle has been added. 


Chapter 20 The chapter now includes discussions of the structures of the electron 
transport complexes, the ATP synthase as a rotational molecular motor that uses the 
energy of a proton gradient to drive synthesis of ATP, and the role of mitochondria 
in cell signaling and apoptosis. 


Chapter 21 The structural details of the photosystems that transduce light energy 
into chemical energy have given new insights into photosynthesis. 


Chapter 22 The identity of xylulose-5-phosphate as a metabolic regulator is a new 
feature in this chapter. 


Chapter 23 This chapter is enhanced by new information on the structure and func- 
tion of the enzymes of 6-oxidation, therapeutic effects of exercise in reversing the 
consequences of metabolic syndrome, and natural antioxidants in foods that can 
improve fat metabolism. 


Chapter 24 The recent revelation that megasynthases catalyze fatty acid synthesis in 
eukaryotes is presented in this chapter, along with new information on the structure 
and function of the LDL receptor. 


Chapter 25 Relationships between amino acid metabolism and human disease, such 
as the significance of aspargine to leukemia, are underscored. 


Chapter 26 The phenomenon of metabolic channeling as a principle in metabolic 
organization and integration is emphasized. 


Chapter 27 The role of AMP-activated protein kinase as the sensor of cellular en- 
ergy levels and regulator of whole-body energy homeostasis is introduced, and the 
biochemical connections between caloric restriction or red wine consumption to 
prolonged lifespan are explored. 


Chapter 29 The structural studies of RNA polymerase that brought Roger Kornberg 
the Nobel Prize form the basis for a deeper understanding of transcription. Also 
presented are chromatin remodeling and histone modifications as processes deter- 
mining the accessibility of chromatin to the transcriptional apparatus. 


Chapter 30 Recent discoveries regarding the molecular structure of ribosomes have 
provided new insights about the mechanisms by which they synthesize proteins. 


Chapter 31 The descriptions of protein folding include new information on how 
ATP drives and regulates protein folding by chaperonins. It is now clear that AAA+ 
ATPase modules mediate the unfolding of proteins in the proteasome. Small, 
ubiquitin-like protein modules (SUMOs) are presented as key modifiers in the post- 
translational regulation of protein function. 


Chapter 32 The chapter has been substantially revised and reorganized to consoli- 
date information on membrane receptor structure and function. Included here is 
new material about the epidermal growth factor receptor, the insulin receptor, and 
the atrial natriuretic peptide receptor, as well as the organization and integration 
of cell signaling pathways and the action of G-protein-coupled receptors through 
G-protein-independent pathways. 


Complete Support Package 


For Students 


Student Solutions Manual, Study Guide and Problems Book by David K. Jemiolo 
(Vassar College) and Steven M. Theg (University of California, Davis) includes sum- 
maries of the chapters, detailed solutions to all end-of-chapter problems, a guide to 
key points of each chapter, important definitions, and illustrations of major meta- 
bolic pathways. (0-495-11460-X) 


Student Lecture Notebook Perfect for note taking during lecture, this conve- 
nient booklet consists of black and white reproductions of the PowerPoint slides. 
(0-495-11461-8) 


CengageNOW at www.cengage.com/login CengageNOW’s online self-assessment 
tool is developed specifically for this text, extending the “Essential Questions” 
framework. You can explore a variety of tutorials, exercises, and simulations 
(cross-referenced throughout the text with margin annotations). You can also 
take chapter-specific Pre-Tests and receive a Personalized Study plan that directs 
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The Facts of Life: Chemistry 
Is the Logic of Biological 
Phenomena 





ESSENTIAL QUESTION 


Molecules are lifeless. Yet, the properties of living things derive from the properties 
of molecules. 


Despite the spectacular diversity of life, the elaborate structure of biological 
molecules, and the complexity of vital mechanisms, are life functions ulti- 
mately interpretable in chemical terms? 


Molecules are lifeless. Yet, in appropriate complexity and number, molecules com- 
pose living things. These living systems are distinct from the inanimate world because 
they have certain extraordinary properties. They can grow, move, perform the in- 
credible chemistry of metabolism, respond to stimuli from the environment, and 
most significantly, replicate themselves with exceptional fidelity. The complex struc- 
ture and behavior of living organisms veil the basic truth that their molecular consti- 
tution can be described and understood. The chemistry of the living cell resembles 
the chemistry of organic reactions. Indeed, cellular constituents or biomolecules 
must conform to the chemical and physical principles that govern all matter. Despite 
the spectacular diversity of life, the intricacy of biological structures, and the com- 
plexity of vital mechanisms, life functions are ultimately interpretable in chemical 
terms. Chemistry is the logic of biological phenomena. 


els What Are the Distinctive Properties of Living Systems? 


First, the most obvious quality of living organisms is that they are complicated and 
highly organized (Figure 1.1). For example, organisms large enough to be seen with 
the naked eye are composed of many cells, typically of many types. In turn, these 
cells possess subcellular structures, called organelles, which are complex assem- 
blies of very large polymeric molecules, called macromolecules. These macro- 
molecules themselves show an exquisite degree of organization in their intricate 
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FIGURE 1.1 (a) Gelada (Theropithecus gelada), a baboon native to the Ethiopian highlands. (b) Tropical orchid 
(Bulbophyllum blumei), New Guinea. 
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“ 


..everything that living things do can be under- 
stood in terms of the jigglings and wigglings 
of atoms.” 

Richard P. Feynman 
Lectures on Physics, Addison-Wesley, 1963 





KEY QUESTIONS 


1.1 What Are the Distinctive Properties 
of Living Systems? 
1.2 What Kinds of Molecules Are Biomolecules? 
1.3. What Is the Structural Organization of 
Complex Biomolecules? 


1.4 | How Do the Properties of Biomolecules 
Reflect Their Fitness to the Living 
Condition? 





1.5 What Is the Organization and Structure 
of Cells? 


1.6 What Are Viruses? 
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three-dimensional architecture, even though they are composed of simple sets of 
chemical building blocks, such as sugars and amino acids. Indeed, the complex 
three-dimensional structure of a macromolecule, known as its conformation, is a 
consequence of interactions between the monomeric units, according to their in- 
dividual chemical properties. 

Second, biological structures serve functional purposes. That is, biological structures 
play a role in the organism’s existence. From parts of organisms, such as limbs and 
organs, down to the chemical agents of metabolism, such as enzymes and metabolic 
intermediates, a biological purpose can be given for each component. Indeed, it is 
this functional characteristic of biological structures that separates the science of bi- 
ology from studies of the inanimate world such as chemistry, physics, and geology. In 
biology, it is always meaningful to seek the purpose of observed structures, organiza- 
tions, or patterns, that is, to ask what functional role they serve within the organism. 

Third, living systems are actively engaged in energy transformations. Maintenance of the 
highly organized structure and activity of living systems depends on their ability to 
extract energy from the environment. The ultimate source of energy is the sun. So- 
lar energy flows from photosynthetic organisms (organisms able to capture light en- 
ergy by the process of photosynthesis) through food chains to herbivores and ulti- 
mately to carnivorous predators at the apex of the food pyramid (Figure 1.2). The 
biosphere is thus a system through which energy flows. Organisms capture some of 
this energy, be it from photosynthesis or the metabolism of food, by forming special 
energized biomolecules, of which ATP and NADPH are the two most prominent ex- 
amples (Figure 1.3). (Commonly used abbreviations such as ATP and NADPH are 
defined on the inside back cover of this book.) ATP and NADPH are energized bio- 
molecules because they represent chemically useful forms of stored energy. We ex- 
plore the chemical basis of this stored energy in subsequent chapters. For now, suf- 
fice it to say that when these molecules react with other molecules in the cell, the 
energy released can be used to drive unfavorable processes. That is, ATP, NADPH, 
and related compounds are the power sources that drive the energy-requiring activ- 
ities of the cell, including biosynthesis, movement, osmotic work against concentra- 
tion gradients, and in special instances, light emission (bioluminescence). Only 
upon death does an organism reach equilibrium with its inanimate environment. 
The living state is characterized by the flow of energy through the organism. At the expense 


Entropy is a thermodynamic term used to desig- of this energy flow, the organism can maintain its intricate order and activity far re- 
nate that amount of energy in a system that is moved from equilibrium with its surroundings, yet exist in a state of apparent con- 
unavailable to do work. stancy over time. This state of apparent constancy, or so-called steady state, is actu- 
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FIGURE 1.2 The food pyramid. Photosynthetic organisms at the base capture light energy. Herbivores and car- 
nivores derive their energy ultimately from these primary producers. 


1.1. What Are the Distinctive Properties of Living Systems? 3 


FIGURE 1.3 ATP and NADPH, two bio- NHo 
chemically important energy-rich 
compounds. 
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ally a very dynamic condition: Energy and material are consumed by the organism OL N NÍ 
and used to maintain its stability and order. In contrast, inanimate matter, as exem- 
plified by the universe in totality, is moving to a condition of increasing disorder or, 
in thermodynamic terms, maximum entropy. OH O 
Fourth, living systems have a remarkable capacity for self-replication. Generation after a ae 


generation, organisms reproduce virtually identical copies of themselves. This self- i 
replication can proceed by a variety of mechanisms, ranging from simple division in 

bacteria to sexual reproduction in plants and animals; but in every case, it is char- NADPH 
acterized by an astounding degree of fidelity (Figure 1.4). Indeed, if the accuracy of 

self-replication were significantly greater, the evolution of organisms would be ham- 

pered. This is so because evolution depends upon natural selection operating on in- 

dividual organisms that vary slightly in their fitness for the environment. The fidelity 





FIGURE 1.4 Organisms resemble their parents. (a) The Garrett 
guys at Hatteras. Left to right: son Randal, Peg Garrett, grand- 
sons Reggie and Ricky, son Jeff, grandson Jackson, and son 
Robert. (b) Orangutan with infant. (c) The Grisham family. Left 
to right: Charles, David, Rosemary, Emily, and Andrew. 
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of self-replication resides ultimately in the chemical nature of the genetic material. 
This substance consists of polymeric chains of deoxyribonucleic acid, or DNA, 
which are structurally complementary to one another (Figure 1.5). These mole- 
cules can generate new copies of themselves in a rigorously executed polymeriza- 
tion process that ensures a faithful reproduction of the original DNA strands. In 
contrast, the molecules of the inanimate world lack this capacity to replicate. A 
crude mechanism of replication must have existed at life’s origin. 


f2 What Kinds of Molecules Are Biomolecules? 


The elemental composition of living matter differs markedly from the relative abun- 
dance of elements in the earth’s crust (Table 1.1). Hydrogen, oxygen, carbon, and ni- 
trogen constitute more than 99% of the atoms in the human body, with most of the H 
and O occurring as HO. Oxygen, silicon, aluminum, and iron are the most abundant 
atoms in the earth’s crust, with hydrogen, carbon, and nitrogen being relatively rare 
(less than 0.2% each). Nitrogen as dinitrogen (Nə) is the predominant gas in the at- 
mosphere, and carbon dioxide (COs) is present at a level of 0.04%, a small but critical 
amount. Oxygen is also abundant in the atmosphere and in the oceans. What property 
unites H, O, C, and N and renders these atoms so suitable to the chemistry of life? It 
is their ability to form covalent bonds by electron-pair sharing. Furthermore, H, C, N, 
and O are among the lightest elements of the periodic table capable of forming such 
bonds (Figure 1.6). Because the strength of covalent bonds is inversely proportional to 
the atomic weights of the atoms involved, H, C, N, and O form the strongest covalent 
bonds. Two other covalent bond-forming elements, phosphorus (as phosphate 
[—OPO,?"] derivatives) and sulfur, also play important roles in biomolecules. 


Biomolecules Are Carbon Compounds 


All biomolecules contain carbon. The prevalence of C is due to its unparalleled ver- 
satility in forming stable covalent bonds through electron-pair sharing. Carbon can 
form as many as four such bonds by sharing each of the four electrons in its outer 
shell with electrons contributed by other atoms. Atoms commonly found in covalent 
linkage to C are C itself, H, O, and N. Hydrogen can form one such bond by con- 
tributing its single electron to the formation of an electron pair. Oxygen, with two 
unpaired electrons in its outer shell, can participate in two covalent bonds, and ni- 
trogen, which has three unshared electrons, can form three such covalent bonds. 
Furthermore, C, N, and O can share two electron pairs to form double bonds with 
one another within biomolecules, a property that enhances their chemical versatil- 
ity. Carbon and nitrogen can even share three electron pairs to form triple bonds. 

Two properties of carbon covalent bonds merit particular attention. One is the 
ability of carbon to form covalent bonds with itself. The other is the tetrahedral na- 
ture of the four covalent bonds when carbon atoms form only single bonds. Together 
these properties hold the potential for an incredible variety of linear, branched, and 
cyclic compounds of C. This diversity is multiplied further by the possibilities for in- 
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TABLE 1.1 | 

Earth's Crust Seawater Human Body* 
Element % Compound mM Element % 
O 47 cr 548 H 63 
Si 28 Na* 470 O 25.5 
Al 7.9 Meg?* 54 C 9.5 
Fe 4.5 SO, 28 N 1.4 
Ca 3.5 Ca?* 10 Ca 0.31 
Na 2.5 Kt 10 P 0.22 
K 2.5 HCO;~ 2,3 Cl 0.08 
Mg 2.2 NO; 0.01 K 0.06 
Ti 0.46 HPO, <0.001 S 0.05 
H 0.22 Na 0.03 
C 0.19 Mg 0.01 











*Figures for the earth's crust and the human body are presented as percentages of the total number of atoms; seawater 
data are in millimoles per liter. Figures for the earth's crust do not include water, whereas figures for the human body do. 
Trace elements found in the human body serving essential biological functions include Mn, Fe, Co, Cu, Zn, Mo, |, Ni, and Se. 


cluding N, O, and H atoms in these compounds (Figure 1.7). We can therefore en- 
vision the ability of C to generate complex structures in three dimensions. These 
structures, by virtue of appropriately included N, O, and H atoms, can display unique 
chemistries suitable to the living state. Thus, we may ask, is there any pattern or un- 
derlying organization that brings order to this astounding potentiality? 


‘le3e What Is the Structural Organization 
of Complex Biomolecules? 


Examination of the chemical composition of cells reveals a dazzling variety of or- 
ganic compounds covering a wide range of molecular dimensions (Table 1.2). As this 
complexity is sorted out and biomolecules are classified according to the similarities 
of their sizes and chemical properties, an organizational pattern emerges. The bio- 
molecules are built according to a structural hierarchy: Simple molecules are the 
units for building complex structures. 

The molecular constituents of living matter do not reflect randomly the infinite 
possibilities for combining C, H, O, and N atoms. Instead, only a limited set of the 
many possibilities is found, and these collections share certain properties essential 
to the establishment and maintenance of the living state. The most prominent as- 
pect of biomolecular organization is that macromolecular structures are con- 
structed from simple molecules according to a hierarchy of increasing structural 
complexity. What properties do these biomolecules possess that make them so ap- 
propriate for the condition of life? 


Metabolites Are Used to Form the Building Blocks of Macromolecules 


The major precursors for the formation of biomolecules are water, carbon dioxide, 
and three inorganic nitrogen compounds—ammonium (NH,*), nitrate (NO;~), and 
dinitrogen (Nə). Metabolic processes assimilate and transform these inorganic pre- 
cursors through ever more complex levels of biomolecular order (Figure 1.8). In the 
first step, precursors are converted to metabolites, simple organic compounds that 
are intermediates in cellular energy transformation and in the biosynthesis of various 
sets of building blocks: amino acids, sugars, nucleotides, fatty acids, and glycerol. 
Through covalent linkage of these building blocks, the macromolecules are con- 
structed: proteins, polysaccharides, polynucleotides (DNA and RNA), and lipids. 
(Strictly speaking, lipids contain relatively few building blocks and are therefore not 
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FIGURE 1.7 Examples of the versatility of C—C bonds in building complex structures: linear, cyclic, branched, 
and planar. 
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TABLE 1.2 | 











The dimensions of mass* and length for biomolecules are given typically in daltons and 
nanometers,' respectively. One dalton (D) is the mass of one hydrogen atom, 1.67 X 10% g, 
One nanometer (nm) is 107°? m, or 10 A (angstroms). 
Mass 

Length 
Biomolecule (long dimension, nm) Daltons Picograms 
Water 0.3 18 
Alanine 0.5 89 
Glucose 0.7 180 
Phospholipid 3.5 750 
Ribonuclease (a small protein) 4 12,600 
Immunoglobulin G (IgG) 14 150,000 
Myosin (a large muscle protein) 160 470,000 
Ribosome (bacteria) 18 2,520,000 
Bacteriophage @X174 (a very small bacterial virus) 25 4,700,000 
Pyruvate dehydrogenase complex (a multienzyme complex) 60 7,000,000 
Tobacco mosaic virus (a plant virus) 300 40,000,000 6.68 xX 107° 
Mitochondrion (liver) 1,500 1.5 
Escherichia coli cell 2,000 2 
Chloroplast (spinach leaf) 8,000 60 
Liver cell 20,000 8,000 








*Molecular mass is expressed in units of daltons (D) or kilodaltons (kD) in this book; alternatively, the dimensionless term molecular weight, symbolized by Mr, and defined as the ratio of the 
mass of a molecule to 1 dalton of mass, is used. 
tPrefixes used for powers of 10 are 


10° mega M 103 milli m 
10 kilo k 1076 micro u 
107! deci d 107° nano n 
1072? centi c 1071? pico p 
107° femto f 


really polymeric like other macromolecules; however, lipids are important contribu- 
tors to higher levels of complexity.) Interactions among macromolecules lead to the 
next level of structural organization, supramolecular complexes. Here, various mem- 
bers of one or more of the classes of macromolecules come together to form specific 
assemblies that serve important subcellular functions. Examples of these supramo- 
lecular assemblies are multifunctional enzyme complexes, ribosomes, chromosomes, 
and cytoskeletal elements. For example, a eukaryotic ribosome contains four differ- 
ent RNA molecules and at least 70 unique proteins. These supramolecular assemblies 
are an interesting contrast to their components because their structural integrity is 
maintained by noncovalent forces, not by covalent bonds. These noncovalent forces 
include hydrogen bonds, ionic attractions, van der Waals forces, and hydrophobic in- 
teractions between macromolecules. Such forces maintain these supramolecular as- 
semblies in a highly ordered functional state. Although noncovalent forces are weak 
(less than 40 kJ/mol), they are numerous in these assemblies and thus can collectively 
maintain the essential architecture of the supramolecular complex under conditions 
of temperature, pH, and ionic strength that are consistent with cell life. 


Organelles Represent a Higher Order in Biomolecular Organization 


The next higher rung in the hierarchical ladder is occupied by the organelles, en- 
tities of considerable dimensions compared with the cell itself. Organelles are 
found only in eukaryotic cells, that is, the cells of “higher” organisms (eukaryotic 
cells are described in Section 1.5). Several kinds, such as mitochondria and 
chloroplasts, evolved from bacteria that gained entry to the cytoplasm of early eu- 
karyotic cells. Organelles share two attributes: They are cellular inclusions, usually 
membrane bounded, and they are dedicated to important cellular tasks. Or- 
ganelles include the nucleus, mitochondria, chloroplasts, endoplasmic reticulum, 
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FIGURE 1.8 Molecular organization in the cell is a hierarchy. 
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Golgi apparatus, and vacuoles, as well as other relatively small cellular inclusions, 
such as peroxisomes, lysosomes, and chromoplasts. The nucleus is the repository 
of genetic information as contained within the linear sequences of nucleotides in 
the DNA of chromosomes. Mitochondria are the “power plants” of cells by virtue 
of their ability to carry out the energy-releasing aerobic metabolism of carbohy- 


1.4 How Do the Properties of Biomolecules Reflect Their Fitness to the Living Condition? 


drates and fatty acids, capturing the energy in metabolically useful forms such as 
ATP. Chloroplasts endow cells with the ability to carry out photosynthesis. They 
are the biological agents for harvesting light energy and transforming it into 
metabolically useful chemical forms. 


Membranes Are Supramolecular Assemblies That Define 
the Boundaries of Cells 


Membranes define the boundaries of cells and organelles. As such, they are not eas- 
ily classified as supramolecular assemblies or organelles, although they share the 
properties of both. Membranes resemble supramolecular complexes in their con- 
struction because they are complexes of proteins and lipids maintained by noncova- 
lent forces. Hydrophobic interactions are particularly important in maintaining mem- 
brane structure. Hydrophobic interactions arise because water molecules prefer to 
interact with each other rather than with nonpolar substances. The presence of non- 
polar molecules lessens the range of opportunities for water—water interaction by 
forcing the water molecules into ordered arrays around the nonpolar groups. Such 
ordering can be minimized if the individual nonpolar molecules redistribute from a 
dispersed state in the water into an aggregated organic phase surrounded by water. 
The spontaneous assembly of membranes in the aqueous environment where life 
arose and exists is the natural result of the hydrophobic (“water-fearing”) character of 
their lipids and proteins. Hydrophobic interactions are the creative means of mem- 
brane formation and the driving force that presumably established the boundary of 
the first cell. The membranes of organelles, such as nuclei, mitochondria, and chloro- 
plasts, differ from one another, with each having a characteristic protein and lipid 
composition tailored to the organelle’s function. Furthermore, the creation of dis- 
crete volumes or compartments within cells is not only an inevitable consequence 
of the presence of membranes but usually an essential condition for proper 
organellar function. 


The Unit of Life Is the Cell 


The cell is characterized as the unit of life, the smallest entity capable of displaying 
the attributes associated uniquely with the living state: growth, metabolism, stimu- 
lus response, and replication. In the previous discussions, we explicitly narrowed 
the infinity of chemical complexity potentially available to organic life and we pre- 
viewed an organizational arrangement, moving from simple to complex, that pro- 
vides interesting insights into the functional and structural plan of the cell. Never- 
theless, we find no obvious explanation within these features for the living 
characteristics of cells. Can we find other themes represented within biomolecules 
that are explicitly chemical yet anticipate or illuminate the living condition? 


i4 How Do the Properties of Biomolecules Reflect 
Their Fitness to the Living Condition? 


If we consider what attributes of biomolecules render them so fit as components of 
growing, replicating systems, several biologically relevant themes of structure and 
organization emerge. Furthermore, as we study biochemistry, we will see that these 
themes serve as principles of biochemistry. Prominent among them is the necessity 
Jor information and energy in the maintenance of the living state. Some biomolecules must 
have the capacity to contain the information, or “recipe,” of life. Other biomole- 
cules must have the capacity to translate this information so that the organized 
structures essential to life are synthesized. Interactions between these structures are 
the processes of life. An orderly mechanism for abstracting energy from the envi- 
ronment must also exist in order to obtain the energy needed to drive these 
processes. What properties of biomolecules endow them with the potential for such 
remarkable qualities? 
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Biological Macromolecules and Their Building Blocks Have 
a“Sense” or Directionality 


The macromolecules of cells are built of units—amino acids in proteins, nu- 
cleotides in nucleic acids, and carbohydrates in polysaccharides—that have struc- 
tural polarity. That is, these molecules are not symmetrical, and so they can be 
thought of as having a “head” and a “tail.” Polymerization of these units to form 
macromolecules occurs by head-to-tail linear connections. Because of this, the poly- 
mer also has a head and a tail, and hence, the macromolecule has a “sense” or di- 
rection to its structure (Figure 1.9). 


Biological Macromolecules Are Informational 


Because biological macromolecules have a sense to their structure, the sequential or- 
der of their component building blocks, when read along the length of the mole- 
cule, has the capacity to specify information in the same manner that the letters of 
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CENGAGENOW™ ACTIVE FIGURE 1.9 (a) Amino acids build proteins. (b) Polysaccharides are built 
by joining sugars together. (c) Nucleic acids are polymers of nucleotides. All these polymerization 
processes involve bond formations accompanied by the elimination of water (dehydration synthe- 
sis reactions). Test yourself on the concepts in this figure at www.cengage.com/login 
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A strand of DNA CENGAGENOW” ACTIVE FIGURE 1.10 The sequence of monomeric 


units in a biological polymer has the potential to contain information 
5 3 tive. Nucleic acids and proteins are information-rich molecules; poly- 


saccharides are not. Test yourself on the concepts in this figure at 
A polypeptide segment www.cengage.com/login 
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the alphabet can form words when arranged in a linear sequence (Figure 1.10). Not 
all biological macromolecules are rich in information. Polysaccharides are often 
composed of the same sugar unit repeated over and over, as in cellulose or starch, 
which are homopolymers of many glucose units. On the other hand, proteins and 
polynucleotides are typically composed of building blocks arranged in no obvious 
repetitive way; that is, their sequences are unique, akin to the letters and punctua- 
tion that form this descriptive sentence. In these unique sequences lies meaning. Dis- 
cerning the meaning, however, requires some mechanism for recognition. 


Biomolecules Have Characteristic Three-Dimensional Architecture 


The structure of any molecule is a unique and specific aspect of its identity. Mo- 
lecular structure reaches its pinnacle in the intricate complexity of biological macro- 
molecules, particularly the proteins. Although proteins are linear sequences of co- 
valently linked amino acids, the course of the protein chain can turn, fold, and coil 
in the three dimensions of space to establish a specific, highly ordered architecture 
that is an identifying characteristic of the given protein molecule (Figure 1.11). 


Weak Forces Maintain Biological Structure and Determine 
Biomolecular Interactions 


Covalent bonds hold atoms together so that molecules are formed. In contrast, 
weak chemical forces or noncovalent bonds (hydrogen bonds, van der Waals forces, 
ionic interactions, and hydrophobic interactions) are intramolecular or intermo- 
lecular attractions between atoms. None of these forces, which typically range from 
4 to 30 kJ/mol, are strong enough to bind free atoms together (Table 1.3). The av- 
erage kinetic energy of molecules at 25°C is 2.5 kJ/mol, so the energy of weak forces 








FIGURE 1.11 Antigen-binding domain of immunoglob- 
ulin G (IgG). 





TABLE 1.3 | he TH 
Strength Distance 

Force (kJ/mol) (nm) Description 

Van der Waals interactions 0.4-4.0 0.3-0.6 Strength depends on the relative size of the atoms or molecules and the 
distance between them. The size factor determines the area of contact 
between two molecules: The greater the area, the stronger the interaction. 

Hydrogen bonds 12-30 0.3 Relative strength is proportional to the polarity of the H bond donor and 
H bond acceptor. More polar atoms form stronger H bonds. 

Ionic interactions 20 0.25 Strength also depends on the relative polarity of the interacting charged 
species. Some ionic interactions are also H bonds: —NH3;* - - - OOC— 

Hydrophobic interactions <40 — Force is a complex phenomenon determined by the degree to which the 
structure of water is disordered as discrete hydrophobic molecules or 
molecular regions coalesce. 
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FIGURE 1.12 Van der Waals packing is en- 
hanced in molecules that are structurally com- 
plementary. Gln'?', a surface protuberance on 
lysozyme, is recognized by the antigen-binding 
site of an antibody against lysozyme. Gln!” 
(pink) fits nicely in a pocket formed by Tyr? 
(orange), Phe?! (light green), Trp”? (dark green), 
and Tyr?! (blue) components of the antibody. 
(See also Figure 1.16.) (a) Ball-and-stick model. 
(b) Space-filling representation. (From Amit, A.G, 


et al., 1986. Three-dimensional structure of an antigen- 


antibody complex at 2.8 A resolution. Science 
233:747-753, figure 5.) 
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FIGURE 1.13 The van der Waals interaction energy pro- 
file as a function of the distance, r, between the centers 


of two atoms. 
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is only several times greater than the dissociating tendency due to thermal motion 
of molecules. Thus, these weak forces create interactions that are constantly form- 
ing and breaking at physiological temperature, unless by cumulative number they 
impart stability to the structures generated by their collective action. These weak 
forces merit further discussion because their attributes profoundly influence the na- 
ture of the biological structures they build. 


Van der Waals Attractive Forces Play an Important Role 
in Biomolecular Interactions 


Van der Waals forces are the result of induced electrical interactions between 
closely approaching atoms or molecules as their negatively charged electron clouds 
fluctuate instantaneously in time. These fluctuations allow attractions to occur be- 
tween the positively charged nuclei and the electrons of nearby atoms. Van der 
Waals attractions operate only over a very limited interatomic distance (0.3 to 
0.6 nm) and are an effective bonding interaction at physiological temperatures only 
when a number of atoms in a molecule can interact with several atoms in a neigh- 
boring molecule. For this to occur, the atoms on interacting molecules must pack 
together neatly. That is, their molecular surfaces must possess a degree of structural 
complementarity (Figure 1.12). 

At best, van der Waals interactions are weak and individually contribute 0.4 to 
4.0 kJ/mol of stabilization energy. However, the sum of many such interactions within 
a macromolecule or between macromolecules can be substantial. Calculations indi- 
cate that the attractive van der Waals energy between the enzyme lysozyme and a 
sugar substrate that it binds is about 60 kJ/mol. 

When two atoms approach each other so closely that their electron clouds inter- 
penetrate, strong repulsive van der Waals forces occur, as shown in Figure 1.13. Be- 
tween the repulsive and attractive domains lies a low point in the potential curve. 
This low point defines the distance known as the van der Waals contact distance, 
which is the interatomic distance that results if only van der Waals forces hold two 
atoms together. The limit of approach of two atoms is determined by the sum of 
their van der Waals radii (Table 1.4). 


Hydrogen Bonds Are Important in Biomolecular Interactions 


Hydrogen bonds form between a hydrogen atom covalently bonded to an elec- 
tronegative atom (such as oxygen or nitrogen) and a second electronegative atom 
that serves as the hydrogen bond acceptor. Several important biological examples 
are given in Figure 1.14. Hydrogen bonds, at a strength of 12 to 30 kJ/mol, are 
stronger than van der Waals forces and have an additional property: H bonds are 
cylindrically symmetrical and tend to be highly directional, forming straight bonds 
between donor, hydrogen, and acceptor atoms. Hydrogen bonds are also more spe- 
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TABLE 1.4 
Atom 
Van der Waals Covalent Represented 
Atom Radius (nm) Radius (nm) to Scale 
H 0.1 0.037 


C 0.17 0.077 


N 0.15 0.070 


P 0.19 0.096 
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O 0.14 0.066 Oo 
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S 0.185 0.104 


Half-thickness 
of an aromatic 0.17 = 
ring 











cific than van der Waals interactions because they require the presence of com- 
plementary hydrogen donor and acceptor groups. 


lonic Interactions Ionic interactions are the result of attractive forces between op- 
positely charged structures, such as negative carboxyl groups and positive amino 
groups (Figure 1.15). These electrostatic forces average about 20 kJ/mol in aque- 
ous solutions. Typically, the electrical charge is radially distributed, so these inter- 
actions may lack the directionality of hydrogen bonds or the precise fit of van der 
Waals interactions. Nevertheless, because the opposite charges are restricted to ster- 
ically defined positions, ionic interactions can impart a high degree of structural 
specificity. 

The strength of electrostatic interactions is highly dependent on the nature of the 
interacting species and the distance, 7, between them. Electrostatic interactions may 
involve ions (species possessing discrete charges), permanent dipoles (having a per- 
manent separation of positive and negative charge), or induced dipoles (having a 
temporary separation of positive and negative charge induced by the environment). 


Hydrophobic Interactions Hydrophobic interactions result from the strong 
tendency of water to exclude nonpolar groups or molecules (see Chapter 2). Hy- 
drophobic interactions arise not so much because of any intrinsic affinity of non- 
polar substances for one another (although van der Waals forces do promote the 
weak bonding of nonpolar substances), but because water molecules prefer the 
stronger interactions that they share with one another, compared to their inter- 
action with nonpolar molecules. Hydrogen-bonding interactions between polar 
water molecules can be more varied and numerous if nonpolar molecules come 
together to form a distinct organic phase. This phase separation raises the entropy 
of water because fewer water molecules are arranged in orderly arrays around in- 
dividual nonpolar molecules. It is these preferential interactions between water 
molecules that “exclude” hydrophobic substances from aqueous solution and 
drive the tendency of nonpolar molecules to cluster together. Thus, nonpolar re- 
gions of biological macromolecules are often buried in the molecule’s interior to 
exclude them from the aqueous milieu. The formation of oil droplets as hy- 
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H bonds Approximate 
Bonded atoms bond length* 
O—H---O 0.27 nm 
O—H---O- 0.26 nm 
O—H===N 0.29 nm 
N=H=-=0 0.30 nm 
*N—H---O 0.29 nm 
N—H---N 0.31 nm 


*Lengths given are distances from the atom 
covalently linked to the H to the atom 
H bonded to the hydrogen: 


O—H---O 


k-0.27 nm- 


Functional groups that are important 
H-bond donors and acceptors: 
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CENGAGENOW”™ ANIMATED FIGURE 1.14 Some biolog- 
ically important H bonds. See this figure animated at 
www.cengage.com/login 


drophobic nonpolar lipid molecules coalesce in the presence of water is an ap- 
proximation of this phenomenon. These tendencies have important conse- 
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CENGAGENOW" ANIMATED FIGURE 1.15 Ionic bonds 
in biological molecules. See this figure animated at 
www.cengage.com/login 


Ligand: a molecule (or atom) that binds specif- 
cally to another molecule (from Latin ligare, to 
bind). 
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quences in the creation and maintenance of the macromolecular structures and 
supramolecular assemblies of living cells. 


The Defining Concept of Biochemistry Is “Molecular Recognition 
Through Structural Complementarity” 


Structural complementarity is the means of recognition in biomolecular interac- 
tions. The complicated and highly organized patterns of life depend on the ability 
of biomolecules to recognize and interact with one another in very specific ways. 
Such interactions are fundamental to metabolism, growth, replication, and other vi- 
tal processes. The interaction of one molecule with another, a protein with a metabo- 
lite, for example, can be most precise if the structure of one is complementary to the 
structure of the other, as in two connecting pieces of a puzzle or, in the more popu- 
lar analogy for macromolecules and their ligands, a lock and its key (Figure 1.16). 
This principle of structural complementarity is the very essence of biomolecular recognition. 
Structural complementarity is the significant clue to understanding the functional 
properties of biological systems. Biological systems from the macromolecular level to 
the cellular level operate via specific molecular recognition mechanisms based on 
structural complementarity: A protein recognizes its specific metabolite, a strand of 
DNA recognizes its complementary strand, sperm recognize an egg. All these inter- 
actions involve structural complementarity between molecules. 


Biomolecular Recognition Is Mediated by Weak Chemical Forces 


Weak chemical forces underlie the interactions that are the basis of biomolecular 
recognition. It is important to realize that because these interactions are sufficiently 
weak, they are readily reversible. Consequently, biomolecular interactions tend to 
be transient; rigid, static lattices of biomolecules that might paralyze cellular activi- 
ties are not formed. Instead, a dynamic interplay occurs between metabolites and 
macromolecules, hormones and receptors, and all the other participants instru- 
mental to life processes. This interplay is initiated upon specific recognition be- 
tween complementary molecules and ultimately culminates in unique physiological 
activities. Biological function is achieved through mechanisms based on structural 
complementarity and weak chemical interactions. 

This principle of structural complementarity extends to higher interactions es- 
sential to the establishment of the living condition. For example, the formation of 


Puzzle 


Lock and key 











FIGURE 1.16 Structural complementarity: the pieces of a puzzle, the lock and its key, a biological macromolecule 
and its ligand—an antigen-antibody complex. The antigen on the right (gold) is a small protein, lysozyme, from 
hen egg white. The antibody molecule (IgG) (left) has a pocket that is structurally complementary to a surface fea- 
ture (red) on the antigen. (See also Figure 1.12.) 
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Native Denatured 
protein protein 


supramolecular complexes occurs because of recognition and interaction between 
their various macromolecular components, as governed by the weak forces formed 
between them. If a sufficient number of weak bonds can be formed, as in macro- 
molecules complementary in structure to one another, larger structures assemble 
spontaneously. The tendency for nonpolar molecules and parts of molecules to 
come together through hydrophobic interactions also promotes the formation of 
supramolecular assemblies. Very complex subcellular structures are actually spon- 
taneously formed in an assembly process that is driven by weak forces accumulated 
through structural complementarity. 


Weak Forces Restrict Organisms to a Narrow Range 
of Environmental Conditions 


Because biomolecular interactions are governed by weak forces, living systems are re- 
stricted to a narrow range of physical conditions. Biological macromolecules are func- 
tionally active only within a narrow range of environmental conditions, such as tem- 
perature, ionic strength, and relative acidity. Extremes of these conditions disrupt the 
weak forces essential to maintaining the intricate structure of macromolecules. The 
loss of structural order in these complex macromolecules, so-called denaturation, is ac- 
companied by loss of function (Figure 1.17). As a consequence, cells cannot tolerate 
reactions in which large amounts of energy are released, nor can they generate a large 
energy burst to drive energy-requiring processes. Instead, such transformations take 
place via sequential series of chemical reactions whose overall effect achieves dramatic 
energy changes, even though any given reaction in the series proceeds with only mod- 
est input or release of energy (Figure 1.18). These sequences of reactions are orga- 
nized to provide for the release of useful energy to the cell from the breakdown of food 
or to take such energy and use it to drive the synthesis of biomolecules essential to the 
living state. Collectively, these reaction sequences constitute cellular metabolism—the 
ordered reaction pathways by which cellular chemistry proceeds and biological energy 
transformations are accomplished. 


Enzymes Catalyze Metabolic Reactions 


The sensitivity of cellular constituents to environmental extremes places another 
constraint on the reactions of metabolism. The rate at which cellular reactions pro- 
ceed is a very important factor in maintenance of the living state. However, the com- 
mon ways chemists accelerate reactions are not available to cells; the temperature 
cannot be raised, acid or base cannot be added, the pressure cannot be elevated, 
and concentrations cannot be dramatically increased. Instead, biomolecular cata- 
lysts mediate cellular reactions. These catalysts, called enzymes, accelerate the re- 
action rates many orders of magnitude and, by selecting the substances undergoing 


CENGAGENOW” ANIMATED FIGURE 1.17 Denaturation 
and renaturation of the intricate structure of a protein. 
See this figure animated at www.cengage.com/ 
login 


CENGAGENOW*” Go to CengageNOW at www 
.cengage.com/login and click BiochemistrylInteractive 
to explore the structure of immunoglobulin G, center- 
ing on the role of weak intermolecular forces in es- 
tablishing higher orders of structure. 
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The combustion of glucose: CeH1906 + 6 Op ———> 6 COs + 6 HO + 2870 kJ energy 


(a) In an aerobic cell 





6 CO; + 6 HO 





CENGAGENOW” ANIMATED FIGURE 1.19 Carbonic 
anhydrase, a representative enzyme. See this figure 


animated at www.cengage.com/login 


(b) In a bomb calorimeter 
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Glycolysis 
2 Pyruvate 2810 KJ 
energy 
as heat 











Citric acid cycle 
and oxidative 
phosphorylation 





30-38 ATP 6 COs + 6 HO 


CENGAGENOW” ACTIVE FIGURE 1.18 Metabolism is the organized release or capture of small amounts of 
energy in processes whose overall change in energy is large. (a) Cells can release the energy of glucose in a 
stepwise fashion and the small “packets” of energy appear in ATP. (b) Combustion of glucose in a bomb 
calorimeter results in an uncontrolled, explosive release of energy in its least useful form, heat. Test yourself 
on the concepts in this figure at www.cengage.com/login 


reaction, determine the specific reaction that takes place. Virtually every metabolic 
reaction is catalyzed by an enzyme (Figure 1.19). 


Metabolic Regulation Is Achieved by Controlling the Activity of Enzymes Thou- 
sands of reactions mediated by an equal number of enzymes are occurring at any 
given instant within the cell. Collectively, these reactions constitute cellular metab- 
olism. Metabolism has many branch points, cycles, and interconnections, as subse- 
quent chapters reveal. All these reactions, many of which are at apparent cross- 
purposes in the cell, must be fine-tuned and integrated so that metabolism and life 
proceed harmoniously. The need for metabolic regulation is obvious. This meta- 
bolic regulation is achieved through controls on enzyme activity so that the rates of 
cellular reactions are appropriate to cellular requirements. 

Despite the organized pattern of metabolism and the thousands of enzymes re- 
quired, cellular reactions nevertheless conform to the same thermodynamic princi- 
ples that govern any chemical reaction. Enzymes have no influence over energy 
changes (the thermodynamic component) in their reactions. Enzymes only influ- 
ence reaction rates. Thus, cells are systems that take in food, release waste, and carry 
out complex degradative and biosynthetic reactions essential to their survival while 
operating under conditions of essentially constant temperature and pressure and 
maintaining a constant internal environment (homeostasis) with no outwardly ap- 
parent changes. Cells are open thermodynamic systems exchanging matter and energy with 
their environment and functioning as highly regulated isothermal chemical engines. 


The Time Scale of Life 


Individual organisms have life spans ranging from a day or less to a century or more, 
but the phenomena that characterize and define living systems have durations rang- 
ing over 33 orders of magnitude, from 107" sec (electron transfer reactions, photo- 
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TABLE 1.5 | 

Time (sec) Process Example 

107 Electron transfer The light reactions in photosynthesis 

10-8 Transition states Transition states in chemical reactions have lifetimes of 107"! to 107” sec (the reciprocal 
of the frequency of bond vibrations) 

jor" H-bond lifetimes H bonds are exchanged between HO molecules due to the rotation of the water 
molecules themselves 

107'* to 10° Motion in proteins Fast: tyrosine ring flips, methyl group rotations 
Slow: bending motions between protein domains 

10° to 10° Enzyme catalysis 10° sec: fast enzyme reactions 
107? sec: typical enzyme reactions 
10° sec: slow enzyme reactions 

10° Diffusion in membranes A typical membrane lipid molecule can diffuse from one end of a bacterial cell to the 
other in 1 sec; a small protein would go half as far 

10! to 10? Protein synthesis Some ribosomes synthesize proteins at a rate of 20 amino acids added per second 

104 to 10° Cell division Prokaryotic cells can divide as rapidly as every hour or so; eukaryotic cell division varies 
greatly (from hours to years) 

107 to 108 Embryonic development Human embryonic development takes 9 months (2.4 X 108 sec) 

10° to 10° Life span Human life expectancy is 77.6 years (about 2.5 X 10° sec) 

1018 Evolution The first organisms appeared 3.5 X 10° years ago and evolution has continued since then 








excitation in photosynthesis) to 10!8 sec (the period of evolution, spanning from 
the first appearance of organisms on the earth more than 3 billion years ago to to- 
day) (Table 1.5). Because proteins are the agents of biological function, phenom- 
ena involving weak interactions and proteins dominate the shorter times. As time 
increases, more stable interactions (covalent bonds) and phenomena involving the 
agents of genetic information (the nucleic acids) come into play. 


‘Teds What Is the Organization and Structure of Cells? 


All living cells fall into one of three broad categories—Archaea, Bacteria and Eu- 
karya. Archaea and bacteria are referred to collectively as prokaryotes. As a group, 
prokaryotes are single-celled organisms that lack nuclei and other organelles; the 
word is derived from pro meaning “prior to” and karyot meaning “nucleus.” In con- 
ventional biological classification schemes, prokaryotes are grouped together as 
members of the kingdom Monera. The other four living kingdoms are all Eukarya— 
the single-celled Protists, such as amoebae, and all multicellular life forms, including 
the Fungi, Plant, and Animal kingdoms. Eukaryotic cells have true nuclei and other 
organelles such as mitochondria, with the prefix eu meaning “true.” 


The Evolution of Early Cells Gave Rise to Eubacteria, Archaea, 
and Eukaryotes 


For a long time, most biologists believed that eukaryotes evolved from the simpler 
prokaryotes in some linear progression from simple to complex over the course of 
geological time. However, contemporary evidence favors the view that present-day 
organisms are better grouped into the three classes mentioned: eukarya, bacteria, 
and archaea. All are believed to have evolved approximately 3.5 billion years ago 
from an ancestral communal gene pool shared among primitive cellular entities. 
Furthermore, contemporary eukaryotic cells are, in reality, composite cells that har- 
bor various bacterial contributions. 

Despite great diversity in form and function, cells and organisms share much bio- 
chemistry in common. This commonality and diversity has been substantiated by 
the results of whole genome sequencing, the determination of the complete nu- 
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Gene is a unit of hereditary information, physi- 
cally defined by a specific sequence of nucleo- 
tides in DNA; in molecular terms, a gene is a 
nucleotide sequence that encodes a protein or 
RNA product. 


cleotide sequence within the DNA of an organism. For example, the genome of the 
metabolically divergent archaea Methanococcus jannaschii shows 44% similarity to 
known genes in eubacteria and eukaryotes, yet 56% of its genes are new to science. 


How Many Genes Does a Cell Need? 


The genome of the Mycoplasma genitalium consists of 523 genes, encoding 484 pro- 
teins, in just 580,074 base pairs (Table 1.6). This information sparks an interesting 
question: How many genes are needed for cellular life? Any minimum gene set must 
encode all the information necessary for cellular metabolism, including the vital 
functions essential to reproduction. The simplest cell must show at least (1) some 
degree of metabolism and energy production; (2) genetic replication based on a 
template molecule that encodes information (DNA or RNA?); and (3) formation 
and maintenance of a cell boundary (membrane). Top-down studies aim to discover 
from existing cells what a minimum gene set might be. These studies have focused 
on simple parasitic bacteria, because parasites often obtain many substances from 
their hosts and do not have to synthesize them from scratch; thus, they require 
fewer genes. One study concluded that 206 genes are sufficient to form a minimum 
gene set. The set included genes for DNA replication and repair, transcription, 
translation, protein processing, cell division, membrane structure, nutrient trans- 
port, metabolic pathways for ATP synthesis, and enzymes to make a small number 
of metabolites that might not be available, such as pentoses for nucleotides. Yet 
another study based on computer modeling decided that a minimum gene set 
might have only 105 protein-coding genes. Bottom-up studies aim to create a mini- 
mal cell by reconstruction based on known cellular components. At this time, no 
such bottom-up creation of an artificial cell has been reported. The simplest func- 
tional artificial cell capable of replication would contain an informational macro- 
molecule (presumably a nucleic acid) and enough metabolic apparatus to maintain 
a basic set of cellular components within a membranelike boundary. 








Number of Cells 





Organism in Adult* Number of Genes 
Mycobacterium genitalium 1 523 
Pathogenic bacterium 

Methanococcus jannaschii 1 1,800 
Archaeal methanogen 

Escherichia coli K12 1 4,400 
Intestinal bacterium 

Saccharomyces cereviseae 1 6,000 
Baker’s yeast (eukaryote) 

Caenorhabditis elegans 959 19,000 
Nematode worm 

Drosophila melanogaster 104 13,500 

Fruit fly 

Arabidopsis thaliana 107 27,000 
Flowering plant 

Fugu rubripes 10!” 38,000 (est.) 
Pufferfish 

Homo sapiens 10!4 20,500 (est.) 
Human 











The first four of the nine organisms in the table are single-celled microbes; the last six are eukaryotes; the last five are 
multicellular, four of which are animals; the final two are vertebrates. Although pufferfish and humans have roughly the 
same number of genes, the pufferfish genome, at 0.365 billion nucleotide pairs, is only one-eighth the size of the human 
genome. 

*Numbers for Arabidopsis thaliana, the pufferfish, and human are “order-of-magnitude” rough estimates. 
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E. coli bacteria 


FIGURE 1.20 This bacterium is Escherichia coli, a member of the coliform group of bacteria that colonize the 
intestinal tract of humans. (See Table 1.7.) (Photo, Martin Rotker/Phototake, Inc; inset photo, David M. Phillips/The Popula- 
tion Council/Science Source/Photo Researchers, Inc.) 


Archaea and Bacteria Have a Relatively Simple Structural Organization 


The bacteria form a widely spread group. Certain of them are pathogenic to hu- 
mans. The archaea, about which we know less, are remarkable because they can be 
found in unusual environments where other cells cannot survive. Archaea include 
the thermoacidophiles (heat- and acid-loving bacteria) of hot springs, the 
halophiles (salt-loving bacteria) of salt lakes and ponds, and the methanogens (bac- 
teria that generate methane from CO» and Hg). Prokaryotes are typically very small, 
on the order of several microns in length, and are usually surrounded by a rigid cell 
wall that protects the cell and gives it its shape. The characteristic structural orga- 
nization of one of these cells is depicted in Figure 1.20. 

Prokaryotic cells have only a single membrane, the plasma membrane or cell 
membrane. Because they have no other membranes, prokaryotic cells contain no 
nucleus or organelles. Nevertheless, they possess a distinct nuclear area where a sin- 
gle circular chromosome is localized, and some have an internal membranous struc- 
ture called a mesosome that is derived from and continuous with the cell mem- 
brane. Reactions of cellular respiration are localized on these membranes. In 
cyanobacteria, flat, sheetlike membranous structures called lamellae are formed 
from cell membrane infoldings. These lamellae are the sites of photosynthetic ac- 
tivity, but they are not contained within plastids, the organelles of photosynthesis 
found in higher plant cells. Prokaryotic cells also lack a cytoskeleton; the cell wall 
maintains their structure. Some bacteria have flagella, single, long filaments used 
for motility. Prokaryotes largely reproduce by asexual division, although sexual ex- 
changes can occur. Table 1.7 lists the major features of bacterial cells. 


The Structural Organization of Eukaryotic Cells Is More Complex 
Than That of Prokaryotic Cells 


Compared with prokaryotic cells, eukaryotic cells are much greater in size, typically 
having cell volumes 10° to 10* times larger. They are also much more complex. 
These two features require that eukaryotic cells partition their diverse metabolic 
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TABLE 1.7 | 


Structure 


Molecular Composition 





Function 





Cell wall 


Cell membrane 


Nuclear area or nucleoid 


Ribosomes 


Storage granules 


Cytosol 





Peptidoglycan: a rigid framework of polysaccharide 
crosslinked by short peptide chains. Some bacteria 
possess a lipopolysaccharide- and protein-rich outer 
membrane. 

The cell membrane is composed of about 

45% lipid and 55% protein. The lipids form a 
bilayer that is a continuous nonpolar hydrophobic 
phase in which the proteins are embedded. 


The genetic material is a single, tightly coiled DNA 
molecule 2 nm in diameter but more than 1 mm 
in length (molecular mass of E. coli DNA is 3 X 10° 
daltons; 4.64 X 10° nucleotide pairs). 


Bacterial cells contain about 15,000 ribosomes. 
Each is composed of a small (30S) subunit and a 
large (50S) subunit. The mass of a single ribosome 
is 2.3 X 10° daltons. It consists of 65% RNA and 
35% protein. 

Bacteria contain granules that represent storage 
forms of polymerized metabolites such as sugars or 
B-hydroxybutyric acid. 

Despite its amorphous appearance, the cytosol is an 
organized gelatinous compartment that is 20% 
protein by weight and rich in the organic molecules 
that are the intermediates in metabolism. 


Mechanical support, shape, and protection against 
swelling in hypotonic media. The cell wall is a 
porous nonselective barrier that allows most small 
molecules to pass. 


The cell membrane is a highly selective perme- 
ability barrier that controls the entry of most sub- 
stances into the cell. Important enzymes in the 
generation of cellular energy are located in the 
membrane. 

DNA provides the operating instructions for the 
cell; it is the repository of the cell’s genetic infor- 
mation. During cell division, each strand of the 
double-stranded DNA molecule is replicated to 
yield two double-helical daughter molecules. 
Messenger RNA (mRNA) is transcribed from 
DNA to direct the synthesis of cellular proteins. 


Ribosomes are the sites of protein synthesis. The 
mRNA binds to ribosomes, and the mRNA 
nucleotide sequence specifies the protein that is 
synthesized. 


When needed as metabolic fuel, the monomeric 
units of the polymer are liberated and degraded 
by energy-yielding pathways in the cell. 

The cytosol is the site of intermediary metabo- 
lism, the interconnecting sets of chemical reac- 
tions by which cells generate energy and form the 
precursors necessary for biosynthesis of macro- 
molecules essential to cell growth and function. 








processes into organized compartments, with each compartment dedicated to a par- 
ticular function. A system of internal membranes accomplishes this partitioning. A 
typical animal cell is shown in Figure 1.21 and a typical plant cell in Figure 1.22. 
Tables 1.8 and 1.9 list the major features of a typical animal cell and a higher plant 
cell, respectively. 

Eukaryotic cells possess a discrete, membrane-bounded nucleus, the repository 
of the cell’s genetic material, which is distributed among a few or many chromo- 
somes. During cell division, equivalent copies of this genetic material must be 
passed to both daughter cells through duplication and orderly partitioning of the 
chromosomes by the process known as mitosis. Like prokaryotic cells, eukaryotic 
cells are surrounded by a plasma membrane. Unlike prokaryotic cells, eukaryotic 
cells are rich in internal membranes that are differentiated into specialized struc- 
tures such as the endoplasmic reticulum (ER) and the Golgi apparatus. Membranes 
also surround certain organelles (mitochondria and chloroplasts, for example) and 
various vesicles, including vacuoles, lysosomes, and peroxisomes. The common 
purpose of these membranous partitionings is the creation of cellular compart- 
ments that have specific, organized metabolic functions, such as the mitochon- 
drion’s role as the principal site of cellular energy production. Eukaryotic cells also 
have a cytoskeleton composed of arrays of filaments that give the cell its shape and 
its capacity to move. Some eukaryotic cells also have long projections on their 
surface—cilia or flagella—which provide propulsion. 
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FIGURE 1.21 This figure diagrams a rat liver cell, a typical higher animal cell. 


16) What Are Viruses? 


Viruses are supramolecular complexes of nucleic acid, either DNA or RNA, en- 
capsulated in a protein coat and, in some instances, surrounded by a membrane 
envelope (Figure 1.23). Viruses are acellular, but they act as cellular parasites in 
order to reproduce. The bits of nucleic acid in viruses are, in reality, mobile ele- 
ments of genetic information. The protein coat serves to protect the nucleic acid 
and allows it to gain entry to the cells that are its specific hosts. Viruses unique for 
all types of cells are known. Viruses infecting bacteria are called bacteriophages 
(“bacteria eaters”); different viruses infect animal cells and plant cells. Once the 
nucleic acid of a virus gains access to its specific host, it typically takes over the 
metabolic machinery of the host cell, diverting it to the production of virus parti- 
cles. The host metabolic functions are subjugated to the synthesis of viral nucleic 
acid and proteins. Mature virus particles arise by encapsulating the nucleic acid 
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FIGURE 1.22 This figure diagrams a cell in the leaf of a higher plant. The cell wall, membrane, nucleus, chloro- 
plasts, mitochondria, vacuole, endoplasmic reticulum (ER), and other characteristic features are shown. 


within a protein coat called the capsid. Thus, viruses are supramolecular assem- 
blies that act as parasites of cells (Figure 1.24). 

Often, viruses cause disintegration of the cells that they have infected, a process 
referred to as cell lysis. It is their cytolytic properties that are the basis of viral dis- 
ease. In certain circumstances, the viral genetic elements may integrate into the 
host chromosome and become quiescent. Such a state is termed lysogeny. Typically, 
damage to the host cell activates the replicative capacities of the quiescent viral nu- 
cleic acid, leading to viral propagation and release. Some viruses are implicated in 
transforming cells into a cancerous state, that is, in converting their hosts to an un- 
regulated state of cell division and proliferation. Because all viruses are heavily de- 
pendent on their host for the production of viral progeny, viruses must have evolved 
after cells were established. Presumably, the first viruses were fragments of nucleic 
acid that developed the ability to replicate independently of the chromosome and 
then acquired the necessary genes enabling protection, autonomy, and transfer be- 
tween cells. 
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TABLE 1.8 | 

Structure Molecular Composition Function 

Extracellular The surfaces of animal cells are covered with a flexible This complex coating is cell specific, serves in cell- 
matrix and sticky layer of complex carbohydrates, proteins, and cell recognition and communication, creates cell 


Cell membrane 
(plasma 
membrane) 


Nucleus 


Endoplasmic 
reticulum (ER) 
and ribosomes 


Golgi apparatus 


Mitochondria 


Lysosomes 


Peroxisomes 


Cytoskeleton 


lipids. 
Roughly 50:50 lipid: protein as a 5-nm-thick continuous 


sheet of lipid bilayer in which a variety of proteins are 
embedded. 


The nucleus is separated from the cytosol by a double 
membrane, the nuclear envelope. The DNA is com- 
plexed with basic proteins (histones) to form chromatin 
fibers, the material from which chromosomes are made. 
A distinct RNA-rich region, the nucleolus, is the site of 
ribosome assembly. 


Flattened sacs, tubes, and sheets of internal membrane 
extending throughout the cytoplasm of the cell and en- 
closing a large interconnecting series of volumes called 
cisternae. The ER membrane is continuous with the outer 
membrane of the nuclear envelope. Portions of the sheet- 
like areas of the ER are studded with ribosomes, giving 
rise to rough ER. Eukaryotic ribosomes are larger than 
prokaryotic ribosomes. 


The Golgi is an asymmetrical system of flattened membrane- 
bounded vesicles often stacked into a complex. The face of 
the complex nearest the ER is the cis face; that most distant 
from the ER is the trans face. Numerous small vesicles found 
peripheral to the trans face of the Golgi contain secretory 
material packaged by the Golgi. 

Mitochondria are organelles surrounded by two mem- 
branes that differ markedly in their protein and lipid com- 
position. The inner membrane and its interior volume— 
the matrix—contain many important enzymes of energy 
metabolism. Mitochondria are about the size of bacteria, 
=] um. Cells contain hundreds of mitochondria, which 
collectively occupy about one-fifth of the cell volume. 
Lysosomes are vesicles 0.2-0.5 um in diameter, bounded 
by a single membrane. They contain hydrolytic enzymes 
such as proteases and nucleases that act to degrade cell 
constituents targeted for destruction. They are formed as 
membrane vesicles budding from the Golgi apparatus. 


Like lysosomes, peroxisomes are 0.2—0.5 um, single- 
membrane-—bounded vesicles. They contain a variety of 
oxidative enzymes that use molecular oxygen and gener- 
ate peroxides. They are also formed from membrane vesi- 
cles budding from the smooth ER. 


The cytoskeleton is composed of a network of protein 
filaments: actin filaments (or microfilaments), 7 nm in 
diameter; intermediate filaments, 8-10 nm; and micro- 
tubules, 25 nm. These filaments interact in establishing 
the structure and functions of the cytoskeleton. This in- 
teracting network of protein filaments gives structure and 
organization to the cytoplasm. 


adhesion, and provides a protective outer layer. 

The plasma membrane is a selectively permeable 
outer boundary of the cell, containing specific 
systems—pumps, channels, transporters, receptors— 
for the exchange of materials with the environment 
and the reception of extracellular information. Impor- 
tant enzymes are also located here. 

The nucleus is the repository of genetic information 
encoded in DNA and organized into chromosomes. 
During mitosis, the chromosomes are replicated and 
transmitted to the daughter cells. The genetic informa- 
tion of DNA is transcribed into RNA in the nucleus and 
passes into the cytosol, where it is translated into pro- 
tein by ribosomes. 

The endoplasmic reticulum is a labyrinthine organelle 
where both membrane proteins and lipids are synthe- 
sized. Proteins made by the ribosomes of the rough 
ER pass through the ER membrane into the cisternae 
and can be transported via the Golgi to the periphery 
of the cell. Other ribosomes unassociated with the ER 
carry on protein synthesis in the cytosol. The nuclear 
membrane, ER, Golgi, and additional vesicles are all 
part of a continuous endomembrane system. 

Involved in the packaging and processing of macro- 
molecules for secretion and for delivery to other 
cellular compartments. 


Mitochondria are the power plants of eukaryotic cells 
where carbohydrates, fats, and amino acids are oxi- 
dized to CO, and H0O. The energy released is 
trapped as high-energy phosphate bonds in ATP. 


Lysosomes function in intracellular digestion of 
materials entering the cell via phagocytosis or pino- 
cytosis. They also function in the controlled degrada- 
tion of cellular components. Their internal pH is 
about 5, and the hydrolytic enzymes they contain work 
best at this pH. 

Peroxisomes act to oxidize certain nutrients, such as 
amino acids. In doing so, they form potentially toxic 
hydrogen peroxide, HyOs, and then decompose it to 
HO and Oz by way of the peroxide-cleaving enzyme 
catalase. 

The cytoskeleton determines the shape of the cell and 
gives it its ability to move. It also mediates the internal 
movements that occur in the cytoplasm, such as the 
migration of organelles and mitotic movements of 
chromosomes. The propulsion instruments of cells— 
cilia and flagella—are constructed of microtubules. 
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Structure 


Molecular Composition 


Function 





Cell wall 


Cell membrane 


Nucleus 


Endoplasmic reticulum, 
Golgi apparatus, 
ribosomes, lysosomes, 
peroxisomes, and 
cytoskeleton 


Chloroplasts 


Mitochondria 


Vacuole 





Cellulose fibers embedded in a polysaccharide/ 
protein matrix; it is thick (>0.1 wm), rigid, and 
porous to small molecules. 


Plant cell membranes are similar in overall struc- 
ture and organization to animal cell membranes 
but differ in lipid and protein composition. 


The nucleus, nucleolus, and nuclear envelope of 
plant cells are like those of animal cells. 


Plant cells also contain all of these characteristic 
eukaryotic organelles, essentially in the form 
described for animal cells. 


Chloroplasts have a double-membrane envelope, an 
inner volume called the stroma, and an internal 
membrane system rich in thylakoid membranes, 
which enclose a third compartment, the thylakoid 
lumen. Chloroplasts are significantly larger than 
mitochondria. Other plastids are found in special- 
ized structures such as fruits, flower petals, and roots 
and have specialized roles. 


Plant cell mitochondria resemble the mitochondria 
of other eukaryotes in form and function. 


The vacuole is usually the most obvious compart- 
ment in plant cells. It is a very large vesicle en- 
closed by a single membrane called the tonoplast. 
Vacuoles tend to be smaller in young cells, but in 
mature cells, they may occupy more than 50% of 
the cell’s volume. Vacuoles occupy the center of the 
cell, with the cytoplasm being located peripherally 
around it. They resemble the lysosomes of animal 
cells. 


Protection against osmotic or mechanical rupture. 
The walls of neighboring cells interact in cement- 
ing the cells together to form the plant. Channels 
for fluid circulation and for cell-cell communica- 
tion pass through the walls. The structural material 
confers form and strength on plant tissue. 

The plasma membrane of plant cells is selectively 
permeable, containing transport systems for the 
uptake of essential nutrients and inorganic ions. A 
number of important enzymes are localized here. 
Chromosomal organization, DNA replication, tran- 
scription, ribosome synthesis, and mitosis in plant 
cells are generally similar to the analogous features 
in animals. 

These organelles serve the same purposes in plant 
cells that they do in animal cells. 


Chloroplasts are the site of photosynthesis, the 
reactions by which light energy is converted to 
metabolically useful chemical energy in the form of 
ATP. These reactions occur on the thylakoid mem- 
branes. The formation of carbohydrate from CO», 
takes place in the stroma. Oxygen is evolved during 
photosynthesis. Chloroplasts are the primary 
source of energy in the light. 

Plant mitochondria are the main source of energy 
generation in photosynthetic cells in the dark and 
in nonphotosynthetic cells under all conditions. 


Vacuoles function in transport and storage of 
nutrients and cellular waste products. By accumu- 
lating water, the vacuole allows the plant cell to 
grow dramatically in size with no increase in cyto- 
plasmic volume. 
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FIGURE 1.23 Viruses are genetic elements enclosed in a protein coat. Viruses are not free-living organisms and 
can reproduce only within cells. Viruses show an almost absolute specificity for their particular host cells, infect- 
ing and multiplying only within those cells. Viruses are known for virtually every kind of cell. Shown here are 
examples of (a) an animal virus, adenovirus; (b) bacteriophage T; on E. coli; and (c) a plant virus, tobacco 


mosaic virus. 


© Science Source/Photo Researchers, Inc. 
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CENGAGENOW” ACTIVE FIGURE 1.24 The virus life cycle. Viruses are mobile bits of genetic information 
encapsulated in a protein coat. The genetic material can be either DNA or RNA. Once this genetic material 
gains entry to its host cell, it takes over the host machinery for macromolecular synthesis and subverts it to the 
synthesis of viral-specific nucleic acids and proteins. These virus components are then assembled into mature 
virus particles that are released from the cell. Often, this parasitic cycle of virus infection leads to cell death and 


disease. Test yourself on the concepts in this figure at www.cengage.com/login 





SUMMARY 


1.1 What Are the Distinctive Properties of Living Systems? Living sys- 
tems display an astounding array of activities that collectively constitute 
growth, metabolism, response to stimuli, and replication. In accord with 
their functional diversity, living organisms are complicated and highly or- 
ganized entities composed of many cells. In turn, cells possess subcellular 
structures known as organelles, which are complex assemblies of very 
large polymeric molecules, or macromolecules. The monomeric units of 
macromolecules are common organic molecules (metabolites). Biologi- 
cal structures play a role in the organism’s existence. From parts of or- 
ganisms, such as limbs and organs, down to the chemical agents of me- 
tabolism, such as enzymes and metabolic intermediates, a biological 
purpose can be given for each component. Maintenance of the highly 
organized structure and activity of living systems requires energy that 
must be obtained from the environment. Energy is required to create and 
maintain structures and to carry out cellular functions. In terms of the ca- 
pacity of organisms to self-replicate, the fidelity of selfreplication resides 
ultimately in the chemical nature of DNA, the genetic material. 


1.2 What Kinds of Molecules Are Biomolecules? C, H, N, and O are 
among the lightest elements capable of forming covalent bonds 
through electron-pair sharing. Because the strength of covalent bonds 
is inversely proportional to atomic weight, H, C, N, and O form the 
strongest covalent bonds. Two properties of carbon covalent bonds 
merit attention: the ability of carbon to form covalent bonds with itself 
and the tetrahedral nature of the four covalent bonds when carbon 
atoms form only single bonds. Together these properties hold the po- 
tential for an incredible variety of structural forms, whose diversity is 
multiplied further by including N, O, and H atoms. 


1.3 What Is the Structural Organization of Complex Biomolecules? 
Biomolecules are built according to a structural hierarchy: Simple mol- 
ecules are the units for building complex structures. HO, COs, NH4*, 
NOs", and No are the inorganic precursors for the formation of simple 
organic compounds from which metabolites are made. These metabo- 
lites serve as intermediates in cellular energy transformation and as 
building blocks (amino acids, sugars, nucleotides, fatty acids, and glyc- 
erol) for lipids and for macromolecular synthesis (synthesis of proteins, 
polysaccharides, DNA, and RNA). The next higher level of structural or- 
ganization is created when macromolecules come together through 


noncovalent interactions to form supramolecular complexes, such as 
multifunctional enzyme complexes, ribosomes, chromosomes, and cyto- 
skeletal elements. 

The next higher rung in the hierarchical ladder is occupied by the 
organelles. Organelles are membrane-bounded cellular inclusions ded- 
icated to important cellular tasks, such as the nucleus, mitochondria, 
chloroplasts, endoplasmic reticulum, Golgi apparatus, and vacuoles, as 
well as other relatively small cellular inclusions. At the apex of the bio- 
molecular hierarchy is the cell, the unit of life, the smallest entity dis- 
playing those attributes associated uniquely with the living state— 
growth, metabolism, stimulus response, and replication. 


1.4 How Do the Properties of Biomolecules Reflect Their Fitness to the 
Living Condition? Some biomolecules carry the information of life; 
others translate this information so that the organized structures essen- 
tial to life are formed. Interactions between such structures are the 
processes of life. Properties of biomolecules that endow them with the 
potential for creating the living state include the following: Biological 
macromolecules and their building blocks have directionality, and thus 
biological macromolecules are informational; in addition, biomolecules 
have characteristic three-dimensional architectures, providing the 
means for molecular recognition through structural complementarity. 
Weak forces (H bonds, van der Waals interactions, ionic attractions, and 
hydrophobic interactions) mediate the interactions between biological 
molecules and, as a consequence, restrict organisms to the narrow 
range of environmental conditions where these forces operate. 


1.5 What Is the Organization and Structure of Cells? All cells share a 
common ancestor and fall into one of two broad categories—prokaryotic 
and eukaryotic—depending on whether the cell has a nucleus. Prokary- 
otes are typically single-celled organisms and have a rather simple cellu- 
lar organization. In contrast, eukaryotic cells are structurally more com- 
plex, having organelles and various subcellular compartments defined 
by membranes. Other than the Protists, eukaryotes are multicellular. 


1.6 What Are Viruses? Viruses are supramolecular complexes of nu- 
cleic acid encapsulated in a protein coat and, in some instances, sur- 
rounded by a membrane envelope. Viruses are not alive; they are not 
even cellular. Instead, they are packaged bits of genetic material that 
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can parasitize cells in order to reproduce. Often, they cause disintegra- 
tion, or lysis, of the cells they’ve infected. It is these cytolytic properties 
that are the basis of viral disease. In certain circumstances, the viral nu- 
cleic acid may integrate into the host chromosome and become quies- 
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cent, creating a state known as lysogeny. If the host cell is damaged, the 
replicative capacities of the quiescent viral nucleic acid may be acti- 
vated, leading to viral propagation and release. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create you own study path for this 
chapter at www.cengage.com/login 


1. The nutritional requirements of Escherichia coli cells are far simpler 
than those of humans, yet the macromolecules found in bacteria 
are about as complex as those of animals. Because bacteria can 
make all their essential biomolecules while subsisting on a simpler 
diet, do you think bacteria may have more biosynthetic capacity and 
hence more metabolic complexity than animals? Organize your 
thoughts on this question, pro and con, into a rational argument. 


2. Without consulting the figures in this chapter, sketch the charac- 
teristic prokaryotic and eukaryotic cell types and label their perti- 


nent organelle and membrane systems. 


3. Escherichia coli cells are about 2 wm (microns) long and 0.8 wm in 


diameter. 


a. How many E. coli cells laid end to end would fit across the diame- 


ter of a pinhead? (Assume a pinhead diameter of 0.5 mm.) 


b. What is the volume of an E. coli cell? (Assume it is a cylinder, with 


the volume of a cylinder given by V= 7 h, where 7 = 3.14.) 


c. What is the surface area of an £E. coli cell? What is the surface-to- 


volume ratio of an E. coli cell? 


d. Glucose, a major energy-yielding nutrient, is present in bacterial 
cells at a concentration of about 1 mM. What is the concentration 
of glucose, expressed as mg/mL? How many glucose molecules 
are contained in a typical E. coli cell? (Recall that Avogadro’s 


number = 6.023 X 10%.) 


e. A number of regulatory proteins are present in E. coli at only one 
or two molecules per cell. If we assume that an E. coli cell contains 
just one molecule of a particular protein, what is the molar con- 
centration of this protein in the cell? If the molecular weight of 
this protein is 40 kD, what is its concentration, expressed as 


mg/mL? 


f. An E. coli cell contains about 15,000 ribosomes, which carry out 
protein synthesis. Assuming ribosomes are spherical and have a di- 
ameter of 20 nm (nanometers), what fraction of the E. coli cell vol- 


ume is occupied by ribosomes? 


g. The E. coli chromosome is a single DNA molecule whose mass 
is about 3 X 10° daltons. This macromolecule is actually a linear 
array of nucleotide pairs. The average molecular weight of a 
nucleotide pair is 660, and each pair imparts 0.34 nm to the 
length of the DNA molecule. What is the total length of the 
E. colichromosome? How does this length compare with the over- 
all dimensions of an E. coli cell? How many nucleotide pairs does 
this DNA contain? The average E. coli protein is a linear chain of 
360 amino acids. If three nucleotide pairs in a gene encode one 
amino acid in a protein, how many different proteins can the 
E. colichromosome encode? (The answer to this question is a rea- 
sonable approximation of the maximum number of different 


kinds of proteins that can be expected in bacteria.) 


4. Assume that mitochondria are cylinders 1.5 wm in length and 


0.6 um in diameter. 
a. What is the volume of a single mitochondrion? 


b. Oxaloacetate is an intermediate in the citric acid cycle, an im- 
portant metabolic pathway localized in the mitochondria of 
eukaryotic cells. The concentration of oxaloacetate in mito- 
chondria is about 0.03 uM. How many molecules of oxaloacetate 


are in a single mitochondrion? 


5. 


6. 


2 


10 


11. 
12. 


Assume that liver cells are cuboidal in shape, 20 wm on a side. 

a. How many liver cells laid end to end would fit across the diameter 
of a pinhead? (Assume a pinhead diameter of 0.5 mm.) 

b. What is the volume of a liver cell? (Assume it is a cube.) 

c. What is the surface area of a liver cell? What is the surface- 
to-volume ratio of a liver cell? How does this compare to the 
surface-to-volume ratio of an E. coli cell (compare this answer with 
that of problem 3c)? What problems must cells with low surface- 
to-volume ratios confront that do not occur in cells with high 
surface-to-volume ratios? 

d. A human liver cell contains two sets of 23 chromosomes, each 
set being roughly equivalent in information content. The total 
mass of DNA contained in these 46 enormous DNA molecules is 
4x10” daltons. Because each nucleotide pair contributes 
660 daltons to the mass of DNA and 0.34 nm to the length of 
DNA, what is the total number of nucleotide pairs and the com- 
plete length of the DNA in a liver cell? How does this length 
compare with the overall dimensions of a liver cell? The maximal 
information in each set of liver cell chromosomes should be re- 
lated to the number of nucleotide pairs in the chromosome set’s 
DNA. This number can be obtained by dividing the total number 
of nucleotide pairs just calculated by 2. What is this value? If this 
information is expressed in proteins that average 400 amino 
acids in length and three nucleotide pairs encode one amino 
acid in a protein, how many different kinds of proteins might a 
liver cell be able to produce? (In reality, liver cell DNA encodes 
approximately 20,000 different proteins. Thus, a large discrep- 
ancy exists between the theoretical information content of DNA 
in liver cells and the amount of information actually expressed.) 


Biomolecules interact with one another through molecular surfaces 
that are structurally complementary. How can various proteins in- 
teract with molecules as different as simple ions, hydrophobic 
lipids, polar but uncharged carbohydrates, and even nucleic acids? 
What structural features allow biological polymers to be informa- 
tional macromolecules? Is it possible for polysaccharides to be in- 
formational macromolecules? 

Why is it important that weak forces, not strong forces, mediate bio- 
molecular recognition? 

What is the distance between the centers of two carbon atoms (their 
limit of approach) that are interacting through van der Waals forces? 
What is the distance between the centers of two carbon atoms 
joined in a covalent bond? (See Table 1.4.) 


Why does the central role of weak forces in biomolecular interac- 
tions restrict living systems to a narrow range of environmental 
conditions? 

Describe what is meant by the phrase “cells are steady-state systems.” 
The genome of the Mycoplasma genitalium consists of 523 genes, en- 
coding 484 proteins, in just 580,074 base pairs (Table 1.6). What 
fraction of the M. genitalium genes encode proteins? What do you 
think the other genes encode? If the fraction of base pairs devoted 
to protein-coding genes is the same as the fraction of the total 
genes that they represent, what is the average number of base pairs 
per protein-coding gene? If it takes 3 base pairs to specify an amino 
acid in a protein, how many amino acids are found in the average 
M. genitalium protein? If each amino acid contributes on average 


120 Daltons to the mass of a protein, what is the mass of an average 
M. genitalium protein? 

13. Studies of existing cells to determine the minimum number of 
genes for a living cell have suggested that 206 genes are sufficient. 
If the ratio of protein-coding genes to non—protein-coding genes is 
the same in this minimal organism as the genes of Mycoplasma geni- 
talium, how many proteins are represented in these 206 genes? How 
many base pairs would be required to form the genome of this min- 
imal organism if the genes are the same size as M. genitalium genes? 

14. Virus genomes range in size from approximately 3500 nucleotides 
to approximately 280,000 base pairs. If viral genes are about the 
same size as M. genitalium genes, what is the minimum and maxi- 
mum number of genes in viruses? 
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15. The endoplasmic reticulum (ER) is a site of protein synthesis. Pro- 
teins made by ribosomes associated with the ER may pass into the 
ER membrane or enter the lumen of the ER. Devise a pathway by 
which: 
a. a plasma membrane protein may reach the plasma membrane. 
b. a secreted protein may be deposited outside the cell. 


Preparing for the MCAT Exam 


16. Biological molecules often interact via weak forces (H bonds, van 
der Waals interactions, etc.). What would be the effect of an in- 
crease in kinetic energy on such interactions? 

17. Proteins and nucleic acids are informational macromolecules. What 
are the two minimal criteria for a linear informational polymer? 
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Water: The Medium of Life 






ESSENTIAL QUESTION 


Water provided conditions for the origin, evolution, and flourishing of life; water is 








g the medium of life. 
8 What are the properties of water that render it so suited to its role as the 
3 medium of life? 
È 
© 
Where there's water, there's life. 
If there is magic on this planet, it is contained in Water is a major chemical component of the earth’s surface. It is indispensable to 
water. life. Indeed, it is the only liquid that most organisms ever encounter. We are 
Loren Eisley prone to take it for granted because of its ubiquity and bland nature, yet we mar- 
(inscribed on the wall of the National Aquarium vel at its many unusual and fascinating properties. At the center of this fascination 
in Baltimore, Maryland) is the role of water as the medium of life. Life originated, evolved, and thrives in 
the seas. Organisms invaded and occupied terrestrial and aerial niches, but none 
KEY QUESTIONS gained true independence from water. Typically, organisms are 70% to 90% water. 
Indeed, normal metabolic activity can occur only when cells are at least 65% H0O. 
2.1 What Are the Properties of Water? This dependency of life on water is not a simple matter, but it can be grasped by 
2.2 What Is pH? considering the unusual chemical and physical properties of HO. Subsequent 
2.3. What Are Buffers, and What Do They Do? chapters establish that water and its ionization products, hydrogen ions and hy- 
2.4 What Properties of Water Give It a Unique droxide ions, are critical determinants of the structure and function of many bio- 
Role in the Environment? molecules, including amino acids and proteins, nucleotides and nucleic acids, 


and even phospholipids and membranes. In yet another essential role, water is an 
indirect participant—a difference in the concentration of hydrogen ions on op- 
posite sides of a membrane represents an energized condition essential to bio- 
logical mechanisms of energy transformation. First, let’s review the remarkable 
properties of water. 


ule What Are the Properties of Water? 


Water Has Unusual Properties 


Compared with chemical compounds of similar atomic organization and molecular 
size, water displays unexpected properties. For example, compare water, the hy- 
dride of oxygen, with hydrides of oxygen’s nearest neighbors in the periodic table, 
namely, ammonia (NHs) and hydrogen fluoride (HF), or with the hydride of its 
nearest congener, sulfur (H»S). Water has a substantially higher boiling point, melt- 
ing point, heat of vaporization, and surface tension. Indeed, all of these physical 
properties are anomalously high for a substance of this molecular weight that is nei- 
ther metallic nor ionic. These properties suggest that intermolecular forces of at- 
traction between HO molecules are high. Thus, the internal cohesion of this sub- 
stance is high. Furthermore, water has an unusually high dielectric constant, its 
maximum density is found in the liquid (not the solid) state, and it has a negative 
volume of melting (that is, the solid form, ice, occupies more space than does the 
liquid form, water). It is truly remarkable that so many eccentric properties occur 
together in this single substance. As chemists, we expect to find an explanation for 
CENGAGENOW* Create your own study path for these apparent eccentricities in the structure of water. The key to its intermolecular 
this chapter with tutorials, simulations, animations, attractions must lie in its atomic constitution. Indeed, the unrivaled ability to form hy- 
and Active Figures at www.cengage.com/ login : : od : 
drogen bonds is the crucial fact to understanding its properties. 








Hydrogen Bonding in Water Is Key to Its Properties 


The two hydrogen atoms of water are linked covalently to oxygen, each sharing an 
electron pair, to give a nonlinear arrangement (Figure 2.1). This “bent” structure of 
the HO molecule has enormous influence on its properties. If HO were linear, it 
would be a nonpolar substance. In the bent configuration, however, the electroneg- 
ative O atom and the two H atoms form a dipole that renders the molecule distinctly 
polar. Furthermore, this structure is ideally suited to H-bond formation. Water can 
serve as both an H donor and an H acceptor in H-bond formation. The potential to 
form four H bonds per water molecule is the source of the strong intermolecular at- 
tractions that endow this substance with its anomalously high boiling point, melting 
point, heat of vaporization, and surface tension. In ordinary ice, the common crys- 
talline form of water, each H¿O molecule has four nearest neighbors to which it is 
hydrogen bonded: Each H atom donates an H bond to the O of a neighbor, and the 
O atom serves as an H-bond acceptor from H atoms bound to two different water 
molecules (Figure 2.2). A local tetrahedral symmetry results. 

Hydrogen bonding in water is cooperative. That is, an H-bonded water mole- 
cule serving as an acceptor is a better H-bond donor than an unbonded molecule 
(and an HO molecule serving as an H-bond donor becomes a better H-bond ac- 
ceptor). Thus, participation in H bonding by HO molecules is a phenomenon 
of mutual reinforcement. The H bonds between neighboring molecules are weak 
(23 kJ/mol each) relative to the H—O covalent bonds (420 kJ/mol). As a conse- 
quence, the hydrogen atoms are situated asymmetrically between the two oxygen 
atoms along the O-O axis. There is never any ambiguity about which O atom the 
H atom is chemically bound to, nor to which O it is H bonded. 


The Structure of Ice Is Based On H-Bond Formation 


In ice, the hydrogen bonds form a space-filling, three-dimensional network. These 
bonds are directional and straight; that is, the H atom lies on a direct line between 
the two O atoms. This linearity and directionality mean that the H bonds in ice are 
strong. In addition, the directional preference of the H bonds leads to an open lat- 
tice structure. For example, if the water molecules are approximated as rigid 
spheres centered at the positions of the O atoms in the lattice, then the observed 
density of ice is actually only 57% of that expected for a tightly packed arrangement 
of such spheres. The H bonds in ice hold the water molecules apart. Melting in- 
volves breaking some of the H bonds that maintain the crystal structure of ice so 





CENGAGENOW”™ ANIMATED FIGURE 2.2 The structure of normal ice. The smallest number of H2O molecules in 
any closed circuit of H-bonded molecules is six, so this structure bears the name hexagonal ice. See this figure 
animated at www.cengage.com/login 
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Dipole moment 


Covalent bond 
length = 0.095 nm 


van der Waals radius 
of oxygen = 0.14 nm 





van der Waals radius 
of hydrogen = 0.12 nm 


CENGAGENOW” ACTIVE FIGURE 2.1 The structure of 
water. Two lobes of negative charge formed by the 
lone-pair electrons of the oxygen atom lie above and 
below the plane of the diagram. This electron density 
contributes substantially to the large dipole moment of 
the water molecule. Note that the H—O—H angle is 
104.3°, not 109°, the angular value found in molecules 
with tetrahedral symmetry, such as CH4. Many of the 
important properties of water derive from this angular 
value, such as the decreased density of its crystalline 
state, ice. Test yourself on the concepts in this figure 
at www.cengage.com/login 
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CENGAGENOW” ACTIVE FIGURE 2.3 The fluid network 
of H bonds linking water molecules in the liquid state. It 
is revealing to note that, in 10 psec, a photon of light 
(which travels at 3 X 108 m/sec) would move a distance 
of only 0.003 m. Test yourself on the concepts in this 


figure at www.cengage.com/login 


H bond 


that the molecules of water (now liquid) can actually pack closer together. Thus, the 
density of ice is slightly less than that of water. Ice floats, a property of great impor- 
tance to aquatic organisms in cold climates. 

In liquid water, the rigidity of ice is replaced by fluidity and the crystalline pe- 
riodicity of ice gives way to spatial homogeneity. The HO molecules in liquid wa- 
ter form a disordered H-bonded network, with each molecule having an average 
of 4.4 close neighbors situated within a center-to-center distance of 0.284 nm 
(2.84 A). At least half of the hydrogen bonds have nonideal orientations (that is, 
they are not perfectly straight); consequently, liquid HO lacks the regular lat- 
ticelike structure of ice. The space about an O atom is not defined by the pres- 
ence of four hydrogens but can be occupied by other water molecules randomly 
oriented so that the local environment, over time, is essentially uniform. Never- 
theless, the heat of melting for ice is but a small fraction (13%) of the heat of sub- 
limation for ice (the energy needed to go from the solid to the vapor state). This 
fact indicates that the majority of H bonds between HO molecules survive the 
transition from solid to liquid. At 10°C, 2.3 H bonds per H,O molecule remain 
and the tetrahedral bond order persists, even though substantial disorder is now 
present. 


Molecular Interactions in Liquid Water Are Based on H Bonds 


The present interpretation of water structure is that water molecules are connected 
by uninterrupted H-bond paths running in every direction, spanning the whole 
sample. The participation of each water molecule in an average state of H bonding 
to its neighbors means that each molecule is connected to every other in a fluid net- 
work of H bonds. The average lifetime of an H-bonded connection between two 
H,O molecules in water is 9.5 psec (picoseconds, where 1 psec = 107!* sec). Thus, 
about every 10 psec, the average HyO molecule moves, reorients, and interacts with 
new neighbors, as illustrated in Figure 2.3. 

In summary, pure liquid water consists of HO molecules held in a disordered, 
three-dimensional network that has a local preference for tetrahedral geometry, yet 
contains a large number of strained or broken hydrogen bonds. The presence of 
strain creates a kinetic situation in which HO molecules can switch H-bond alle- 
giances; fluidity ensues. 


The Solvent Properties of Water Derive from Its Polar Nature 


Because of its highly polar nature, water is an excellent solvent for ionic sub- 
stances such as salts; nonionic but polar substances such as sugars, simple 
alcohols, and amines; and carbonyl-containing molecules such as aldehydes and 
ketones. Although the electrostatic attractions between the positive and negative 
ions in the crystal lattice of a salt are very strong, water readily dissolves salts. For 
example, sodium chloride is dissolved because dipolar water molecules partici- 
pate in strong electrostatic interactions with the Na* and Cl” ions, leading to the 
formation of hydration shells surrounding these ions (Figure 2.4). Although hy- 
dration shells are stable structures, they are also dynamic. Each water molecule in 
the inner hydration shell around a Na* ion is replaced on average every 2 to 
4 nsec (nanoseconds, where 1 nsec = 107° sec) by another HO. Consequently, a 
water molecule is trapped only several hundred times longer by the electrostatic 
force field of an ion than it is by the H-bonded network of water. (Recall that the 
average lifetime of H bonds between water molecules is about 10 psec.) 


Water Has a High Dielectric Constant The attractions between the water mole- 
cules interacting with, or hydrating, ions are much greater than the tendency of op- 
positely charged ions to attract one another. Water’s ability to surround ions in di- 
pole interactions and diminish their attraction for each other is a measure of its 
dielectric constant, D. Indeed, ionization in solution depends on the dielectric con- 
stant of the solvent; otherwise, the strongly attracted positive and negative ions 





CENGAGENOW”™ ANIMATED FIGURE 2.4 
Hydration shells surrounding ions in 
solution. Water molecules orient so that 
the electrical charge on the ion is 
sequestered by the water dipole. See 
this figure animated at www 
.cengage.com/login 











would unite to form neutral molecules. The strength of the dielectric constant is re- 
lated to the force, F, experienced between two ions of opposite charge separated by 
a distance, r, as given in the relationship 


F= @@/Dr? 


where e and & are the charges on the two ions. Table 2.1 lists the dielectric con- 
stants of some common liquids. Note that the dielectric constant for water is more 
than twice that of methanol and more than 40 times that of hexane. 


Water Forms H Bonds with Polar Solutes In the case of nonionic but polar com- 
pounds such as sugars, the excellent solvent properties of water stem from its abil- 
ity to readily form hydrogen bonds with the polar functional groups on these com- 
pounds, such as hydroxyls, amines, and carbonyls (see Figure 1.14). These polar 
interactions between solvent and solute are stronger than the intermolecular at- 
tractions between solute molecules caused by van der Waals forces and weaker hy- 
drogen bonding. Thus, the solute molecules readily dissolve in water. 


Hydrophobic Interactions The behavior of water toward nonpolar solutes is dif- 
ferent from the interactions just discussed. Nonpolar solutes (or nonpolar func- 
tional groups on biological macromolecules) do not readily H bond to HO, and 
as a result, such compounds tend to be only sparingly soluble in water. The 
process of dissolving such substances is accompanied by significant reorganization 
of the water surrounding the solute so that the response of the solvent water to 
such solutes can be equated to “structure making.” Because nonpolar solutes must 
occupy space, the random H-bonded network of water must reorganize to ac- 
commodate them. At the same time, the water molecules participate in as many 
H-bonded interactions with one another as the temperature permits. Conse- 
quently, the H-bonded water network rearranges toward formation of a local cage- 
like (clathrate) structure surrounding each solute molecule, as shown for a long- 
chain fatty acid in Figure 2.5. This fixed orientation of water molecules around a 
hydrophobic “solute” molecule results in a hydration shell. A major consequence 
of this rearrangement is that the molecules of HO participating in the cage layer 
have markedly reduced options for orientation in three-dimensional space. Water 
molecules tend to straddle the nonpolar solute such that two or three tetrahedral 
directions (H-bonding vectors) are tangential to the space occupied by the inert 
solute. “Straddling” allows the water molecules to retain their H-bonding possi- 
bilities because no H-bond donor or acceptor of the H20 is directed toward the 
caged solute. The water molecules forming these clathrates are involved in highly 
ordered structures. That is, clathrate formation is accompanied by significant or- 
dering of structure or negative entropy. 
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Solvent 





Dielectric Constant 





Formamide 
Water 

Methyl alcohol 
Ethyl alcohol 
Acetone 
Acetic acid 
Chloroform 
Benzene 


Hexane 
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78.5 
32.6 
24.3 
20.7 
6.2 
5.0 
2.3 
1.9 





*The dielectric constant is also referred to as relative permitiv- 


ity by physical chemists. 
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CENGAGENOW” ANIMATED FIGURE 2.5 (left) Disordered network of H-bonded water 
molecules. (right) Clathrate cage of ordered, H-bonded water molecules around a 
nonpolar solute molecule. See this figure animated at www.cengage.com/ 

login 














Multiple nonpolar molecules tend to cluster together, because their joint solva- 
tion cage involves less total surface area and thus fewer ordered water molecules 
than in their separate cages. It is as if the nonpolar molecules had some net attrac- 
tion for one another. This apparent affinity of nonpolar structures for one another 
is called hydrophobic interactions (Figure 2.6). In actuality, the “attraction” be- 
tween nonpolar solutes is an entropy-driven process due to a net decrease in order 

















FIGURE 2.6 Hydrophobic interactions between nonpolar molecules (or nonpolar regions of molecules) are due 
to the increase in entropy of solvent water molecules. 


among the HO molecules. To be specific, hydrophobic interactions between non- 
polar molecules are maintained not so much by direct interactions between the 
inert solutes themselves as by the increase in entropy when the water cages coalesce 
and reorganize. Because interactions between nonpolar solute molecules and the 
water surrounding them are of uncertain stoichiometry and do not share the equal- 
ity of atom-to-atom participation implicit in chemical bonding, the term hydrophobic 
interaction is more correct than the misleading expression hydrophobic bond. 


Amphiphilic Molecules Compounds containing both strongly polar and strongly 
nonpolar groups are called amphiphilic molecules (from the Greek amphi meaning 
“both” and philos meaning “loving”). Such compounds are also referred to as 
amphipathic molecules (from the Greek pathos meaning “passion”). Salts of fatty 
acids are a typical example that has biological relevance. They have a long nonpo- 
lar hydrocarbon tail and a strongly polar carboxyl head group, as in the sodium salt 
of palmitic acid (Figure 2.7). Their behavior in aqueous solution reflects the com- 
bination of the contrasting polar and nonpolar nature of these substances. The 
ionic carboxylate function hydrates readily, whereas the long hydrophobic tail is in- 
trinsically insoluble. Nevertheless, sodium palmitate and other amphiphilic mole- 
cules readily disperse in water because the hydrocarbon tails of these substances are 
joined together in hydrophobic interactions as their polar carboxylate functions 
are hydrated in typical hydrophilic fashion. Such clusters of amphipathic molecules 
are termed micelles; Figure 2.7b depicts their structure. 


Influence of Solutes on Water Properties The presence of dissolved substances 
disturbs the structure of liquid water, thereby changing its properties. The dynamic 
H-bonding interactions of water must now accommodate the intruding substance. 
The net effect is that solutes, regardless of whether they are polar or nonpolar, fix 
nearby water molecules in a more ordered array. Ions, by establishing hydration 
shells through interactions with the water dipoles, create local order. Hydrophobic 
substances, for different reasons, make structures within water. To put it another 
way, by limiting the orientations that neighboring water molecules can assume, 
solutes give order to the solvent and diminish the dynamic interplay among HO 
molecules that occurs in pure water. 


Colligative Properties This influence of the solute on water is reflected in a set of 
characteristic changes in behavior termed colligative properties, or properties re- 
lated by a common principle. These alterations in solvent properties are related in 
that they all depend only on the number of solute particles per unit volume of sol- 
vent and not on the chemical nature of the solute. These effects include freezing 
point depression, boiling point elevation, vapor pressure lowering, and osmotic 
pressure effects. For example, 1 mol of an ideal solute dissolved in 1000 g of water 


(a) The sodium salt of palmitic acid: Sodium (b) 
palmitate (Nat 7OOC(CHo)14CH3) 
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CENGAGENOW” ACTIVE FIGURE 2.7 (a) An 
amphiphilic molecule: sodium palmitate. (b) Micelle 
formation by amphiphilic molecules in aqueous 
solution. Because of their negatively charged sur- 
faces, neighboring micelles repel one another and 
thereby maintain a relative stability in solution. Test 
yourself on the concepts in this figure at 
www.cengage.com/login 
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CENGAGENOW” ACTIVE FIGURE 2.8 The osmotic pres- 
sure of a 1 molal (m) solution is equal to 22.4 atmo- 
spheres of pressure. (a) If a nonpermeant solute is sepa- 
rated from pure water by a semipermeable membrane 
through which H,O passes freely, (b) water molecules 
enter the solution (osmosis) and the height of the solu- 
tion column in the tube rises. The pressure necessary to 
push water back through the membrane at a rate 
exactly equaled by the water influx is the osmotic pres- 
sure of the solution. (c) For a 1 m solution, this force is 
equal to 22.4 atm of pressure. Osmotic pressure is direct- 
ly proportional to the concentration of the nonper- 
meant solute. Test yourself on the concepts in this fig- 
ure at www.cengage.com/login 
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CENGAGENOW” ACTIVE FIGURE 2.9 The ionization of 
water. Test yourself on the concepts in this figure at 
www.cengage.com/login 
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(a 1 m, or molal, solution) at 1 atm pressure depresses the freezing point by 1.86°C, 
raises the boiling point by 0.543°C, lowers the vapor pressure in a temperature- 
dependent manner, and yields a solution whose osmotic pressure relative to pure 
water is 22.4 atm (at 25°C). In effect, by imposing local order on the water mole- 
cules, solutes make it more difficult for water to assume its crystalline lattice (freeze) 
or escape into the atmosphere (boil or vaporize). Furthermore, when a solution 
(such as the 1 m solution discussed here) is separated from a volume of pure water 
by a semipermeable membrane, the solution draws water molecules across this bar- 
rier. The water molecules are moving from a region of higher effective concentra- 
tion (pure HO) to a region of lower effective concentration (the solution). This 
movement of water into the solution dilutes the effects of the solute that is present. 
The osmotic force exerted by each mole of solute is so strong that it requires the 
imposition of 22.4 atm of pressure to be negated (Figure 2.8). 

Osmotic pressure from high concentrations of dissolved solutes is a serious prob- 
lem for cells. Bacterial and plant cells have strong, rigid cell walls to contain these 
pressures. In contrast, animal cells are bathed in extracellular fluids of comparable 
osmolarity, so no net osmotic gradient exists. Also, to minimize the osmotic pressure 
created by the contents of their cytosol, cells tend to store substances such as amino 
acids and sugars in polymeric form. For example, a molecule of glycogen or starch 
containing 1000 glucose units exerts only 1/1000 the osmotic pressure that 1000 
free glucose molecules would. 


Water Can lonize to Form H* and 0H- 


Water shows a small but finite tendency to form ions. This tendency is demonstrated 
by the electrical conductivity of pure water, a property that clearly establishes the 
presence of charged species (ions). Water ionizes because the larger, strongly elec- 
tronegative oxygen atom strips the electron from one of its hydrogen atoms, leav- 
ing the proton to dissociate (Figure 2.9): 


H—O—H——> H* + OH- 


Two ions are thus formed: (1) protons or hydrogen ions, H*, and (2) hydroxyl ions, 
OHZ. Free protons are immediately hydrated to form hydronium ions, H;O*: 


H* + HO —> H;O+ 


Indeed, because most hydrogen atoms in liquid water are hydrogen bonded to a 
neighboring water molecule, this protonic hydration is an instantaneous process 
and the ion products of water are HO" and OH’: 


H H H 
pej -H-O —> O—H+*+ OHT 
H a 


The amount of H;O* or OH™ in 1 L (liter) of pure water at 25°C is 1 X 1077 mol; 
the concentrations are equal because the dissociation is stoichiometric. 

Although it is important to keep in mind that the hydronium ion, or hydrated 
hydrogen ion, represents the true state in solution, the convention is to speak of hy- 
drogen ion concentrations in aqueous solution, even though “naked” protons are 
virtually nonexistent. Indeed, H;O* itself attracts a hydration shell by H bonding to 
adjacent water molecules to form an HQ," species (Figure 2.10) and even more 
highly hydrated forms. Similarly, the hydroxyl ion, like all other highly charged 
species, is also hydrated. 


Kw the lon Product of Water The dissociation of water into hydrogen ions and hy- 
droxyl ions occurs to the extent that 10~’mol of H* and 10~“mol of OH” are pres- 
ent at equilibrium in 1 L of water at 25°C. 








HO H* + OH” 
The equilibrium constant for this process is 
_ [H*][OH7] 
a [H20] 


where brackets denote concentrations in moles per liter. Because the concentration 
of H,O in 1 L of pure water is equal to the number of grams in a liter divided by the 
gram molecular weight of H,O, or 1000/18, the molar concentration of HsO in 
pure water is 55.5 M (molar). The decrease in HsO concentration as a result of ion 
formation ([H*], [OH7] = 1077M) is negligible in comparison; thus its influence 
on the overall concentration of H,O can be ignored. Thus, 


_, a07 (1077) 
55.5 


= 1.8 X 10° M 





Kea 


Because the concentration of H,O in pure water is essentially constant, a new con- 
stant, Kw, the ion product of water, can be written as 


K, = 55.5 Keq = 10° M? = [H*][OH7] 


This equation has the virtue of revealing the reciprocal relationship between H* 
and OH” concentrations of aqueous solutions. If a solution is acidic (that is, it has 
a significant [H*]), then the ion product of water dictates that the OH- concen- 
tration is correspondingly less. For example, if [H*] is 107° M, [OH~] must be 
1071? M (Ky = 107 M? = [10°-7][OH~]; [OH7] = 107" M). Similarly, in an alka- 
line, or basic, solution in which [OH 7] is great, [H*] is low. 


222s What Is pH? 


To avoid the cumbersome use of negative exponents to express concentrations that 
range over 14 orders of magnitude, Sgren Sgrensen, a Danish biochemist, devised 
the pH scale by defining pH as the negative logarithm of the hydrogen ion concentration’: 


pH = -logo [H+] 


Table 2.2 gives the pH scale. Note again the reciprocal relationship between [H+] 
and [OH]. Also, because the pH scale is based on negative logarithms, low pH val- 
ues represent the highest H* concentrations (and the lowest OH™ concentrations, 
as K,, specifies). Note also that 


pk, = pH + pOH = 14 


'To be precise in physical chemical terms, the activities of the various components, not their molar con- 
centrations, should be used in these equations. The activity (a) of a solute component is defined as the 
product of its molar concentration, c, and an activity coefficient, y: a = [cl y. Most biochemical work in- 
volves dilute solutions, and the use of activities instead of molar concentrations is usually neglected. 
However, the concentration of certain solutes may be very high in living cells. 
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CENGAGENOW”™ ANIMATED FIGURE 2.10 The hydration 
of H,0*. See this figure animated at www 
.cengage.com/login 
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TABLE 2.3 | ; 
Fluid pH 
Household lye 13.6 
Bleach 12.6 
Household ammonia 11.4 
Milk of magnesia 10.3 
Baking soda 8.4 
Seawater 8.0 
Pancreatic fluid 7.8-8.0 
Blood plasma 7.4 
Intracellular fluids 

Liver 6.9 

Muscle 6.1 
Saliva 6.6 
Urine 5-8 
Boric acid 5.0 
Beer 4.5 
Orange juice 4.3 
Grapefruit juice 3.2 
Vinegar 2.9 
Soft drinks 2.8 
Lemon juice 2.3 
Gastric juice 1.2-3.0 
Battery acid 0:35 











pH m H+ m~~ [OH] — 
0 (10°) 1.0 0.00000000000001 (10-14) 
1 (107!) 0.1 0.0000000000001 (1073) 
2 (10-2) 0.01 0.000000000001 (107!) 
3 (10-3) 0.001 0.00000000001 (10-1) 
4 a074 0.0001 0.0000000001 (10719) 
5 (10-5) 0.00001 0.000000001 (10-°) 
6 (1076) 0.000001 0.00000001 (10-8) 
7 (10-7) 0.0000001 0.0000001 (10-7) 
8 (10-8) 0.00000001 0.000001 (10-5) 
9 (1079) 0.000000001 0.00001 (10-5) 
10 (107!) 0.0000000001 0.0001 a074) 
11 a071 0.00000000001 0.001 (10-3) 
12 (107!) 0.000000000001 0.01 (10-23) 
13 a07») 0.0000000000001 0.1 (107!) 
14 (107!) 0.00000000000001 1.0 (10°) 








The pH scale is widely used in biological applications because hydrogen ion con- 
centrations in biological fluids are very low, about 1077 M or 0.0000001 M, a value 
more easily represented as pH 7. The pH of blood plasma, for example, is 7.4, or 
0.00000004 M H”. Certain disease conditions may lower the plasma pH level to 6.8 
or less, a situation that may result in death. At pH 6.8, the H* concentration is 
0.00000016 M, four times greater than at pH 7.4. 

At pH 7, [H*] = [OH]; that is, there is no excess acidity or basicity. The 
point of neutrality is at pH 7, and solutions having a pH of 7 are said to be at 
neutral pH. The pH values of various fluids of biological origin or relevance are 
given in Table 2.3. Because the pH scale is a logarithmic scale, two solutions 
whose pH values differ by 1 pH unit have a tenfold difference in [H*]. For ex- 
ample, grapefruit juice at pH 3.2 contains more than 12 times as much H* as or- 
ange juice at pH 4.3. 


Strong Electrolytes Dissociate Completely in Water 


Substances that are almost completely dissociated to form ions in solution are called 
strong electrolytes. The term electrolyte describes substances capable of generating 
ions in solution and thereby causing an increase in the electrical conductivity of the 
solution. Many salts (such as NaCl and KySQO,) fit this category, as do strong acids 
(such as HCl) and strong bases (such as NaOH). Recall from general chemistry that 
acids are proton donors and bases are proton acceptors. In effect, the dissociation 
of a strong acid such as HCl in water can be treated as a proton transfer reaction be- 
tween the acid HCl and the base HO to give the conjugate acid H;O* and the con- 
jugate base Cl": 


HCl + HyO—>H;07 + CI 
The equilibrium constant for this reaction is 
K= [H;O+] [C] 
[H20] [HCI] 


Customarily, because the term [HO] is essentially constant in dilute aqueous solu- 
tions, it is incorporated into the equilibrium constant K to give a new term, K,, the 


acid dissociation constant, where K, = K [HoO]. Also, the term [H30*] is often re- 
placed by H*, such that 
_ [HI(Cr] 
a [HCI] 
For HCI, the value of K, is exceedingly large because the concentration of HCI in 


aqueous solution is vanishingly small. Because this is so, the pH of HCI solutions is 
readily calculated from the amount of HCI used to make the solution: 


[H+Ħ] in solution = [HCI] added to solution 


Thus, a 1 M solution of HCI has a pH of 0; a 1 mM HCI solution has a pH of 3. Sim- 
ilarly, a 0.1 M NaOH solution has a pH of 13. (Because [OH] = 0.1 M, [H*] must 
be 107! M.) Viewing the dissociation of strong electrolytes another way, we see that 
the ions formed show little affinity for each other. For example, in HCl in water, C17 
has very little affinity for H*: 


HCI—> H?* + C- 


and in NaOH solutions, Na* has little affinity for OH~. The dissociation of these 
substances in water is effectively complete. 


Weak Electrolytes Are Substances That Dissociate Only Slightly in Water 


Substances with only a slight tendency to dissociate to form ions in solution are 
called weak electrolytes. Acetic acid, CH;COOH, is a good example: 


CH,COOH + H,O—=CH;,COO- + H;O+* 





The acid dissociation constant K, for acetic acid is 1.74 X 10~° M: 


[H*][CH;COO-] E 
K, = = 1.74 X 1075 M 
[CH;COOH] 





K, is also termed an ionization constant because it states the extent to which a sub- 
stance forms ions in water. The relatively low value of K, for acetic acid reveals that 
the un-ionized form, CH;COOH, predominates over H* and CH;COO7 in aqueous 
solutions of acetic acid. Viewed another way, CH;COO_, the acetate ion, has a high 
affinity for H+. 


| EXAMPLE | What is the pH of a 0.1 Msolution of acetic acid? In other words, what 


is the final pH when 0.1 mol of acetic acid (HAc) is added to water and the volume 
of the solution is adjusted to equal 1 L? 


Answer 
The dissociation of HAc in water can be written simply as 


HAc ==H?* + Ac™ 


where Ac” represents the acetate ion, CH;COO~. In solution, some amount x of 
HAc dissociates, generating x amount of Ac~ and an equal amount x of H*. Ionic 
equilibria characteristically are established very rapidly. At equilibrium, the concen- 
tration of HAc + Ac~ must equal 0.1 M. So, [HAc] can be represented as (0.1 — x) 
M, and [Ac] and [H*] then both equal x molar. From 1.74 X 10% M= 
({H*][Ac7])/[HAc], we get 1.74 X 107™* M= x?/[0.1 — x]. The solution to qua- 
dratic equations of this form (ax? + bx + c= 0) is x= —b + VP? — 4ac/2a. For x? + 
(1.74 X 107°) x — (1.74 X 107°) = 0, x = 1.319 X 107° M, so pH = 2.88. (Note that 
the calculation of x can be simplified here: Because K, is quite small, x < 0.1 M. 
Therefore, K, is essentially equal to x?/0.1. Thus, x? = 1.74 X 107° M?, so x = 1.32 X 
107° M, and pH = 2.88.) 
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The Henderson-Hasselbalch Equation Describes the Dissociation 
of a Weak Acid In the Presence of Its Conjugate Base 


Consider the ionization of some weak acid, HA, occurring with an acid dissociation 
constant, K,. Then, 


HA= =H? + A7 


and 
H+] [A7 
ga mM 
[HA] 
Rearranging this expression in terms of the parameter of interest, [H*], we have 
ape) = BA 
[A7] 


Taking the logarithm of both sides gives 


[HA] 
log [H*] = log K, + logio TA] 


If we change the signs and define pK, = —log Ka, we have 


[HA] 
pH = pK, — logio TAJ 


or 


[A7] 


H = pk, +1 a 
P P 0810 [HA] 


This relationship is known as the Henderson—Hasselbalch equation. Thus, the pH 
of a solution can be calculated, provided K, and the concentrations of the weak 
acid HA and its conjugate base A~ are known. Note particularly that when [HA] = 
[A7], pH = pKa. For example, if equal volumes of 0.1 M HAc and 0.1 M sodium 
acetate are mixed, then 


pH = pK, = 4.76 
pK, = —log Ka = —logio(1.74 X 107°) = 4.76 


(Sodium acetate, the sodium salt of acetic acid, is a strong electrolyte and dissoci- 
ates completely in water to yield Na* and Ac.) 

The Henderson-Hasselbalch equation provides a general solution to the quan- 
titative treatment of acid-base equilibria in biological systems. Table 2.4 gives the 
acid dissociation constants and pK, values for some weak electrolytes of biochemi- 
cal interest. 





| EXAMPLE | What is the pH when 100 mL of 0.1 N NaOH is added to 150 mL of 
0.2 M HAc if pK, for acetic acid = 4.76? 


Answer 
100 mL 0.1 N NaOH = 0.01 mol OH~, which neutralizes 0.01 mol of HAc, giving 
an equivalent amount of Ac”: 


OH- + HAc—> Ac” + HO 


0.02 mol of the original 0.03 mol of HAc remains essentially undissociated. The fi- 
nal volume is 250 mL. 
H = pk, +1 eee ees (0.01 mol/0.02 mol) 
P PAa T 10810 [HAc] : og (0.01 mol/0.02 mo 











If 150 mL of 0.2 M HAc had merely been diluted with 100 mL of water, this would 
leave 250 mL of a 0.12 M HAc solution. The pH would be given by: 


[H+] [Ac] x2 
a [HAc] — 0.12 M 





= 1.74 X 10°M 


x= 1.44 x 10°3 = [H+] 
pH = 2.84 








Titration Curves Illustrate the Progressive Dissociation 
of a Weak Acid 


Titration is the analytical method used to determine the amount of acid in 
a solution. A measured volume of the acid solution is titrated by slowly adding a 
solution of base, typically NaOH, of known concentration. As incremental 
amounts of NaOH are added, the pH of the solution is determined and a plot of 
the pH of the solution versus the amount of OH” added yields a titration curve. 
The titration curve for acetic acid is shown in Figure 2.11. In considering the 
progress of this titration, keep in mind two important equilibria: 


1. HAc=>=H*+ + Ac -K, = 1.74 X 10° 


[H20] 
[Ky] 





2. H? + OH ==H,O K= = 5.55 X 10! 


As the titration begins, mostly HAc is present, plus some H* and Ac” in amounts that 
can be calculated (see the Example on page 37). Addition of a solution of NaOH al- 
lows hydroxide ions to neutralize any H* present. Note that reaction (2) as written 
is strongly favored; its apparent equilibrium constant is greater than 1015! As H* is 
neutralized, more HAc dissociates to H* and Ac”. The stoichiometry of the titration 
is 1:1—for each increment of OH™ added, an equal amount of the weak acid HAc is 
titrated. As additional NaOH is added, the pH gradually increases as Ac™ accumu- 











TABLE 2.4 | 

Acid K, (M) pKa 

HCOOH (formic acid) 1.78 X 1074 3.75 
CHCOOH (acetic acid) 1.74 X 10% 4.76 
CH;CH,COOH (propionic acid) 1.35 X 107% 4.87 
CH3;CHOHCOOH (lactic acid) 1.38 X 1074 3.86 
HOOCCH,CH.2COOH (succinic acid) pk,* 6.16 x 107% 4.21 
HOOCCHCH COO- (succinic acid) pKə 2.34 X 107° 5.63 
HPO; (phosphoric acid) pK; 7.08 X 1073 2.15 
HPO; (phosphoric acid) pK 6.31 X 1078 7.20 
HPO,” (phosphoric acid) pK; 3.98 X 107? 12.40 
CsNoH;* (imidazole) 1.02 X 1077 6.99 
CgOeN3H1;* (histidine-imidazole group) pK! 9.12 X 1077 6.04 
HCO; (carbonic acid) pk 1.70 X 1074 3.77 
HCO; (bicarbonate) pK» 5.75 X 107! 10.24 
(HOCH,);CNH3;* (trishydroxymethyl aminomethane) 8.32 X 107° 8.07 
NH,* (ammonium) 5.62 X 10719 9.25 
CH;NH;* (methylammonium) 2.46 X 1071! 10.62 








*The pK values listed as pK;, pK2, or pK; are in actuality pK, values for the respective dissociations. This simplification in 
notation is used throughout this book. 

pKr refers to the imidazole ionization of histidine. 

Data from CRC Handbook of Biochemistry, The Chemical Rubber Co., 1968. 
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CENGAGENOW™ ANIMATED FIGURE 2.11 The titration 
curve for acetic acid. Note that the titration curve is rela- 
tively flat at pH values near the pK,. In other words, the 
pH changes relatively little as OH7 is added in this 
region of the titration curve. See this figure animated 
at www.cengage.com/login 
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Titration midpoint 
[HA] = [A7] 
pH px, 


NH3 
pK, =9.25 
[NH}] = [NHs] w 
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I 
i Nv 
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N-H 








— Imidazole 
[imid.H*] = [imid] 
pK, = 6.99 


pH 
H-N N-H 
+ 6 


Imidazole Ht ————* 






pK, = 4.76 
CHCOOH CH,COO- 


[CH,COOH] = [CH,;COO7] 


0.5 1.0 
Equivalents of OH™ 


CENGAGENOW” ANIMATED FIGURE 2.12 The titration curves of several weak electrolytes: acetic acid, imidazole, 
and ammonium. See this figure animated at www.cengage.com/login 


lates at the expense of diminishing HAc and the neutralization of H+. At the point 
where half of the HAc has been neutralized (that is, where 0.5 equivalent of OH~ has 
been added), the concentrations of HAc and Ac” are equal and pH = pK, for HAc. 
Thus, we have an experimental method for determining the pK, values of weak elec- 
trolytes. These pK, values lie at the midpoint of their respective titration curves. Af- 
ter all of the acid has been neutralized (that is, when one equivalent of base has been 
added), the pH rises exponentially. 

The shapes of the titration curves of weak electrolytes are identical, as Figure 
2.12 reveals. Note, however, that the midpoints of the different curves vary in a way 
that characterizes the particular electrolytes. The pK, for acetic acid is 4.76, the pK, 
for imidazole is 6.99, and that for ammonium is 9.25. These pK, values are directly 
related to the dissociation constants of these substances, or, viewed the other way, to 
the relative affinities of the conjugate bases for protons. NH; has a high affinity for 
protons compared to Ac’; NH,* is a poor acid compared to HAc. 


Phosphoric Acid Has Three Dissociable H* 


Figure 2.13 shows the titration curve for phosphoric acid, H3PO,4. This substance 
is a polyprotic acid, meaning it has more than one dissociable proton. Indeed, it 
has three, and thus three equivalents of OH- are required to neutralize it, as Fig- 
ure 2.13 shows. Note that the three dissociable H* are lost in discrete steps, each 
dissociation showing a characteristic pK,. Note that pK, occurs at pH = 2.15, and 
the concentrations of the acid HPO; and the conjugate base HsPO,4” are equal. 
As the next dissociation is approached, HPO, is treated as the acid and HPO,?~ 
is its conjugate base. Their concentrations are equal at pH 7.20, so pK = 7.20. 
(Note that at this point, 1.5 equivalents of OH- have been added.) As more OH- 
is added, the last dissociable hydrogen is titrated, and pK; occurs at pH = 12.4, 
where [HPO,27] = [PO,°7]. 

The shape of the titration curves for weak electrolytes has a biologically rele- 
vant property: In the region of the pK,, pH remains relatively unaffected as in- 
crements of OH- (or H+) are added. The weak acid and its conjugate base are 
acting as a buffer. 





ul [HPO,2-] = 

I [PO] JPO, 
12 PE a A 
10 


[HPO,?-] = 


4 


Equivalents OH” added —~ 


2:30 What Are Buffers, and What Do They Do? 


Buffers are solutions that tend to resist changes in their pH as acid or base is added. 
Typically, a buffer system is composed of a weak acid and its conjugate base. A solution 
of a weak acid that has a pH nearly equal to its pK,, by definition, contains an amount 
of the conjugate base nearly equivalent to the weak acid. Note that in this region, the 
titration curve is relatively flat (Figure 2.14). Addition of H* then has little effect be- 
cause it is absorbed by the following reaction: 


Ht + A-—>HA 
Similarly, any increase in [OH] is offset by the process 
OH- + HA—>A™ + HO 


Thus, the pH remains relatively constant. The components of a buffer system are 
chosen such that the pK, of the weak acid is close to the pH of interest. It is at the 
pX, that the buffer system shows its greatest buffering capacity. At pH values more 
than 1 pH unit from the pKa, buffer systems become ineffective because the con- 
centration of one of the components is too low to absorb the influx of H* or OH™. 
The molarity of a buffer is defined as the swm of the concentrations of the acid and 
conjugate base forms. 

Maintenance of pH is vital to all cells. Cellular processes such as metabolism are 
dependent on the activities of enzymes; in turn, enzyme activity is markedly influ- 
enced by pH, as the graphs in Figure 2.15 show. Consequently, changes in pH would 
be disruptive to metabolism for reasons that become apparent in later chapters. Or- 
ganisms have a variety of mechanisms to keep the pH of their intracellular and ex- 
tracellular fluids essentially constant, but the primary protection against harmful pH 
changes is provided by buffer systems. The buffer systems selected reflect both the 
need for a pK, value near pH 7 and the compatibility of the buffer components with 
the metabolic machinery of cells. Two buffer systems act to maintain intracellular pH 
essentially constant—the phosphate (HPO, /HPO,_) system and the histidine sys- 
tem. The pH of the extracellular fluid that bathes the cells and tissues of animals is 
maintained by the bicarbonate/carbonic acid (HCO, /H»COs) system. 


The Phosphate Buffer System Is a Major Intracellular Buffering System 


The phosphate system serves to buffer the intracellular fluid of cells at physiological 
pH because pK lies near this pH value. The intracellular pH of most cells is main- 
tained in the range between 6.9 and 7.4. Phosphate is an abundant anion in cells, both 
in inorganic form and as an important functional group on organic molecules that 
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CENGAGENOW™ ANIMATED FIGURE 2.13 The titration 
curve for phosphoric acid. See this figure animated at 
www.cengage.com/login 
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Buffer action: 


OH H,O 


CENGAGENOW” ANIMATED FIGURE 2.14 A buffer 
system consists of a weak acid, HA, and its conjugate 
base, A~. See this figure animated at www.cengage 
.com/login 
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FIGURE 2.15 pH versus enzymatic activity. Pepsin is a 
protein-digesting enzyme active in the gastric fluid. 
Fumarase is a metabolic enzyme found in mitochon- 
dria. Lysozyme digests the cell walls of bacteria; it is 
found in tears. 


serve as metabolites or macromolecular precursors. In both organic and inorganic 
forms, its characteristic pK means that the ionic species present at physiological pH 
are sufficient to donate or accept hydrogen ions to buffer any changes in pH, as the 
titration curve for HPO; in Figure 2.13 reveals. For example, if the total cellular con- 
centration of phosphate is 20 mM (millimolar) and the pH is 7.4, the distribution of 
the major phosphate species is given by 


[HPO,?-] 
pH = pKo + logio [HPO] 
P [HPO? | 

4 = 7.20 + logio [H.PO,7] 
[HPO] 
wmon n L58 
[H2PO, ] 


Thus, if [HPO,?~] + [H.PO,7] = 20 mM, then 
[HPO] = 12.25 mM and [HPO] = 7.75 mM 


Dissociation of the Histidine—Imidazole Group Also Serves 
as an Intracellular Buffering System 


Histidine is one of the 20 naturally occurring amino acids commonly found in pro- 
teins (see Chapter 4). It possesses as part of its structure an imidazole group, a five- 
membered heterocyclic ring possessing two nitrogen atoms. The pK, for dissocia- 
tion of the imidazole hydrogen of histidine is 6.04. 


cCoO COO 
pK, = 6.04 


| | 
Hwe ey | H* + N= 0 
HN. ,N'H HN. N 


\4 \4Z 


In cells, histidine occurs as the free amino acid, as a constituent of proteins, and 
as part of dipeptides in combination with other amino acids. Because the con- 
centration of free histidine is low and its imidazole pK, is more than 1 pH unit re- 
moved from prevailing intracellular pH, its role in intracellular buffering is mi- 
nor. However, protein-bound and dipeptide histidine may be the dominant 
buffering system in some cells. In combination with other amino acids, as in pro- 
teins or dipeptides, the imidazole pK, may increase substantially. For example, 
the imidazole pK, is 7.04 in anserine, a dipeptide containing f-alanine and histi- 
dine (Figure 2.16). Thus, this pK, is near physiological pH, and some histidine 
peptides are well suited for buffering at physiological pH. 


“Good” Buffers Are Buffers Useful Within Physiological pH Ranges 


Not many common substances have pK, values in the range from 6 to 8. Conse- 
quently, biochemists conducting in vitro experiments were limited in their choice 
of buffers effective at or near physiological pH. In 1966, N. E. Good devised a set of 


o Cc 
I | 


+ 
HN — CHy— CHy— C i CH— Ch 7 





+ 
N, NH 
H 
w >Z 


FIGURE 2.16 Anserine (N-B-alanyl-3-methyl-L-histidine) is an important dipeptide buffer in the maintenance of 
intracellular pH in some tissues. The structure shown is the predominant ionic species at pH 7. pK; (COOH) = 2.64; 
pk, (imidazole-N*H) = 7.04; pK; (NH3) = 9.49. 
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The Bicarbonate Buffer System of Blood Plasma 


The important buffer system of blood plasma is the bicarbonate/ 
carbonic acid couple: 


HCO; == Ht + HCO;~ 


The relevant pKa, pK; for carbonic acid, has a value far removed 
from the normal pH of blood plasma (pH 7.4). (The pk, for 
HCO; at 25°C is 3.77 [Table 2.4], but at 37°C, pK; is 3.57.) At pH 
7.4, the concentration of HCO; is a minuscule fraction of the 
HCOs;~ concentration; thus the plasma appears to be poorly pro- 
tected against an influx of OH” ions. 








H = 7.4 = 3.57 +1 Her! 
E | 
3 
—— = 6761 
[H,COs] 


For example, if [HCO;"] = 24 mM, then [H COs] is only 3.55 uM 
(3.55 X 107° M), and an equivalent amount of OH” (its usual con- 
centration in plasma) would swamp the buffer system, causing a 
dangerous rise in the plasma pH. How, then, can this bicarbonate 
system function effectively? The bicarbonate buffer system works 
well because the critical concentration of HCO; is maintained 
relatively constant through equilibrium with dissolved CO, pro- 
duced in the tissues and available as a gaseous CO» reservoir in the 
lungs.* 

Gaseous CO, from the lungs and tissues is dissolved in the blood 
plasma, symbolized as CO(d), and hydrated to form H»COs: 


CO2(g) ==CO2(d) 
CO»(d) + H:O === HCO; 
HCO; ==H* T HCO; 


Thus, the concentration of HCO; is itself buffered by the avail- 
able pools of CO». The hydration of CO» is actually mediated by an 
enzyme, carbonic anhydrase, which facilitates the equilibrium by 
rapidly catalyzing the reaction 


Under the conditions of temperature and ionic strength prevail- 
ing in mammalian body fluids, the equilibrium for this reaction 
lies far to the left, such that more than 300 COs molecules are 
present in solution for every molecule of HyCOs. Because dis- 
solved CO, and HCO; are in equilibrium, the proper expression 
for HCO; availability is [COə(d)] + [H»COs], the so-called total 
carbonic acid pool, consisting primarily of COə(d). The overall 
equilibrium for the bicarbonate buffer system then is 


Ky 
CO;(d) + H:O — = FRc; 


Ka 
HCO; =—=H+* F HCO; 


An expression for the ionization of HyCOs under such conditions 
(that is, in the presence of dissolved CO») can be obtained from 


*Well-fed humans exhale about 1 kg of CO» daily. Imagine the excretory 
problem if CO, were not a volatile gas. 


Ky, the equilibrium constant for the hydration of COs, and from 
K,, the first acid dissociation constant for HyCOs;: 


__[HCOs] 
*" [CO,(d)] 
Thus, 
[HgCOs] = K,[COs(d)] 
Putting this value for [H»COs] into the expression for the first dis- 
sociation of HCO; gives 
[H+] [HCO; ] 
[HCO;] 
[H*][HCO;7] 
K, [CO2(d)] 
Therefore, the overall equilibrium constant for the ionization of 
HCO; in equilibrium with COə(d) is given by 
[H+] [HCO 37] 
K, [CO2(d) ] 
and K,Kn, the product of two constants, can be defined as a new 
equilibrium constant, Koyerau» The value of K, is 0.003 at 37°C and 


K, the ionization constant for HsCOs, is 107357 = 0.000269. 
Therefore, 


K,= 


K,By, = 


Koveratt = (0.000269) (0.003) 
= 8.07 X 1077 
PKoverait = 6.1 


which yields the following Henderson—Hasselbalch relationship: 


ES E T a 
= overa! (0) 
Pie preca 1°80 TOO(d)] 


Although the prevailing blood pH of 7.4 is more than 1 pH unit 
away from pXKoyveran, the bicarbonate system is still an effective 
buffer. Note that, at blood pH, the concentration of the acid com- 
ponent of the buffer will be less than 10% of the conjugate base 
component. One might imagine that this buffer component could 
be overwhelmed by relatively small amounts of alkali, with conse- 
quent disastrous rises in blood pH. However, the acid component 
is the total carbonic acid pool, that is, [COə(d)] + [H2COs], which 
is stabilized by its equilibrium with CO»(g). Gaseous CO» serves to 
buffer any losses from the total carbonic acid pool by entering so- 
lution as COs(d), and blood pH is effectively maintained. Thus, 
the bicarbonate buffer system is an open system. The natural pres- 
ence of COs gas at a partial pressure of 40 mm Hg in the alveoli of 
the lungs and the equilibrium 


CO2(g) —=CO,(d) 


keep the concentration of CO.(d) (the principal component of the 
total carbonic acid pool in blood plasma) in the neighborhood of 
1.2 mM. Plasma [HCO;_] is about 24 mM under such conditions. 
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Blood pH and Respiration 


Hyperventilation, defined as a breathing rate more rapid than 
necessary for normal COs, elimination from the body, can result 
in an inappropriately low [CO»(g) ] in the blood. Central nervous 
system disorders such as meningitis, encephalitis, or cerebral 
hemorrhage, as well as a number of drug- or hormone-induced 
physiological changes, can lead to hyperventilation. As [COo(g) | 
drops due to excessive exhalation, [H2COs] in the blood plasma 
falls, followed by a decline in [H*] and [HCO; ] in the blood 
plasma. Blood pH rises within 20 sec of the onset of hyperventi- 
lation, becoming maximal within 15 min. [H*] can change from 





its normal value of 40 nM (pH = 7.4) to 18 nM (pH = 7.74). This 
rise in plasma pH (increase in alkalinity) is termed respiratory 
alkalosis. 

Hypoventilation is the opposite of hyperventilation and is char- 
acterized by an inability to excrete CO», rapidly enough to meet 
physiological needs. Hypoventilation can be caused by narcotics, 
sedatives, anesthetics, and depressant drugs; diseases of the lung 
also lead to hypoventilation. Hypoventilation results in respiratory 
acidosis, as CO»(g) accumulates, giving rise to HpCOs, which dis- 
sociates to form H* and HCOs;-. 





FIGURE 2.17 The structure of HEPES, 4-(2-hydroxy)-1- 
piperazine ethane sulfonic acid, in its fully protonated 








form.The pk, of the sulfonic acid group is about 3; the 


pK, of the piperazine-N*H is 7.55 at 20°C. 


TARA 
HO—CH;—CH;—NH N—CH—CH—SO;H 
A 
HEPES 


synthetic buffers to remedy this problem, and over the years the list has expanded 
so that a “good” selection is available. Some of these compounds are analogs of tris- 
hydroxymethyl aminomethane (Tris, see end-of-chapter Problem 19), such as tri- 
ethanolamine (TEA, see end-of-chapter Problem 18) or N,N-bis (2-hydroxyethyl) 
glycine (Bicine, see end-of-chapter Problems 8 and 20). Others are derivatives of 
Nethane sulfonic acids, such as HEPES (Figure 2.17). 


2:4: What Properties of Water Give It a Unique Role 
in the Environment? 


The remarkable properties of water render it particularly suitable to its unique role 
in living processes and the environment, and its presence in abundance favors the 
existence of life. Let’s examine water’s physical and chemical properties to see the 
extent to which they provide conditions that are advantageous to organisms. 

As a solvent, water is powerful yet innocuous. No other chemically inert solvent 
compares with water for the substances it can dissolve. Also, it is very important to 
life that water is a “poor” solvent for nonpolar substances. Thus, through hydropho- 
bic interactions, lipids coalesce, membranes form, boundaries are created delimiting 
compartments, and the cellular nature of life is established. Because of its very high 
dielectric constant, water is a medium for ionization. Ions enrich the living environ- 
ment in that they enhance the variety of chemical species and introduce an impor- 
tant class of chemical reactions. They provide electrical properties to solutions and 
therefore to organisms. Aqueous solutions are the prime source of ions. 

The thermal properties of water are especially relevant to its environmental fitness. 
It has great power as a buffer resisting thermal (temperature) change. Its heat capac- 
ity, or specific heat (4.1840 J/g°C), is remarkably high; it is ten times greater than 
iron, five times greater than quartz or salt, and twice as great as hexane. Its heat of fu- 
sion is 335 J/g. Thus, at 0°C, it takes a loss of 335 J to change the state of 1 g of HO 
from liquid to solid. Its heat of vaporization (2.24 kJ/g) is exceptionally high. These 
thermal properties mean that it takes substantial changes in heat content to alter the 
temperature and especially the state of water. Water’s thermal properties allow it to 
buffer the climate through such processes as condensation, evaporation, melting, and 
freezing. Furthermore, these properties allow effective temperature regulation in liv- 
ing organisms. For example, heat generated within an organism as a result of metab- 
olism can be efficiently eliminated through evaporation or conduction. The thermal 
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conductivity of water is very high compared with that of other liquids. The anomalous 
expansion of water as it cools to temperatures near its freezing point is a unique at- 
tribute of great significance to its natural fitness. As water cools, H bonding increases 
because the thermal motions of the molecules are lessened. H bonding tends to sep- 
arate the water molecules (Figure 2.2), thus decreasing the density of water. These 
changes in density mean that, at temperatures below 4°C, cool water rises and, most 
important, ice freezes on the surface of bodies of water, forming an insulating layer 
protecting the liquid water underneath. 

Water has the highest surface tension (75 dyne/cm) of all common liquids (ex- 
cept mercury). Together, surface tension and density determine how high a liquid 
rises in a capillary system. Capillary movement of water plays a prominent role in 
the life of plants. Last, consider osmosis as it relates to water and, in particular, the 
bulk movement of water in the direction from a dilute aqueous solution to a more 
concentrated one across a semipermeable boundary. Such bulk movements deter- 
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mine the shape and form of living things. 


Water is truly a crucial determinant of the fitness of the environment. In a very 


real sense, organisms are aqueous systems in a watery world. 


SUMMARY 


2.1 What Are the Properties of Water? Life depends on the unusual 
chemical and physical properties of H2O. Its high boiling point, melting 
point, heat of vaporization, and surface tension indicate that intermol- 
ecular forces of attraction between H,O molecules are high. Hydrogen 
bonds between adjacent water molecules are the basis of these forces. 
Liquid water consists of HO molecules held in a random, three- 
dimensional network that has a local preference for tetrahedral geome- 
try, yet contains a large number of strained or broken hydrogen bonds. 
The presence of strain creates a kinetic situation in which H,O mole- 
cules can switch H-bond allegiances; fluidity ensues. As kinetic energy 
decreases (the temperature falls), crystalline water (ice) forms. 

The solvent properties of water are attributable to the “bent” struc- 
ture of the water molecule and polar nature of its O—H bonds. To- 
gether these attributes yield a liquid that can form hydration shells 
around salt ions or dissolve polar solutes through H-bond interactions. 
Hydrophobic interactions in aqueous environments also arise as a con- 
sequence of polar interactions between water molecules. The polarity of 
the O—H bonds means that water also ionizes to a small but finite ex- 
tent to release H* and OH ions. Kw, the ion product of water, reveals 
that the concentration of [H+] and [OH™] at 25°C is 1077 M. 


2.2 What Is pH? pH is defined as —logıọ [H*]. pH is an important 
concept in biochemistry because the structure and function of biologi- 
cal molecules depend strongly on functional groups that ionize, or not, 
depending on small changes in [H*] concentration. Weak electrolytes 
are substances that dissociate incompletely in water. The behavior of 
weak electrolytes determines the concentration of [H*] and hence, pH. 
The Henderson-—Hasselbalch equation provides a general solution to 
the quantitative treatment of acid-base equilibria in biological systems. 


2.3 What Are Buffers, and What Do They Do? Buffers are solutions 
composed of a weak acid and its conjugate base. Such solutions can re- 
sist changes in pH when acid or base is added to the solution. Mainte- 


nance of pH is vital to all cells, and primary protection against harmful 
pH changes is provided by buffer systems. The buffer systems used by 
cells reflect a need for a pK, value near pH 7 and the compatibility of 
the buffer components with the metabolic apparatus of cells. The phos- 
phate buffer system and the histidine—-imidazole system are the two 
prominent intracellular buffers, whereas the bicarbonate buffer system 
is the principal extracellular buffering system in animals. 


2.4 What Properties of Water Give It a Unique Role in the Environ- 
ment? Life and water are inextricably related. Water is particularly 
suited to its unique role in living processes and the environment. As a sol- 
vent, water is powerful yet innocuous; no other chemically inert solvent 
compares with water for the substances it can dissolve. Also, water as a 
“poor” solvent for nonpolar substances gives rise to hydrophobic inter- 
actions, leading lipids to coalesce, membranes to form, and boundaries 
delimiting compartments to appear. 

Water is a medium for ionization. Ions enrich the living environment 
and introduce an important class of chemical reactions. Ions provide 
electrical properties to solutions and therefore to organisms. 

The thermal properties of water are especially relevant to its envi- 
ronmental fitness. It takes substantial changes in heat content to alter 
the temperature and especially the state of water. Water’s thermal 
properties allow it to buffer the climate through such processes as 
condensation, evaporation, melting, and freezing. Furthermore, wa- 
ter’s thermal properties allow effective temperature regulation in liv- 
ing organisms. 

Osmosis as it relates to water, and in particular, the bulk movement of 
water in the direction from a dilute aqueous solution to a more concen- 
trated one across semipermeable membranes, determines the shape and 
form of living things. In large degree, the properties of water define the 
fitness of the environment. Organisms are aqueous systems in a watery 
world. 


PROBLEMS 


CENGAGENOW" Preparing for an exam? Create your own study path for this 
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1. Calculate the pH of the following. 
a. 5 X 1074 M HCI d. 3 X 10°? M KOH 
b. 7 X 107” M NaOH e. 0.04 mM HCI 
c. 2 uM HCl f. 6 X 10°°M HCl 


2. Calculate the following from the pH values given in Table 2.3. 
a. [H*] in vinegar 
b. [H*] in saliva 
c. [H*] in household ammonia 
d. [OH™] in milk of magnesia 
e. [OH™] in beer 
f. [H*] inside a liver cell 
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3. 


10. 


11. 


12. 


13. 


14. 


15; 


The pH of a 0.02 M solution of an acid was measured at 4.6. 
a. What is the [H*] in this solution? 
b. Calculate the acid dissociation constant K, and pK, for this acid. 


. The K, for formic acid is 1.78 X 107+ M. 


a. What is the pH of a 0.1 M solution of formic acid? 

b. 150 mL of 0.1 M NaOH is added to 200 mL of 0.1 M formic acid, 
and water is added to give a final volume of 1 L. What is the pH of 
the final solution? 


. Given 0.1 M solutions of acetic acid and sodium acetate, describe 


the preparation of 1 L of 0.1 M acetate buffer at a pH of 5.4. 


. If the internal pH of a muscle cell is 6.8, what is the 


[HPO,?°]/[HsPO,"] ratio in this cell? 


. Given 0.1 M solutions of NasPO,4 and H3POy,, describe the prepara- 


tion of 1 L of a phosphate buffer at a pH of 7.5. What are the mo- 
lar concentrations of the ions in the final buffer solution, including 
Na* and H? 


. Bicine is a compound containing a tertiary amino group whose 
g g 


relevant pK, is 8.3 (Figure 2.17). Given 1 L of 0.05 M Bicine with 
its tertiary amino group in the unprotonated form, how much 
0.1 N HCI must be added to have a Bicine buffer solution of 
pH 7.5? What is the molarity of Bicine in the final buffer? What is 
the concentration of the protonated form of Bicine in this final 
buffer? 


. What are the approximate fractional concentrations of the follow- 


ing phosphate species at pH values of 0, 2, 4, 6, 8, 10, and 12? 

a. HPO, 

b. HPO, 

Cy HPO,- 

d. PO 

Citric acid, a tricarboxylic acid important in intermediary metabo- 
lism, can be symbolized as H;A. Its dissociation reactions are 





HsA — Ht + HA~ pki = 3.13 
HA- =— H+ + HA? = pK, = 4.76 
HA2- ——= H+ + A3- pK; = 6.40 








If the total concentration of the acid and its anion forms is 0.02 M, 
what are the individual concentrations of HA, HyA~, HA?~, and A?~ 
at pH 5.2? 

a. If50 mL of 0.01 M HCI is added to 100 mL of 0.05 M phosphate 
buffer at pH 7.2, what is the resultant pH? What are the concen- 
trations of HPO, and HPO,” in the final solution? 

b. If50 mL of 0.01 M NaOH is added to 100 mL of 0.05 M phosphate 
buffer at pH 7.2, what is the resultant pH? What are the concen- 
trations of HPO, and HPO,?~ in this final solution? 

At 37°C, if the plasma pH is 7.4 and the plasma concentration of 

HCO; is 15 mM, what is the plasma concentration of HCO? What 

is the plasma concentration of COs dissoivea)? If metabolic activity 

changes the concentration of COs dissoivea) to 3 mM and [HCO;] 
remains at 15 mM, what is the pH of the plasma? 


Draw the titration curve for anserine (Figure 2.16). The isoelectric 
point of anserine is the pH where the net charge on the molecule is 
zero; what is the isoelectric point for anserine? Given a 0.1 M 
solution of anserine at its isoelectric point and ready access to 0.1 M 
HCl, 0.1 M NaOH and distilled water, describe the preparation of 
1 L of 0.04 M anserine buffer solution, pH 7.2. 

Given a solution of 0.1 M HEPES in its fully protonated form, and 
ready access to 0.1 M HCl, 0.1 M NaOH and distilled water, de- 
scribe the preparation of 1 L of 0.025 M HEPES buffer solution, 
pH 7.8. 


A 100-g amount of a solute was dissolved in 1000 g of water. The 
freezing point of this solution was measured accurately and de- 
termined to be —1.12°C. What is the molecular weight of the 
solute? 
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16. 


17. 


18. 


19. 


20. 


Shown here is the structure of triethanolamine in its fully proto- 
nated form: 


CHyCH,OH 
| 
7 
HOCH,CHy — N — CHyCH,OH 
| 
H 


Its pK, is 7.8. You have available at your lab bench 0.1 M solutions 
of HCI, NaOH, and the uncharged (free base) form of triethanol- 
amine, as well as ample distilled water. Describe the preparation of 
a 1 L solution of 0.05 M triethanolamine buffer, pH 7.6. 


Trishydroxymethyl aminomethane (TRIS) is widely used for the 
preparation of buffers in biochemical research. Shown here is the 
structure of TRIS in its protonated form: 
TNH, 
HOCH, — C — CHOH 
| 
CHOH 


Its acid dissociation constant, K, is 8.32 X 107° M. You have available 
at your lab bench a 0.1 M solution of TRIS in its protonated form, 0.1 
Msolutions of HCI and NaOH, and ample distilled water. Describe the 
preparation of a 1 L solution of 0.02 M TRIS buffer, pH 7.8. 

Bicine (N, N—bis (2-hydroxyethyl) glycine) is another commonly 
used buffer in biochemistry labs (see problem 8). The structure of 
Bicine in its fully protonated form is shown below: 


CH,CH,OH 
| 

HN — CHCOOH 
| 


CH CHOH 


a. Draw the titration curve for Bicine, assuming the pK, for its free 
COOH group is 2.3 and the pK, for its tertiary amino group is 8.3. 

b. Draw the structure of the fully deprotonated form (completely dis- 
sociated form) of bicine. 

c. You have available a 0.1 M solution of Bicine at its isoelectric point 
(pH)), 0.1 Msolutions of HCl and NaOH, and ample distilled HO. 
Describe the preparation of 1 L of 0.04 M Bicine buffer, pH 7.5. 

d. What is the concentration of fully protonated form of Bicine in 
your final buffer solution? 

Hydrochloric acid is a significant component of gastric juice. What 

is the concentration of chloride ion in gastric juice if pH = 1.2? 

From the pK, for lactic acid given in Table 2.4, calculate the con- 

centration of lactate in blood plasma (pH = 7.4) if the concentra- 

tion of lactic acid is 1.5 uM. 


21. When a 0.1 M solution of a weak acid was titrated with base, the fol- 


lowing results were obtained: 
Equivalents of 





base added pH observed 
0.05 3.4 
0.15 3.9 
0.25 4.2 
0.40 4.5 
0.60 4.9 
0.75 5.2 
0.85 5:4 
0.95 6.0 


Plot the results of this titration and determine the pK, of the weak 
acid from your graph. 


22. The enzyme alcohol dehydrogenase catalyzes the oxidation of ethyl 
alcohol by NAD* to give acetaldehyde plus NADH and a proton: 


CH;CH,OH + NAD*——>CH;,CHO + NADH + H+ 


The rate of this reaction can be measured by following the change 
in pH. The reaction is run in 1 mL 10 mM TRIS buffer at pH 8.6. If 
the pH of the reaction solution falls to 8.4 after ten minutes, what 
is the rate of alcohol oxidation, expressed as nanomoles of ethanol 
oxidized per mL per sec of reaction mixture? 
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23. In light of the Human Biochemistry box on page 43, what would be 
the effect on blood pH if cellular metabolism produced a sudden 
burst of carbon dioxide? 


24. On the basis of Figure 2.12, what will be the pH of the acetate— 
acetic acid solution when the ratio of [acetate] /[acetic acid] is 10? 
a. 3.76 
b. 4.76 
c. 5.76 
d. 14.76 
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Thermodynamics 
of Biological Systems 





ESSENTIAL QUESTION 


Living things require energy. Movement, growth, synthesis of biomolecules, and the 
transport of ions and molecules across membranes all demand energy input. All or- 
ganisms must acquire energy from their surroundings and must utilize that energy 
efficiently to carry out life processes. To study such bioenergetic phenomena re- 
quires familiarity with thermodynamics. Thermodynamics also allows us to deter- 
mine whether chemical processes and reactions occur spontaneously. The student 
should appreciate the power and practical value of thermodynamic reasoning and 
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The sun is the source of energy for virtually all life. : a s 
We even harvest its energy R for eee realize that this is well worth the effort needed to understand it. 


using windmills driven by air heated by the sun. 


What are the laws and principles of thermodynamics that allow us to describe 
the flows and interchanges of heat, energy, and matter in biochemical systems? 





A theory is the more impressive the greater is the 
simplicity of its premises, the more different 
are the kinds of things it relates and the more 
extended is its range of applicability. 
Therefore, the deep impression which classical Even the most complicated aspects of thermodynamics are based ultimately on three 
thermodynamics made upon me. It is the only rather simple and straightforward laws. These laws and their extensions sometimes 
physical theory of universal content which I run counter to our intuition. However, once truly understood, the basic principles of 
e ced ha ET Tie PATE OE thermodynamics become powerful devices for sorting out complicated chemical and 
applicability of its basic concepts, will never be : : er ‘ ne 

biochemical problems. Once we reach this milestone in our scientific development, 

















overthrown. ; soje i ga za 
Pon thermodynamic thinking becomes an enjoyable and satisfying activity. 
o e Several basic thermodynamic principles are presented in this chapter, including 
the analysis of heat flow, entropy production, and free energy functions and the re- 
KEY QUESTIONS lationship between entropy and information. In addition, some ancillary concepts 
are considered, including the concept of standard states, the effect of pH on 
3.1 What Are the Basic Concepts of Thermo- standard-state free energies, the effect of concentration on the net free energy 
dynamics? change of a reaction, and the importance of coupled processes in living things. The 
3.2 What Is the Effect of Concentration on Net chapter concludes with a discussion of ATP and other energy-rich compounds. 
Free Energy Changes? 
3.3. What Is the Effect of pH on Standard-State 


Free Energies? è g 
E wcrc Bwl What Are the Basic Concepts of Thermodynamics? 


Us About Biochemical Events? 


3.5 What Are the Characteristics of High- 
Energy Biomolecules? 


In any consideration of thermodynamics, a distinction must be made between the 
system and the surroundings. The system is that portion of the universe with which 
ra we are concerned. It might be a mixture of chemicals in a test tube, or a single cell, 
3.6 What Are the Complex Equilibria Involved : i A : : ; ; 
in ATP Hetek? or an entire organism. The surroundings include everything else in the universe 
(Figure 3.1). The nature of the system must also be specified. There are three basic 
kinds of systems: isolated, closed, and open. An isolated system cannot exchange 
AGO Wiari tre Dely Humen Reaniraneni matter or energy with its surroundings. A closed system may exchange energy, but 
for ATP? not matter, with the surroundings. An open system may exchange matter, energy, or 
both with the surroundings. Living things are typically open systems that exchange 
matter (nutrients and waste products) and energy (heat from metabolism, for ex- 
ample) with their surroundings. 





3.7 Why Are Coupled Processes Important to 
Living Things? 





The First Law: The Total Energy of an Isolated System Is Conserved 


It was realized early in the development of thermodynamics that heat could be con- 
verted into other forms of energy and moreover that all forms of energy could ulti- 
mately be converted to some other form. The first law of thermodynamics states 
CENGAGENOW* Create your own study path for . z sn 
this chapter with tutorials, simulations, animations, that the total energy of an isolated system is conserved. Thermodynamicists have formu- 
and Active Figures at www.cengage.com/ login lated a mathematical function for keeping track of heat transfers and work expen- 
ditures in thermodynamic systems. This function is called the internal energy, com- 








3.1 What Are the Basic Concepts of Thermodynamics? 


Isolated system: Closed system: 


No exchange of matter or energy Energy exchange may occur 


Open system: 
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Energy exchange and/or matter exchange may occur 





Isolated system 





Surroundings Surroundings 














Surroundings 





Open system 











CENGAGENOW” ACTIVE FIGURE 3.1 The characteristics of isolated, closed, and open systems. Isolated systems 
exchange neither matter nor energy with their surroundings. Closed systems may exchange energy, but not 
matter, with their surroundings. Open systems may exchange either matter or energy with the surroundings. 
Test yourself on the concepts in this figure at www.cengage.com/login 


monly designated E or U, and it includes all the energies that might be exchanged 
in physical or chemical processes, including rotational, vibrational, and transla- 
tional energies of molecules and also the energy stored in covalent and noncovalent 
bonds. The internal energy depends only on the present state of a system and hence 
is referred to as a state function. The internal energy does not depend on how the 
system got there and is thus independent of path. An extension of this thinking is 
that we can manipulate the system through any possible pathway of changes, and as 
long as the system returns to the original state, the internal energy, E, will not have 
been changed by these manipulations. 

The internal energy, E, of any system can change only if energy flows in or out 
of the system in the form of heat or work. For any process that converts one state 
(state 1) into another (state 2), the change in internal energy, AF, is given as 


AF=F,-K=qtw (3.1) 


where the quantity qis the heat absorbed by the system from the surroundings and wis the work 
done on the system by the surroundings. Mechanical work is defined as movement through 
some distance caused by the application of a force. Both movement and force are required 
for work to have occurred. Examples of work done in biological systems include the 
flight of insects and birds, the circulation of blood by a pumping heart, the transmis- 
sion of an impulse along a nerve, and the lifting of a weight by someone who is exer- 
cising. On the other hand, if a person strains to lift a heavy weight but fails to move the 
weight at all, then, in the thermodynamic sense, no work has been done. (The energy 
expended in the muscles of the would-be weight lifter is given off in the form of heat.) 
In chemical and biochemical systems, work is often concerned with the pressure and 
volume of the system under study. The mechanical work done on the system is defined 
as w = — PAV, where P is the pressure and AV is the volume change and is equal to 
Və — Vi. When work is defined in this way, the sign on the right side of Equation 3.1 is 
positive. (Sometimes wis defined as work done by the system; in this case, the equation 
is AE = q— w.) Work may occur in many forms, such as mechanical, electrical, mag- 
netic, and chemical. AE, q, and w must all have the same units. The calorie, abbrevi- 
ated cal, and kilocalorie (kcal) have been traditional choices of chemists and bio- 
chemists, but the SI unit, the joule, is now recommended. 


Enthalpy Is a More Useful Function for Biological Systems 


If the definition of work is limited to mechanical work (w = —PAV) and no change 
in volume occurs, an interesting simplification is possible. In this case, AF is merely 
the heat exchanged at constant volume. This is so because if the volume is constant, no 
mechanical work can be done on or by the system. Then AE = q. Thus AF is a very 
useful quantity in constant volume processes. However, chemical and especially bio- 
chemical processes and reactions are much more likely to be carried out at constant 
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FIGURE 3.2 The enthalpy change, AH”, for a reaction 
can be determined from the slope of a plot of R In Keq 
versus 1/T.To illustrate the method, the values of the 
data points on either side of the 327.5 K (54.5°C) data 
point have been used to calculate AH? at 54.5°C. 
Regression analysis would normally be preferable. 
(Adapted from Brandts, J. F., 1964. The thermodynamics of pro- 
tein denaturation. |. The denaturation of chymotrypsinogen. 
Journal of the American Chemical Society 86:4291-4301.) 


pressure. In constant pressure processes, AF is not necessarily equal to the heat 
transferred. For this reason, chemists and biochemists have defined a function that 
is especially suitable for constant pressure processes. It is called the enthalpy, H, and 
it is defined as 


H=E+ PV (3.2) 


The clever nature of this definition is not immediately apparent. However, if the 
pressure is constant, then we have 


AH=AE+ PAV=q+ w+ PAV=q- PAV+ PAV=4q (3.3) 


So, AF is the heat transferred in a constant volume process, and AH is the heat 
transferred in a constant pressure process. 

Often, because biochemical reactions normally occur in liquids or solids rather 
than in gases, volume changes are typically quite small, and enthalpy and internal en- 
ergy are often essentially equal. 

In order to compare the thermodynamic parameters of different reactions, it is 
convenient to define a standard state. For solutes in a solution, the standard state is 
normally unit activity (often simplified to 1 M concentration). Enthalpy, internal 
energy, and other thermodynamic quantities are often given or determined for 
standard-state conditions and are then denoted by a superscript degree sign (“°”), 
as in AH’, AE’, and so on. 

Enthalpy changes for biochemical processes can be determined experimentally 
by measuring the heat absorbed (or given off) by the process in a calorimeter (a re- 
action vessel that can be used to measure the heat evolved by a reaction). Alterna- 
tively, for any process A>—B at equilibrium, the standard-state enthalpy change 
for the process can be determined from the temperature dependence of the equi- 
librium constant: 


d(In Keq) 
VT) 


Here Ris the gas constant, defined as R = 8.314 J/mol - K. A plot of R(In Keq) ver- 
sus 1/T is called a van’t Hoff plot. The example below demonstrates how a van’t 
Hoff plot is constructed and how the enthalpy change for a reaction can be deter- 
mined from the plot itself. 


AH? = (3.4) 





| EXAMPLE | In a study! of the temperature-induced reversible denaturation of the 
protein chymotrypsinogen, 


Native state (N) — denatured state (D) 
Keg = [D]/IN] 


John F. Brandts measured the equilibrium constants for the denaturation over a 
range of pH and temperatures. The data for pH 3: 


T(K): 324.4 326.1 327.5 329.0 330.7 332.0 333.8 
Keg: 0.041 0.12 0.27 0.68 1.9 5.0 21 


A plot of R(In Keq) versus 1/T (a van’t Hoff plot) is shown in Figure 3.2. A H° for 
the denaturation process at any temperature is the negative of the slope of the plot 
at that temperature. As shown, A H° at 54.5°C (327.5 K) is 


AH’ = —[-3.2 —(—17.6)]/[(3.04 — 3.067) X 107°] = +533 kJ/mol 


What does this value of AH° mean for the unfolding of the protein? Positive values 
of A H° would be expected for the breaking of hydrogen bonds as well as for the ex- 
posure of hydrophobic groups from the interior of the native, folded protein during 
the unfolding process. Such events would raise the energy of the protein solution. 
The magnitude of this enthalpy change (533 kJ/mol) at 54.5°C is large, compared 


'Brandts, J. F., 1964. The thermodynamics of protein denaturation. I. The denaturation of chymo- 
trypsinogen. Journal of the American Chemical Society 86:4291-4301. 








3.1. What Are the Basic Concepts of Thermodynamics? 


to similar values of A H° for other proteins and for this same protein at 25°C (Table 
3.1). If we consider only this positive enthalpy change for the unfolding process, the 
native, folded state is strongly favored. As we shall see, however, other parameters 
must be taken into account. 









TABLE 3.1 l 








Protein AH° AS? AG° ACp 
(and conditions) kJ/mol kJ/mol « K kJ/mol kJ/mol - K 
Chymotrypsinogen 164 0.440 31.0 10.9 
(pH 3, 25°C) 

B-Lactoglobulin —88 —0.300 2.5 9.0 
(5 Murea, pH 3, 25°C) 

Myoglobin 180 0.400 57.0 5.9 
(pH 9, 25°C) 

Ribonuclease 240 0.780 3.8 8.4 
(pH 2.5, 30°C) 











Adapted from Cantor, C., and Schimmel, P., 1980. Biophysical Chemistry. San Francisco: W.H. Freeman; and Tanford, C., 1968. 
Protein denaturation. Advances in Protein Chemistry 23:121-282. 


The Second Law: Systems Tend Toward Disorder and Randomness 


The second law of thermodynamics has been described and expressed in many dif- 
ferent ways, including the following: 


1. Systems tend to proceed from ordered (low-entropy or low-probability) states to disor- 
dered (high-entropy or high-probability) states. 

2. The entropy of the system plus surroundings is unchanged by reversible processes; 
the entropy of the system plus surroundings increases for irreversible processes. 

3. All naturally occurring processes proceed toward equilibrium, that is, to a state 
of minimum potential energy. Energy flows spontaneously so as to become dif- 
fused or dispersed or spread out. Energy dispersal results in entropy increase. 


Several of these statements of the second law invoke the concept of entropy, which 
is a measure of disorder and randomness in the system (or the surroundings). An 
organized or ordered state is a low-entropy state, whereas a disordered state is a 
high-entropy state. All else being equal, reactions involving large, positive entropy 
changes, AS, are more likely to occur than reactions for which AS is not large and 
positive. 


S=knw (3.5) 
and 
AS=kin Wanal — kin Wada (3.6) 


where Wina and Wyiia are the final and initial number of microstates, respectively, 
and where k is Boltzmann’s constant (k = 1.38 X 107” J/K). 

Seen in this way, entropy represents energy dispersion—the dispersion of energy 
among a large number of molecular motions relatable to quantized states (mi- 
crostates). An increase in entropy is just an increase in the number of microstates 
in any macrostate. On the other hand, if only one microstate corresponds to a given 
macrostate, then the system has no freedom to choose its microstate—and it has 
zero entropy. This definition is useful for statistical calculations (in fact, it is a foun- 
dation of statistical thermodynamics), but a more common form relates entropy to the 
heat transferred in a process: 


d 
AS yeversible = F (3.7) 
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A DEEPER LOOK 


Entropy, Information, and the Importance of “Negentropy” 





When a thermodynamic system undergoes an increase in entropy, it 


becomes more disordered. On the other hand, a decrease in en- N a m =, é 
tropy reflects an increase in order. A more ordered system is more s S P e~ h g 
highly organized and possesses a greater information content. To e re Joy Yoi 
appreciate the implications of decreasing the entropy of a system, ei t jê pi. 
consider the random collection of letters in the figure. This disor- as eo x f E a 
ganized array of letters possesses no inherent information content, i a a S $ 

and nothing can be learned by its perusal. On the other hand, this pi £ _ . i oh wt 


particular array of letters can be systematically arranged to construct 
the first sentence of the Einstein quotation that opened this chap- 


ter: “A theory is the more impressive the greater is the simplicity of i d © m a oq © a g 
its premises, the more different are the kinds of things it relates and p š ap t f., 
the more extended is its range of applicability.” t h i p e tow 
Arranged in this way, this same collection of 151 letters possesses vo € e . i o ® 
enormous information content—the profound words of a great sci- P a ey < 
entist. Just as it would have required significant effort to rearrange t e oF. 5 ee r . 
these 151 letters in this way, so large amounts of energy are re- a T pä E i ” 
quired to construct and maintain living organisms. Energy input is m e “p £ ~ o 
required to produce information-rich, organized structures such as mé a » t 5 t e 
proteins and nucleic acids. Information content can be thought of e e n a h 5 
as negative entropy. In 1945 Erwin Schrödinger took time out from i , »b y é ae o e 
a ii n 


his studies of quantum mechanics to publish a delightful book ti- 
tled What Is Life? In it, Schrödinger coined the term negentropy to 
describe the negative entropy changes that confer organization 
and information content to living organisms. Schrödinger pointed 
out that organisms must “acquire negentropy” to sustain life. 











CENGAGENOW” See this figure animated at 
www.cengage.com/login 











where dSreversible iS the entropy change of the system in a reversible? process, qis the 
heat transferred, and Tis the temperature at which the heat transfer occurs. 

Equation 3.6 says that entropy change measures the dispersal of energy in a 
process. When Whyitia is less than Wina energy is dispersed from a small number of 
microstates (Wpita) to a larger number of microstates (Wana), and AS is a positive 
quantity. That is, dispersal of energy into a larger number of microstates results in an 
increase in entropy. 


The Third Law: Why Is “Absolute Zero” So Important? 


The third law of thermodynamics states that the entropy of any crystalline, perfectly 
ordered substance must approach zero as the temperature approaches 0 K, and, at 
T = 0K, entropy is exactly zero. Based on this, it is possible to establish a quantitative, 
absolute entropy scale for any substance as 


T 
=f Cpdln T (3.8) 
0 


where Cp is the heat capacity at constant pressure. The heat capacity of any substance is 
the amount of heat 1 mole ofit can store as the temperature of that substance is raised 
by 1 degree. For a constant pressure process, this is described mathematically as 


dH 


Cp = — 
P? dT 


(3.9) 
If the heat capacity can be evaluated at all temperatures between 0 K and the tem- 
perature of interest, an absolute entropy can be calculated. For biological processes, 
entropy changes are more useful than absolute entropies. The entropy change for a 
process can be calculated if the enthalpy change and /ree energy change are known. 


2A reversible process is one that can be reversed by an infinitesimal modification of a variable. 


3.1 What Are the Basic Concepts of Thermodynamics? 


Free Energy Provides a Simple Criterion for Equilibrium 


An important question for chemists, and particularly for biochemists, is, “Will the 
reaction proceed in the direction written?” J. Willard Gibbs, one of the founders of 
thermodynamics, realized that the answer to this question lay in a comparison of 
the enthalpy change and the entropy change for a reaction at a given temperature. 
The Gibbs free energy, G, is defined as 
G=H-TS (3.10) 

For any process A—= 
is given by 





B at constant pressure and temperature, the free energy change 


AG=AH-— TAS (3.11) 


If A Gis equal to 0, the process is at equilibrium and there is no net flow either in the 
forward or reverse direction. When AG = 0, AS = AH/T and the enthalpic and en- 
tropic changes are exactly balanced. Any process with a nonzero AG proceeds spon- 
taneously to a final state of lower free energy. If AGis negative, the process proceeds 
spontaneously in the direction written. If A Gis positive, the reaction or process pro- 
ceeds spontaneously in the reverse direction. (The sign and value of AG do not al- 
low us to determine how fast the process will go.) If the process has a negative AG, it 
is said to be exergonic, whereas processes with positive AG values are endergonic. 


The Standard-State Free Energy Change The free energy change, AG, for any re- 
action depends upon the nature of the reactants and products, but it is also affected 
by the conditions of the reaction, including temperature, pressure, pH, and the 
concentrations of the reactants and products. As explained earlier, it is useful to de- 
fine a standard state for such processes. If the free energy change for a reaction is 
sensitive to solution conditions, what is the particular significance of the standard- 
state free energy change? To answer this question, consider a reaction between two 
reactants A and B to produce the products C and D. 





A+B—cC+D (3.12) 


The free energy change for non-standard-state concentrations is given by 


C] [D 
AG=AG° + RTIn a (3.13) 
At equilibrium, AG = 0 and [C][D]/[A][B] = Keq. We then have 
AG? = -RTIn Kea (3.14) 
or, in base 10 logarithms, 
AG? = —2.3RT logio Keq (3.15) 
This can be rearranged to 
Keq = 10740°/23RT (3.16) 


In any of these forms, this relationship allows the standard-state free energy change 
for any process to be determined if the equilibrium constant is known. More im- 
portant, it states that the point of equilibrium for a reaction in solution is a function of the 
standard-state free energy change for the process. That is, A G° is another way of writing an 
equilibrium constant. 





EWI The equilibrium constants determined by Brandts at several temper- 
atures for the denaturation of chymotrypsinogen (see previous example) can be 
used to calculate the free energy changes for the denaturation process. For exam- 
ple, the equilibrium constant at 54.5°C is 0.27, so 


AG? = — (8.314 J/mol : K) (327.5 K) In (0.27) 


AG° = —(2.72 kJ/mol) In (0.27) 
AG = 3.56 kJ/mol 
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AG? (kJ/mol) 
© 
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FIGURE 3.3 The dependence of AG° on temperature for 
the denaturation of chymotrypsinogen. (Adapted from 
Brandts, J. F., 1964. The thermodynamics of protein denaturation. 
|. The denaturation of chymotrypsinogen. Journal of the Ameri- 
can Chemical Society 86:4291-4301.) 
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FIGURE 3.4 The dependence of AS° on temperature for 
the denaturation of chymotrypsinogen. (Adapted from 
Brandts, J. F., 1964. The thermodynamics of protein denaturation. 
|. The denaturation of chymotrypsinogen. Journal of the Ameri- 
can Chemical Society 86:4291-4301.) 


CENGAGENOW* Go to CengageNOW at www 
.cengage.com/login and click BiochemistrylInteractive 
to see the relationships between free energies and 
the following: changes to temperature, equilibrium 
constants, and concentrations of reactants and 
products. 





The positive sign of A G° means that the unfolding process is unfavorable; that is, 
the stable form of the protein at 54.5°C is the folded form. On the other hand, the 
relatively small magnitude of AG? means that the folded form is only slightly fa- 
vored. Figure 3.3 shows the dependence of AG° on temperature for the denatura- 
tion data at pH 3 (from the data given in the example on page 50). 

Having calculated both A H° and A G° for the denaturation of chymotrypsinogen, 
we can also calculate A S°, using Equation 3.11: 


(AG - AH’) 
T 








AS? (3.17) 


At 54.5°C (327.5 K), 


AS? 
AS? 


— (3560 — 533,000 J/mol) /327.5 K 
1620 J/mol : K 


Figure 3.4 presents the dependence of A S° on temperature for chymotrypsinogen 
denaturation at pH 3. A positive A S° indicates that the protein solution has become 
more disordered as the protein unfolds. Comparison of the value of 1.62 kJ/mol - K 
with the values of A S° in Table 3.1 shows that the present value (for chymotrypsino- 
gen at 54.5°C) is quite large. The physical significance of the thermodynamic param- 
eters for the unfolding of chymotrypsinogen becomes clear later in this chapter. 


302» What Is the Effect of Concentration on Net 
Free Energy Changes? 


Equation 3.13 shows that the free energy change for a reaction can be very differ- 
ent from the standard-state value if the concentrations of reactants and products 
differ significantly from unit activity (1 M for solutions). The effects can often be 
dramatic. Consider the hydrolysis of phosphocreatine: 


Phosphocreatine + HO —> creatine + P; (3.18) 


This reaction is strongly exergonic, and A G° at 37°C is —42.8 kJ/mol. Physiological 
concentrations of phosphocreatine, creatine, and inorganic phosphate are nor- 
mally between 1 and 10 mM. Assuming 1 mM concentrations and using Equation 
3.13, the AG for the hydrolysis of phosphocreatine is 


(3.19) 


[0.001] [0.001] 
AG= —42.8 kJ/mol + (8.314 J/mol - K) (310 K) In Sacre 


[0.001] 


AG = —60.5 kJ/mol (3.20) 


At 37°C, the difference between standard-state and 1 mM concentrations for such a 
reaction is thus approximately — 17.7 kJ/mol. 


Br3m What Is the Effect of pH on Standard-State 
Free Energies? 


For biochemical reactions in which hydrogen ions (H*) are consumed or pro- 
duced, the usual definition of the standard state is awkward. Standard state for the 
H* ion is 1 M, which corresponds to pH 0. At this pH, nearly all enzymes would be 
denatured and biological reactions could not occur. It makes more sense to use free 
energies and equilibrium constants determined at pH 7. Biochemists have thus 
adopted a modified standard state, designated with prime (') symbols, as in A G°’, 
Keq’, AH?', and so on. For values determined in this way, a standard state of 107-7 M 
H* and unit activity (1 M for solutions, 1 atm for gases and pure solids defined as 
unit activity) for all other components (in the ionic forms that exist at pH 7) is as- 





3.4 What Can Thermodynamic Parameters Tell Us About Biochemical Events? 55 
sumed. The two standard states can be related easily. For a reaction in which H* is 
produced, 
A—>B + H* (3.21) 
the relation of the equilibrium constants for the two standard states is 
Keg = Keq’ [H*] (3.22) 
and A G°’ is given by 
AG” =AG° + RT In [H*] (3.23) 
For a reaction in which H* is consumed, 
A` + Ht—>B (3.24) 


the equilibrium constants are related by 





— Keq! 
Kq = HA (3.25) 
and A G°’ is given by 
1 
AG” =AG° + RTIn (ma) = AG° — RTIn [H*] (3.26) 


Br4= What Can Thermodynamic Parameters Tell Us 
About Biochemical Events? 


The best answer to this question is that a single parameter (AH or AS, for example) 
is not very meaningful. A positive A H° for the unfolding of a protein might reflect 
either the breaking of hydrogen bonds within the protein or the exposure of hy- 
drophobic groups to water (Figure 3.5). However, comparison of several thermodynamic 
parameters can provide meaningful insights about a process. For example, the transfer of 
Na* and Cl” ions from the gas phase to aqueous solution involves a very large neg- 
ative AH’ (thus a very favorable stabilization of the ions) and a comparatively small 
AS° (Table 3.2). The negative entropy term reflects the ordering of water molecules 
in the hydration shells of the Na* and Cl” ions. The unfavorable — TAS contribu- 
tion is more than offset by the large heat of hydration, which makes the hydration 
of ions a very favorable process overall. The negative entropy change for the disso- 
ciation of acetic acid in water also reflects the ordering of water molecules in the 
ion hydration shells. In this case, however, the enthalpy change is much smaller in 
magnitude. As a result, A G° for dissociation of acetic acid in water is positive, and 
acetic acid is thus a weak (largely undissociated) acid. 


Folded 






CENGAGENOW” ANIMATED FIGURE 3.5 Unfolding of a 
soluble protein exposes significant numbers of nonpo- 
lar groups to water, forcing order on the solvent and 
resulting in a negative AS° for the unfolding process. 
Orange spheres represent nonpolar groups; blue 
spheres are polar and/or charged groups. See this 
figure animated at www.cengage.com/login 
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TABLE 3.2 [fj 

AH° AS° AG° ACp 
Process kJ/mol kJ/mol « K kJ/mol kJ/mol - K 
Hydration of ionst 
Nat (g) + CI (g)—> Nat (aq) + Cl (aq) —760.0 —0.185 —705.0 
Dissociation of ions in solution? 
HO + CHz;COOH—>H;0+ + CH3;COO- —10.3 —0.126 27.26 —0.143 
Transfer of hydrocarbon from pure liquid to water? 
Toluene (in pure toluene) —— toluene (aqueous) 1.72 —0.071 22.7 0.265 











*All data collected for 25°C. 
tBerry, R. S., Rice, S. A., and Ross, J., 1980. Physical Chemistry. New York: John Wiley. 
*Tanford, C., 1980. The Hydrophobic Effect. New York: John Wiley. 


The transfer of a nonpolar hydrocarbon molecule from its pure liquid to water 
is an appropriate model for the exposure of protein hydrophobic groups to solvent 
when a protein unfolds. The transfer of toluene from liquid toluene to water in- 
volves a negative A S°, a positive A G°, and a AH” that is small compared to AG’ (a 
pattern similar to that observed for the dissociation of acetic acid). What distin- 
guishes these two very different processes is the change in heat capacity (Table 3.2). A posi- 
tive heat capacity change for a process indicates that the molecules have acquired 
new ways to move (and thus to store heat energy). A negative A Cp means that the 
process has resulted in less freedom of motion for the molecules involved. A Cp is 
negative for the dissociation of acetic acid and positive for the transfer of toluene 
to water. The explanation is that polar and nonpolar molecules both induce organi- 
zation of nearby water molecules, but in different ways. The water molecules near a 
nonpolar solute are organized but labile. Hydrogen bonds formed by water molecules 
near nonpolar solutes rearrange more rapidly than the hydrogen bonds of pure wa- 
ter. On the other hand, the hydrogen bonds formed between water molecules near 
an ion are less labile (rearrange more slowly) than they would be in pure water. This 
means that ACp should be negative for the dissociation of ions in solution, as ob- 
served for acetic acid (Table 3.2). 


3x50 What Are the Characteristics of High-Energy 
Biomolecules? 


Virtually all life on earth depends on energy from the sun. Among life forms, there 
is a hierarchy of energetics: Certain organisms capture solar energy directly, 
whereas others derive their energy from this group in subsequent processes. Or- 
ganisms that absorb light energy directly are called phototrophic organisms. These 
organisms store solar energy in the form of various organic molecules. Organisms 
that feed on these latter molecules, releasing the stored energy in a series of oxida- 
tive reactions, are called chemotrophic organisms. Despite these differences, both 
types of organisms share common mechanisms for generating a useful form of 
chemical energy. Once captured in chemical form, energy can be released in con- 
trolled exergonic reactions to drive a variety of life processes (which require en- 
ergy). A small family of universal biomolecules mediates the flow of energy from ex- 
ergonic reactions to the energy-requiring processes of life. These molecules are the 
reduced coenzymes and the high-energy phosphate compounds. Phosphate compounds are 
considered high energy if they exhibit large negative free energies of hydrolysis 
(that is, if A G°’ is more negative than —25 kJ/mol). 

Table 3.3 lists the most important members of the high-energy phosphate com- 
pounds. Such molecules include phosphoric anhydrides (ATP, ADP), an enol phosphate 
(PEP), acyl phosphates (such as acetyl phosphate), and guanidino phosphates (such as 
creatine phosphate). Also included are thioesters, such as acetyl-CoA, which do not 
contain phosphorus, but which have a large negative free energy of hydrolysis. As 

















3.5 What Are 
TABLE 3.3 | 
Compound and Hydrolysis Reaction AG” (kJ/mol) Structure 
Phosphoenolpyruvate—> pyruvate + P; — 62.2 Figure 3.13 
1,3-Bisphosphoglycerate—— 3-phosphoglycerate + P; —49.6 Figure 3.12 
Creatine phosphate—— creatine + P; —43.3 Figure 13.21 
Acetyl phosphate—— acetate + P; —43.3 Figure 3.12 
Adenosine-5'-triphosphate— > ADP + P; — 35.71 Figure 3.9 
Adenosine-5'-triphosphate—- ADP + P; — 30.5 Figure 3.9 
(with excess Mg?*) 
Adenosine-5'-diphosphate—> AMP + P; =35.7 Figure 3.11 
Pyrophosphate—> P; + P; (in 5 mM Mg**) —33.6 Figure 3.10 
Adenosine-5'-triphosphate—> AMP + PP; (excess Mg”*) — 32.3 Figure 10.11 
Uridine diphosphoglucose—> UDP + glucose =$1.9 Figure 22.10 
Acetyl-coenzyme A—> acetate + CoA = 315 page 570 
S-adenosylmethionine—— methionine + adenosine —25.6 Figure 25.28 
Glucose-1-phosphate—— glucose + P; =21.0 Figure 7.13 
Sn-Glycerol-3-phosphate—— glycerol + P; — 9:2 Figure 8.5 
Adenosine-5'-monophosphate——> adenosine + P; 9:2 Figure 10.11 











*Adapted primarily from Handbook of Biochemistry and Molecular Biology, 1976, 3rd ed. In Physical and Chemical Data, 
G. Fasman, ed., Vol. 1, pp. 296-304. Boca Raton, FL: CRC Press. 

tFrom Gwynn, R. W., and Veech, R. L., 1973. The equilibrium constants of the adenosine triphosphate hydrolysis and the 
adenosine triphosphate-citrate lyase reactions. Journal of Biological Chemistry 248:6966-6972. 

*From Mudd, H., and Mann, J., 1963. Activation of methionine for transmethylation. Journal of Biological Chemistry 
238:2164-2170. 


noted earlier, the exact amount of chemical free energy available from the hydrol- 
ysis of such compounds depends on concentration, pH, temperature, and so on, 
but the AG” values for hydrolysis of these substances are substantially more nega- 
tive than those for most other metabolic species. Two important points: First, high- 
energy phosphate compounds are not long-term energy storage substances. They 
are transient forms of stored energy, meant to carry energy from point to point, 
from one enzyme system to another, in the minute-to-minute existence of the cell. 
(As we shall see in subsequent chapters, other molecules bear the responsibility for 
long-term storage of energy supplies.) Second, the term high-energy compound should 
not be construed to imply that these molecules are unstable and hydrolyze or de- 
compose unpredictably. ATP, for example, is quite a stable molecule. A substantial 
activation energy must be delivered to ATP to hydrolyze the terminal, or y, phosphate 
group. In fact, as shown in Figure 3.6, the activation energy that must be absorbed 
by the molecule to break the O—P, bond is normally 200 to 400 kJ/mol, which is 
substantially larger than the net 30.5 kJ/mol released in the hydrolysis reaction. Bio- 
chemists are much more concerned with the net release of 30.5 kJ/mol than with the 
activation energy for the reaction (because suitable enzymes cope with the latter). 
The net release of large quantities of free energy distinguishes the high-energy 
phosphoric anhydrides from their “low-energy” ester cousins, such as glycerol-3- 
phosphate (Table 3.3). The next section provides a quantitative framework for un- 
derstanding these comparisons. 


ATP Is an Intermediate Energy-Shuttle Molecule 


One last point about Table 3.3 deserves mention. Given the central importance of 
ATP asa high-energy phosphate in biology, students are sometimes surprised to find 
that ATP holds an intermediate place in the rank of high-energy phosphates. PEP, 
1,3-BPG, creatine phosphate, acetyl phosphate, and pyrophosphate all exhibit 
higher values of A G°’. This is not a biological anomaly. ATP is uniquely situated be- 
tween the very-high-energy phosphates synthesized in the breakdown of fuel mole- 
cules and the numerous lower-energy acceptor molecules that are phosphorylated 


he Characteristics of High-Energy Biomolecules? 
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FIGURE 3.6 The activation energies for phosphoryl 
group transfer reactions (200 to 400 kJ/mol) are sub- 
stantially larger than the free energy of hydrolysis of ATP 
(—30.5 kJ/mol). 





— Phosphoryl group Products 
transfer potential 
= —-30.5 kJ/mol 


in the course of further metabolic reactions. ADP can accept both phosphates and 
energy from the higher-energy phosphates, and the ATP thus formed can donate 
both phosphates and energy to the lower-energy molecules of metabolism. The 
ATP/ADP pair is an intermediately placed acceptor/donor system among high- 
energy phosphates. In this context, ATP functions as a very versatile but intermedi- 
ate energy-shuttle device that interacts with many different energy-coupling en- 
zymes of metabolism. 


Group Transfer Potentials Quantify the Reactivity 
of Functional Groups 


Many reactions in biochemistry involve the transfer of a functional group from a 
donor molecule to a specific receptor molecule or to water. The concept of group 
transfer potential explains the tendency for such reactions to occur. Biochemists de- 
fine the group transfer potential as the free energy change that occurs upon hy- 
drolysis, that is, upon transfer of the particular group to water. This concept and its 
terminology are preferable to the more qualitative notion of high-energy bonds. 

The concept of group transfer potential is not particularly novel. Other kinds of 
transfer (of hydrogen ions and electrons, for example) are commonly characterized 
in terms of appropriate measures of transfer potential (pK, and reduction poten- 
tial, é., respectively). As shown in Table 3.4, the notion of group transfer is fully 
analogous to those of ionization potential and reduction potential. The similarity is 
anything but coincidental, because all of these are really specific instances of free 
energy changes. If we write 


AH—> A5 + H* (3.27a) 


we really don’t mean that a proton has literally been removed from the acid AH. In 
the gas phase at least, this would require the input of approximately 1200 kJ/mol! 
What we really mean is that the proton has been transferred to a suitable acceptor 
molecule, usually water: 


AH + HO—> A~ + H;O+ (3.27b) 





The appropriate free energy relationship is of course 


AG 


~ 9303 RT (3.28) 


pKa 
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TABLE 3.4 T 

Proton Transfer Potential Standard Reduction Potential Group Transfer Potential 

(Acidity) (Electron Transfer Potential) (High-Energy Bond) 
Simple equation AH === A5 + H+ AS=At + & A~ PA +P; 
Equation including acceptor AH + HO == A+ Ht== A ~ PO, + HO == 

A` + H,0+ At + 3H, A—H + HPO- 
. AG’? —AG° —AG° 

Measure of transfer potential pk, = 2303 RT AG, = nF In Keg = RT 
Free energy change of transfer AG’ per mole of AG” per mole of AG’ per mole of phosphate 
is given by: H* transferred e transferred transferred 








Adapted from: Klotz, |. M., 1986. Introduction to Biomolecular Energetics. New York: Academic Press. 


Similarly, in the case of an oxidation-reduction reaction 
A—AtT +e (3.29a) 


we don’t really mean that A oxidizes independently. What we really mean (and what 
is much more likely in biochemical systems) is that the electron is transferred to a 
suitable acceptor: 





A+ Ht—>A*+ + 5H, (3.29b) 
and the relevant free energy relationship is 
CC eee 3.30 


where n is the number of equivalents of electrons transferred and F is Faraday’s 
constant. 

Similarly, the release of free energy that occurs upon the hydrolysis of ATP and 
other “high-energy phosphates” can be treated quantitatively in terms of group trans- 
Jer. Itis common to write for the hydrolysis of ATP 


ATP + H»O—-> ADP + P; (3.31) 

The free energy change, which we henceforth call the group transfer potential, is 
given by 

AG? = —RTIn Key (3.32) 
where Keq is the equilibrium constant for the group transfer, which is normally 
written as 

g — _[ADPIIP] mae 

“ [ATP] [H:O] l 


Even this set of equations represents an approximation, because ATP, ADP, and P; 
all exist in solutions as a mixture of ionic species. This problem is discussed in a 
later section. For now, it is enough to note that the free energy changes listed in 
Table 3.3 are the group transfer potentials observed for transfers to water. 


The Hydrolysis of Phosphoric Acid Anhydrides Is Highly Favorable 


ATP contains two pyrophosphoryl or phosphoric acid anhydride linkages, as shown in 
Figure 3.7. Other common biomolecules possessing phosphoric acid anhydride 
linkages include ADP, GTP, GDP and the other nucleoside diphosphates and 
triphosphates, sugar nucleotides such as UDP-glucose, and inorganic pyrophos- 
phate itself. All exhibit large negative free energies of hydrolysis, as shown in 
Table 3.3. The chemical reasons for the large negative AG°’ values for the hydrol- 
ysis reactions include destabilization of the reactant due to bond strain caused by 
electrostatic repulsion, stabilization of the products by ionization and resonance, 
and entropy factors due to hydrolysis and subsequent ionization. 
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CENGAGENOW” ACTIVE FIGURE 3.7 The triphosphate 
chain of ATP contains two pyrophosphate linkages, both 
of which release large amounts of energy upon hydroly- 
sis. Test yourself on the concepts in this figure at 
www.cengage.com/login 
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Destabilization Due to Electrostatic Repulsion Electrostatic repulsion in the 
reactants is best understood by comparing these phosphoric anhydrides with 
other reactive anhydrides, such as acetic anhydride. As shown in Figure 3.8a, the 
electronegative carbonyl oxygen atoms withdraw electrons from the C=O bonds, 
producing partial negative charges on the oxygens and partial positive charges on 
the carbonyl carbons. Each of these electrophilic carbonyl carbons is further 
destabilized by the other acetyl group, which is also electron-withdrawing in na- 
ture. As a result, acetic anhydride is unstable with respect to the products of 
hydrolysis. 

The situation with phosphoric anhydrides is similar. The phosphorus atoms of 
the pyrophosphate anion are electron-withdrawing and destabilize PP; with respect 
to its hydrolysis products. Furthermore, the reverse reaction, reformation of the an- 
hydride bond from the two anionic products, requires that the electrostatic repul- 
sion between these anions be overcome (see following). 


Stabilization of Hydrolysis Products by lonization and Resonance The pyro- 
phosphoryl moiety possesses two negative charges at pH values above 7.5 or so 
(Figure 3.8a). The hydrolysis products, two phosphate esters, each carry about two 
negative charges at pH values above 7.2. The increased ionization of the hydrolysis 
products helps stabilize the electrophilic phosphorus nuclei. 

Resonance stabilization in the products is best illustrated by the reactant anhy- 
drides (Figure 3.8b). The unpaired electrons of the bridging oxygen atom in acetic 
anhydride (and phosphoric anhydride) cannot participate in resonance structures 
with both electrophilic centers at once. This competing resonance situation is re- 
lieved in the product acetate or phosphate molecules. 


Entropy Factors Arising from Hydrolysis and lonization For the phosphoric 
anhydrides, and for most of the high-energy compounds discussed here, there is 
an additional “entropic” contribution to the free energy of hydrolysis. Most of the 
hydrolysis reactions of Table 3.3 result in an increase in the number of molecules 
in solution. As shown in Figure 3.9, the hydrolysis of ATP (at pH values above 7) 
creates three species—ADP, inorganic phosphate (P;), and a hydrogen ion—from 
only two reactants (ATP and HO). The entropy of the solution increases because 
the more particles, the more disordered the system.? (This effect is ionization- 
dependent because, at low pH, the hydrogen ion created in many of these reac- 
tions simply protonates one of the phosphate oxygens, and one fewer “particle” re- 
sults from the hydrolysis.) 


3Imagine the “disorder” created by hitting a crystal with a hammer and breaking it into many small 
pieces. 
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Acetic anhydride: Phosphoric anhydrides: 
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(b) 


Competing resonance in acetic anhydride: 
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These can only occur alternately 


Simultaneous resonance in the hydrolysis products: 


These resonances can occur simultaneously 


CENGAGENOW” ACTIVE FIGURE 3.8 (a) Electrostatic repulsion between adjacent partial positive charges (on 
carbon and phosphorus, respectively) is relieved upon hydrolysis of the anhydride bonds of acetic anhydride 
and phosphoric anhydrides. (b) The competing resonances of acetic anhydride and the simultaneous reso- 
nance forms of the hydrolysis product, acetate. Test yourself on the concepts in this figure at www 
.cengage.com/login 


The Hydrolysis AG°’ of ATP and ADP Is Greater Than That of AMP 


The concepts of destabilization of reactants and stabilization of products described 
for pyrophosphate also apply for ATP and other phosphoric anhydrides (Figure 
3.9). ATP and ADP are destabilized relative to the hydrolysis products by electro- 
static repulsion, competing resonance, and entropy. AMP, on the other hand, is a 
phosphate ester (not an anhydride) possessing only a single phosphoryl group and 
is not markedly different from the product inorganic phosphate in terms of elec- 
trostatic repulsion and resonance stabilization. Thus, the AG?’ for hydrolysis of 
AMP is much smaller than the corresponding values for ATP and ADP. 


Acetyl Phosphate and 1,3-Bisphosphoglycerate Are 
Phosphoric-Carboxylic Anhydrides 


The mixed anhydrides of phosphoric and carboxylic acids, frequently called acyl 
phosphates, are also energy-rich. Two biologically important acyl phosphates are 
acetyl phosphate and 1,3-bisphosphoglycerate. Hydrolysis of these species yields 
acetate and 3-phosphoglycerate, respectively, in addition to inorganic phosphate 
(Figure 3.10). Once again, the large A G°’ values indicate that the reactants are 
destabilized relative to products. This arises from bond strain, which can be 
traced to the partial positive charges on the carbonyl carbon and phosphorus 
atoms of these structures. The energy stored in the mixed anhydride bond 
(which is required to overcome the charge—charge repulsion) is released upon 
hydrolysis. Increased resonance possibilities in the products relative to the reac- 
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CENGAGENOW” ANIMATED FIGURE 3.9 Hydrolysis of 
ATP to ADP (and/or of ADP to AMP) leads to relief of 
electrostatic repulsion. See this figure animated at 
www.cengage.com/login 
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CH,—C + HO > CH,—C—O + + Ht 
AG" = -43.3 kJ/mol 


+ HO 





AG” = -49.6 kJ/mol 





1,3-Bisphosphoglycerate 3-Phosphoglycerate 


CENGAGENOW” ACTIVE FIGURE 3.10 The hydrolysis reactions of acetyl phosphate and 1,3-bisphosphoglycerate. 
Test yourself on the concepts in this figure at www.cengage.com/login 


3.6 What Are the Complex Equilibria Involved in ATP Hydrolysis? 
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CENGAGENOW” ANIMATED FIGURE 3.11 Phosphoenolpyruvate (PEP) is produced by the enolase reaction (in 
glycolysis; see Chapter 18) and in turn drives the phosphorylation of ADP to form ATP in the pyruvate kinase 
reaction. See this figure animated at www.cengage.com/login 
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CENGAGENOW” ANIMATED FIGURE 3.12 Hydrolysis and the subsequent tautomerization account for the very 
large AG” of PEP. See this figure animated at www.cengage.com/login 


tants also contribute to the large negative A G°’ values. The value of AG” de- 
pends on the pK, values of the starting anhydride and the product phosphoric 
and carboxylic acids, and of course also on the pH of the medium. 


Enol Phosphates Are Potent Phosphorylating Agents 


The largest value of A G°’ in Table 3.3 belongs to phosphoenolpyruvate or PEP, an ex- 
ample of an enolic phosphate. This molecule is an important intermediate in car- 
bohydrate metabolism, and due to its large negative A G°’, it is a potent phospho- 
rylating agent. PEP is formed via dehydration of 2-phosphoglycerate by enolase 
during fermentation and glycolysis. PEP is subsequently transformed into pyruvate 
upon transfer of its phosphate to ADP by pyruvate kinase (Figure 3.11). The very 
large negative value of AG°’ for the latter reaction is to a large extent the result of 
a secondary reaction of the enol form of pyruvate. Upon hydrolysis, the unstable 
enolic form of pyruvate immediately converts to the keto form with a resulting 
large negative AG°’ (Figure 3.12). Together, the hydrolysis and subsequent tau- 
tomerization result in an overall A G°’ of —62.2 kJ/mol. 


3260 What Are the Complex Equilibria Involved 
in ATP Hydrolysis? 


So far, as in Equation 3.34, the hydrolyses of ATP and other high-energy phosphates 
have been portrayed as simple processes. The situation in a real biological system is 
far more complex, owing to the operation of several ionic equilibria. First, ATP, 


AG =-33.6 kJ/mol 
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AG? (kJ/mol) 
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FIGURE 3.14 The pH dependence of the free energy of 
hydrolysis of ATP. Because pH varies only slightly in bio- 
logical environments, the effect on AG is usually small. 
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FIGURE 3.15 The free energy of hydrolysis of ATP as a 
function of total Mg?* ion concentration at 38°C and 
pH 7.0. (Adapted from Gwynn, R.W., and Veech, R. L, 1973. The 
equilibrium constants of the adenosine triphosphate hydrolysis 
and the adenosine triphosphate-citrate lyase reactions. Journal 
of Biological Chemistry 248:6966-6972.) 








Color indicates the locations of the 
dissociable protons of ATP 


FIGURE 3.13 Adenosine-5’-triphosphate (ATP). 


ADP, and the other species in Table 3.3 can exist in several different ionization 
states that must be accounted for in any quantitative analysis. Second, phosphate 
compounds bind a variety of divalent and monovalent cations with substantial affin- 
ity, and the various metal complexes must also be considered in such analyses. Con- 
sideration of these special cases makes the quantitative analysis far more realistic. 
The importance of these multiple equilibria in group transfer reactions is illus- 
trated for the hydrolysis of ATP, but the principles and methods presented are gen- 
eral and can be applied to any similar hydrolysis reaction. 


The AG°’ of Hydrolysis for ATP Is pH-Dependent 


ATP has four dissociable protons, as indicated in Figure 3.13. Three of the protons 
on the triphosphate chain dissociate at very low pH. The last proton to dissociate 
from the triphosphate chain possesses a pK, of 6.95. At higher pH values, ATP is 
completely deprotonated. ADP and phosphoric acid also undergo multiple ioniza- 
tions. These multiple ionizations make the equilibrium constant for ATP hydrolysis 
more complicated than the simple expression in Equation 3.33. Multiple ioniza- 
tions must also be taken into account when the pH dependence of A G° is consid- 
ered. The calculations are beyond the scope of this text, but Figure 3.14 shows the 
variation of A G° as a function of pH. The free energy of hydrolysis is nearly constant 
from pH 4 to pH 6. At higher values of pH, A G° varies linearly with pH, becoming 
more negative by 5.7 kJ/mol for every pH unit of increase at 37°C. Because the pH 
of most biological tissues and fluids is near neutrality, the effect on A G° is relatively 
small, but it must be taken into account in certain situations. 


Metal lons Affect the Free Energy of Hydrolysis of ATP 


Most biological environments contain substantial amounts of divalent and monova- 
lent metal ions, including Mg?*, Ca?*, Nat, K*, and so on. What effect do metal 
ions have on the equilibrium constant for ATP hydrolysis and the associated free 
energy change? Figure 3.15 shows the change in AG” with pMg (that is, 
—logio[Mg?*]) at pH 7.0 and 38°C. The free energy of hydrolysis of ATP at zero 
Mg?* is —35.7 kJ/mol, and at 5 mM total Mg?* (the minimum in the plot) the 
AG,».° is approximately —31 kJ/mol. Thus, in most real biological environments 
(with pH near 7 and Mg**concentrations of 5 mM or more) the free energy of hy- 
drolysis of ATP is altered more by metal ions than by protons. A widely used “con- 
sensus value” for AG?’ of ATP in biological systems is —30.5 kJ/mol (Table 3.3). 
This value, cited in the 1976 Handbook of Biochemistry and Molecular Biology (3rd ed., 
Physical and Chemical Data, Vol. 1, pp. 296-304, Boca Raton, FL: CRC Press), was de- 
termined in the presence of “excess Mg?*.” This is the value we use for metabolic calcu- 
lations in the balance of this text. 
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Number of Mg?* 
bound per ATP 





FIGURE 3.16 Number of Mg2* ions bound per ATP as a function of pH and [Mg?*]. pMg = — log: [Mg?*]. 
(Cengage Learning.) 


Why does the G°’ of ATP hydrolysis depend so strongly on Mg?* concentration? 
The answer lies in the strong binding of Mg** by the triphosphate oxygens of ATP. 
As shown in Figure 3.16, the binding of Mg?* to ATP is dependent on Mg** ion con- 
centration and also on pH. At pH 7 and 1 mM [Mg?*], approximately one Mg** ion 
is bound to each ATP. The decrease in binding of Mg** at low pH is the result of 
competition by H* and Mg?* for the negatively charged oxygen atoms of ATP. 


Concentration Affects the Free Energy of Hydrolysis of ATP 


Through all these calculations of the effect of pH and metal ions on the ATP hy- 
drolysis equilibrium, we have assumed “standard conditions” with respect to con- 
centrations of all species except for protons. The levels of ATP, ADP, and other 
high-energy metabolites never even begin to approach the standard state of 1 M. 
In most cells, the concentrations of these species are more typically 1 to 5 mM or 
even less. Earlier, we described the effect of concentration on equilibrium con- 
stants and free energies in the form of Equation 3.13. For the present case, we can 
rewrite this as 
[ZADP][2Pi] 


AG=AG° + RTIn SAT] (3.34) 





where the terms in brackets represent the sum (È) of the concentrations of all the 
ionic forms of ATP, ADP, and P;. 

It is clear that changes in the concentrations of these species can have large ef- 
fects on AG. The concentrations of ATP, ADP, and P; may, of course, vary rather in- 
dependently in real biological environments, but if, for the sake of some model cal- 
culations, we assume that all three concentrations are equal, then the effect of 
concentration on AG is as shown in Figure 3.17. The free energy of hydrolysis of 
ATP, which is —35.7 kJ/mol at 1 M, becomes —49.4 kJ/mol at 5 mM (that is, the 
concentration for which pC = —2.3 in Figure 3.17). At 1 mM ATP, ADP, and P;, the 
free energy change becomes even more negative at —53.6 kJ/mol. Clearly, the effects 
of concentration are much greater than the effects of protons or metal ions under physiological 
conditions. 

Does the “concentration effect” change ATP’s position in the energy hierarchy 
(in Table 3.3)? Not really. All the other high- and low-energy phosphates experi- 
ence roughly similar changes in concentration under physiological conditions and 
thus similar changes in their free energies of hydrolysis. The roles of the very-high- 
energy phosphates (PEP, 1,3-bisphosphoglycerate, and creatine phosphate) in the 
synthesis and maintenance of ATP in the cell are considered in our discussions of 
metabolic pathways. In the meantime, several of the problems at the end of this 
chapter address some of the more interesting cases. 
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Where C = concentration of 
ATP, ADP, and P; 


CENGAGENOW” ACTIVE FIGURE 3.17 The free energy of 
hydrolysis of ATP as a function of concentration at 38°C, 

pH 7.0.The plot follows the relationship described in 
Equation 3.34, with the concentrations [C] of ATP ADP and 
P; assumed to be equal. Test yourself on the concepts 
in this figure at www.cengage.com/login 
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CENGAGENOW” ANIMATED FIGURE 3.18 The pyruvate 
inase reaction. See this figure animated at www 
.cengage.com/login 





u/s Why Are Coupled Processes Important to Living Things? 


Many of the reactions necessary to keep cells and organisms alive must run against 
their thermodynamic potential, that is, in the direction of positive AG. Among these 
are the synthesis of adenosine triphosphate (ATP) and other high-energy molecules 
and the creation of ion gradients in all mammalian cells. These processes are driven 
in the thermodynamically unfavorable direction via coupling with highly favorable 
processes. Many such coupled processes are discussed later in this text. They are cru- 
cially important in intermediary metabolism, oxidative phosphorylation, and mem- 
brane transport, as we shall see. 

We can predict whether pairs of coupled reactions will proceed spontaneously 
by simply summing the free energy changes for each reaction. For example, con- 
sider the reaction from glycolysis (discussed in Chapter 18) involving the conver- 
sion of phosphoenolpyruvate (PEP) to pyruvate (Figure 3.18). The hydrolysis of 
PEP is energetically very favorable, and it is used to drive phosphorylation of 
adenosine diphosphate (ADP) to form ATP, a process that is energetically unfa- 
vorable. Using values of AG that would be typical for a human erythrocyte: 


PEP + HO—> pyruvate + P; AG= —78 kJ/mol (3.35) 
ADP + P;—>ATP + HO AG= +55 kJ/mol (3.36) 
PEP + ADP—> pyruvate + ATP Total AG = —23 kJ/mol (3.37) 





The net reaction catalyzed by this enzyme depends upon coupling between the two 
reactions shown in Equations 3.35 and 3.36 to produce the net reaction shown in 
Equation 3.37 with a net negative AG. Many other examples of coupled reactions 
are considered in our discussions of intermediary metabolism (see Part 3). In ad- 
dition, many of the complex biochemical systems discussed in the later chapters of 
this text involve reactions and processes with positive AG values that are driven for- 
ward by coupling to reactions with a negative AG. 


328) What Is the Daily Human Requirement for ATP? 


We can end this discussion of ATP and the other important high-energy compounds 
in biology by discussing the daily metabolic consumption of ATP by humans. An ap- 
proximate calculation gives a somewhat surprising and impressive result. Assume 
that the average adult human consumes approximately 11,700 kJ (2800 kcal, that is, 
2800 Calories) per day. Assume also that the metabolic pathways leading to ATP syn- 
thesis operate at a thermodynamic efficiency of approximately 50%. Thus, of the 
11,700 kJ a person consumes as food, about 5860 kJ end up in the form of synthe- 
sized ATP. As indicated earlier, the hydrolysis of 1 mole of ATP yields approximately 
50 kJ of free energy under cellular conditions. This means that the body cycles 
through 5860/50 = 117 moles of ATP each day. The disodium salt of ATP has a mol- 
ecular weight of 551 g/mol, so an average person hydrolyzes about 
551g 
(117 moles) ae 64,467 g of ATP per day 

The average adult human, with a typical weight of 70 kg or so, thus consumes ap- 
proximately 65 kg of ATP per day, an amount nearly equal to his or her own body 
weight! Fortunately, we have a highly efficient recycling system for ATP/ADP utiliza- 
tion. The energy released from food is stored transiently in the form of ATP. Once 
ATP energy is used and ADP and phosphate are released, our bodies recycle it to 
ATP through intermediary metabolism so that it may be reused. The typical 70-kg 
body contains only about 50 grams of ATP/ADP total. Therefore, each ATP mole- 
cule in our bodies must be recycled nearly 1300 times each day! Were it not for this 
fact, at current commercial prices of about $20 per gram, our ATP “habit” would cost 
more than $1 million per day! In these terms, the ability of biochemistry to sustain 
the marvelous activity and vigor of organisms gains our respect and fascination. 





A DEEPER LOOK 
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ATP Changes the K,, by a Factor of 10° 


Consider a process, AB. It could be a biochemical reaction, or 
the transport of an ion against a concentration gradient, or even a 
mechanical process (such as muscle contraction). Assume that it is 
a thermodynamically unfavorable reaction. Let’s say, for purposes 
of illustration, that AG” = +13.8 kJ/mol. From the equation, 


AG” = —RTIn Keg 
we have 


+13,800 = —(8.31 J/K - mol) (298 K) In Keg 


which yields 
In Keg = —5.57 
Therefore, 
Keq = 0.0088 = [Beq]/TAcq] 


This reaction is clearly unfavorable (as we could have foreseen 
from its positive A G°’). At equilibrium, there is one molecule of 
product B for every 263 molecules of reactant A. Not much A was 
transformed to B. 

Now suppose the reaction A == B is coupled to ATP hydro- 
lysis, as is often the case in metabolism: 


A + ATP=—B + ADP +P; 
The thermodynamic properties of this coupled reaction are the 


same as the sum of the thermodynamic properties of the partial 
reactions: 


A=B 
ATP + H:O =——ADP + P; 


AG” = +13.8 kJ/mol 
AG” = —30.5 kJ/mol 





A + ATP + HO =B + ADP +P; AG” = —16.7 kJ/mol 


That is, 
A GC" overall = —16.7 kJ/mol 
So 


—16,700 = —RT In Keq = —(8.31) (298)In Keq 
In Keg = —16,700/—2476 = 6.75 
Keq = 850 











Using this equilibrium constant, let’s now consider the cellular sit- 
uation in which the concentrations of A and B are brought to equi- 
librium in the presence of typical prevailing concentrations of 
ATP, ADP, and P;.* 


*The concentrations of ATP, ADP, and P; in a normal, healthy bacterial cell 
growing at 25°C are maintained at roughly 8 mM, 8 mM, and 1 mM, re- 
spectively. Therefore, the ratio [ADP][P;]/[ATP] is about 107°. Under 
these conditions, AG for ATP hydrolysis is approximately —47.6 kJ/mol. 


[Beq] [ADP] [P:] 
0 ~ [Aq] [ATP] 





Ke 


[Beg] [8 X 1079] [1079] 
[Acgl [8 X 10-5] 


[Beg] / [Acq] = 850,000 





850 = 


Comparison of the [Beg]/[Acq] ratio for the simple A&B reac- 

tion with the coupling of this reaction to ATP hydrolysis gives 
850,000 
0.0038 





= 2.2 x 108 


The equilibrium ratio of B to A is more than 10° greater when 
the reaction is coupled to ATP hydrolysis. A reaction that was 
clearly unfavorable (Keq = 0.0038) has become emphatically 
spontaneous! 

The involvement of ATP has raised the equilibrium ratio of 
B/A by more than 200 million—fold. It is informative to realize 
that this multiplication factor does not depend on the nature of 
the reaction. Recall that we defined A === RB in the most general 
terms. Also, the value of this equilibrium constant ratio, some 2.2 X 
10°, is not at all dependent on the particular reaction chosen or its 
standard free energy change, AG°’. You can satisfy yourself on this 
point by choosing some value for AG°’ other than +13.8 kJ/mol 
and repeating these calculations (keeping the concentrations of 
ATP, ADP, and P; at 8, 8, and 1 mM, as before). 
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SUMMARY 


The activities of living things require energy. Movement, growth, synthe- 
sis of biomolecules, and the transport of ions and molecules across mem- 
branes all demand energy input. All organisms must acquire energy from 
their surroundings and must utilize that energy efficiently to carry out 
life processes. To study such bioenergetic phenomena requires familiar- 
ity with thermodynamics. Thermodynamics also allows us to determine 
whether chemical processes and reactions occur spontaneously. 


3.1 What Are the Basic Concepts of Thermodynamics? The system is 
that portion of the universe with which we are concerned. The sur- 
roundings include everything else in the universe. An isolated system 
cannot exchange matter or energy with its surroundings. A closed sys- 
tem may exchange energy, but not matter, with the surroundings. An 
open system may exchange matter, energy, or both with the surround- 
ings. Living things are typically open systems. The first law of thermo- 
dynamics states that the total energy of an isolated system is conserved. 
Enthalpy, H, is defined as H = E+ PV. AH is equal to the heat trans- 
ferred in a constant pressure process. For biochemical reactions in liq- 
uids, volume changes are typically quite small, and enthalpy and inter- 
nal energy are often essentially equal. There are several statements of 
the second law of thermodynamics, including the following: (1) Systems 
tend to proceed from ordered (low-entropy or low-probability) states to 
disordered (high-entropy or high-probability) states. (2) The entropy of 
the system plus surroundings is unchanged by reversible processes; the 
entropy of the system plus surroundings increases for irreversible 
processes. (3) All naturally occurring processes proceed toward equilib- 
rium, that is, to a state of minimum potential energy. The third law of 
thermodynamics states that the entropy of any crystalline, perfectly 
ordered substance must approach zero as the temperature approaches 
0 K, and, at T= 0 K, entropy is exactly zero. The Gibbs free energy, G, 
defined as G = H- TS, provides a simple criterion for equilibrium. 


3.2 What Is the Effect of Concentration on Net Free Energy Changes? 
The free energy change for a reaction can be very different from the 
standard-state value if the concentrations of reactants and products dif- 
fer significantly from unit activity (1 M for solutions). For the reaction 
A+ B= —C + D, the free energy change for non-standard-state con- 
centrations is given by 


[C][D] 
[A] [B] 


3.3 What Is the Effect of pH on Standard-State Free Energies? For 
biochemical reactions in which hydrogen ions (H*) are consumed or 


AG=AG° + RTIn 


produced, a modified standard state, designated with prime (') symbols, 
as in AG*’, Kes AH?”', may be employed. For a reaction in which H* is 


produced, A G°’ is given by 
AG” =AG° + RTln [H*] 


3.4 What Can Thermodynamic Parameters Tell Us About Biochemical 
Events? A single parameter (AH or AS, for example) is not very mean- 
ingful, but comparison of several thermodynamic parameters can pro- 
vide meaningful insights about a process. Thermodynamic parameters 
can be used to predict whether a given reaction will occur as written and 
to calculate the relative contributions of molecular phenomena (for ex- 
ample, hydrogen bonding or hydrophobic interactions) to an overall 
process. 


3.5 What Are the Characteristics of High-Energy Biomolecules? A 
small family of universal biomolecules mediates the flow of energy from 
exergonic reactions to the energy-requiring processes of life. These 
molecules are the reduced coenzymes and the high-energy phosphate 
compounds. High-energy phosphates are not long-term energy storage 
substances, but rather transient forms of stored energy. 


3.6 What Are the Complex Equilibria Involved in ATP Hydrolysis? 
ATP, ADP, and similar species can exist in several different ionization 
states that must be accounted for in any quantitative analysis. Also, phos- 
phate compounds bind a variety of divalent and monovalent cations 
with substantial affinity, and the various metal complexes must also be 
considered in such analyses. 


3.7 Why Are Coupled Processes Important to Living Things? Many of 
the reactions necessary to keep cells and organisms alive must run 
against their thermodynamic potential, that is, in the direction of posi- 
tive AG. These processes are driven in the thermodynamically unfavor- 
able direction via coupling with highly favorable processes. Many such 
coupled processes are crucially important in intermediary metabolism, 
oxidative phosphorylation, and membrane transport. 


3.8 What Is the Daily Human Requirement for ATP? The average 
adult human, with a typical weight of 70 kg or so, consumes 
approximately 2800 calories per day. The energy released from food is 
stored transiently in the form of ATP. Once ATP energy is used and ADP 
and phosphate are released, our bodies recycle it to ATP through inter- 
mediary metabolism so that it may be reused. The typical 70-kg body 
contains only about 50 grams of ATP/ADP total. Therefore, each ATP 
molecule in our bodies must be recycled nearly 1300 times each day. 


PROBLEMS 
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1. An enzymatic hydrolysis of fructose-1-P, 
Fructose-1-P + HO == fructose + P; 


was allowed to proceed to equilibrium at 25°C. The original con- 
centration of fructose-1-P was 0.2 M, but when the system had 
reached equilibrium the concentration of fructose-1-P was only 
6.52 X 1075 M. Calculate the equilibrium constant for this reaction 
and the free energy of hydrolysis of fructose-1-P. 

2. The equilibrium constant for some process A === B is 0.5 at 20°C 
and 10 at 30°C. Assuming that A H° is independent of temperature, 
calculate A H° for this reaction. Determine A G° and A S° at 20° and 
at 30°C. Why is it important in this problem to assume that AH? is 
independent of temperature? 

3. The standard-state free energy of hydrolysis for acetyl phosphate is 
AG? = —42.3 kJ/mol. 


Acetyl-P + HO —> acetate + P; 


Calculate the free energy change for acetyl phosphate hydrolysis 
in a solution of 2 mM acetate, 2 mM phosphate, and 3 nM acetyl 
phosphate. 

4. Define a state function. Name three thermodynamic quantities that 
are state functions and three that are not. 

5. ATP hydrolysis at pH 7.0 is accompanied by release of a hydrogen 
ion to the medium 


ATP*~ + HO === ADP" + HPO,- + Ht 


If the AG” for this reaction is —30.5 kJ/mol, what is AG? (that is, 

the free energy change for the same reaction with all components, 

including H™, at a standard state of 1 M)? 

6. For the process A === B, Keq (AB) is 0.02 at 37°C. For the process 

B= =C, Keq (BC) = 1000 at 37°C. 

a. Determine Keq (AC), the equilibrium constant for the overall 
process A ==> C, from Keq (AB) and Keq (BC). 

b. Determine standard-state free energy changes for all three 
processes, and use A G° (AC) to determine Keq (AC). Make sure that 
this value agrees with that determined in part a of this problem. 


7. Draw all possible resonance structures for creatine phosphate 
and discuss their possible effects on resonance stabilization of the 
molecule. 


8. Write the equilibrium constant, Keq, for the hydrolysis of creatine 
phosphate and calculate a value for Keg at 25°C from the value of 
AG% in Table 3.3. 

9. Imagine that creatine phosphate, rather than ATP, is the universal 
energy carrier molecule in the human body. Repeat the calculation 
presented in Section 3.8, calculating the weight of creatine phos- 
phate that would need to be consumed each day by a typical adult 
human if creatine phosphate could not be recycled. If recycling of 
creatine phosphate were possible, and if the typical adult human 
body contained 20 grams of creatine phosphate, how many times 
would each creatine phosphate molecule need to be turned over or 
recycled each day? Repeat the calculation assuming that glycerol-3- 
phosphate is the universal energy carrier and that the body contains 
20 grams of glycerol-3-phosphate. 

10. Calculate the free energy of hydrolysis of ATP in a rat liver cell in 
which the ATP, ADP, and P; concentrations are 3.4, 1.3, and 4.8 mM, 
respectively. 

11. Hexokinase catalyzes the phosphorylation of glucose from ATP, 
yielding glucose-6-P and ADP. Using the values of Table 3.3, calcu- 
late the standard-state free energy change and equilibrium constant 
for the hexokinase reaction. 

12. Would you expect the free energy of hydrolysis of acetoacetyl- 
coenzyme A (see diagram) to be greater than, equal to, or less 
than that of acetyl-coenzyme A? Provide a chemical rationale for 
your answer. 


CH;— C — CH y— C — S — CoA 





13. Consider carbamoyl phosphate, a precursor in the biosynthesis of 
pyrimidines: 


+Z N 
HN O — PO,” 
Based on the discussion of high-energy phosphates in this chapter, 
would you expect carbamoyl phosphate to possess a high free 
energy of hydrolysis? Provide a chemical rationale for your answer. 
14. You are studying the various components of the venom of a poiso- 
nous lizard. One of the venom components is a protein that ap- 
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pears to be temperature sensitive. When heated, it denatures and is 
no longer toxic. The process can be described by the following sim- 
ple equation: 


T (toxic) = =N (nontoxic) 


There is only enough protein from this venom to carry out two 

equilibrium measurements. At 298 K, you find that 98% of the pro- 

tein is in its toxic form. However, when you raise the temperature to 

320 K, you find that only 10% of the protein is in its toxic form. 

a. Calculate the equilibrium constants for the T to N conversion at 
these two temperatures. 

b. Use the data to determine the AH®, A S°, and A G° for this process. 


15. Consider the data in Figures 3.3 and 3.4. Is the denaturation of chy- 
motrypsinogen spontaneous at 58°C? And what is the temperature 
at which the native and denaturated forms of chymotrypsinogen are 
in equilibrium? 

16. Consider Tables 3.1 and 3.2, as well as the discussion of Table 3.2 in 
the text, and discuss the meaning of the positive A Cp in Table 3.1. 

17. The difference between A G° and AG?’ was discussed in Section 3.3. 
Consider the hydrolysis of acetyl phosphate (Figure 3.12) and de- 
termine the value of AG° for each of this reaction at pH 2, 7, and 
12. The value of AG” for the enolase reaction (Figure 3.13) is 
1.8 kJ/mol. What is the value of A G° for enolase at pH 2, 7, and 12? 
Why is this case different from that of acetyl phosphate? 

18. What is the significance of the magnitude of A G°’ for ATP in the cal- 
culations in the box on page 67? Repeat these calculations for the 
case of coupling of a reaction to 1,3-bisphosphoglycerate hydrolysis 
to see what effect this reaction would have on the equilibrium ratio 
for components A and B under the conditions stated on this page. 


Preparing for the MCAT Exam 

19. The hydrolysis of 1,3-bisphosphoglycerate is favorable, due in part 
to the increased resonance stabilization of the products of the re- 
action. Draw resonance structures for the reactant and the products 
of this reaction to establish that this statement is true. 


20. The acyl-CoA synthetase reaction activates fatty acids for oxidation 
in cells: 


R-COO™ + CoASH + ATP—+R-COSCoA + AMP + pyrophosphate 


The reaction is driven forward in part by hydrolysis of ATP to AMP 
and pyrophosphate. However, pyrophosphate undergoes further 
cleavage to yield two phosphate anions. Discuss the energetics of 
this reaction both in the presence and absence of pyrophosphate 
cleavage. 
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David W. Grisham 


All objects have mirror images. Like many molecules, 
amino acids exist in mirror-image forms (stereo- 
isomers) that are not superimposable. Only the 
L-isomers of amino acids commonly occur in nature. 
(Three Sisters Wilderness, central Oregon. The Middle 
Sister, reflected in an alpine lake.) 





To hold, as ‘twere, the mirror up to nature. 


William Shakespeare 
Hamlet 
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Amino Acids 


ESSENTIAL QUESTION 


Proteins are the indispensable agents of biological function, and amino acids are 
the building blocks of proteins. The stunning diversity of the thousands of pro- 
teins found in nature arises from the intrinsic properties of only 20 commonly 
occurring amino acids. These features include (1) the capacity to polymerize, 

(2) novel acid-base properties, (3) varied structure and chemical functionality in 
the amino acid side chains, and (4) chirality. This chapter describes each of these 
properties, laying a foundation for discussions of protein structure (Chapters 5 and 
6), enzyme function (Chapters 13-15), and many other subjects in later chapters. 


Why are amino acids uniquely suited to their role as the building blocks of 
proteins? 


4=|s What Are the Structures and Properties of Amino Acids? 


Typical Amino Acids Contain a Central Tetrahedral Carbon Atom 


The structure of a single typical amino acid is shown in Figure 4.1. Central to this 
structure is the tetrahedral alpha (a) carbon (C,), which is covalently linked to both 
the amino group and the carboxyl group. Also bonded to this a-carbon are a hydro- 
gen and a variable side chain. It is the side chain, the so-called R group, that gives 
each amino acid its identity. The detailed acid-base properties of amino acids are 
discussed in the following sections. It is sufficient for now to realize that, in neutral 
solution (pH 7), the carboxyl group exists as —COO™ and the amino group as 
—NH;*. Because the resulting amino acid contains one positive and one negative 
charge, it is a neutral molecule called a zwitterion. Amino acids are also chiral mole- 
cules. With four different groups attached to it, the a-carbon is said to be asymmetric. 
The two possible configurations for the a-carbon constitute nonidentical mirror-im- 
age isomers or enantiomers. Details of amino acid stereochemistry are discussed in 
Section 4.4. 


Amino Acids Can Join via Peptide Bonds 


The crucial feature of amino acids that allows them to polymerize to form peptides 
and proteins is the existence of their two identifying chemical groups: the amino 
(—NH;*) and carboxyl (—COO7) groups, as shown in Figure 4.2. The amino and 
carboxyl groups of amino acids can react in a head-to-tail fashion, eliminating a wa- 
ter molecule and forming a covalent amide linkage, which, in the case of peptides 
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H, RO chain 
a-Carbon ~ |- ë Z 
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+/ N = 
HN COO 
Amino Carboxyl Ball-and-stick Amino acids are 
group group model tetrahedral structures 


CENGAGENOW” ANIMATED FIGURE 4.1 Anatomy of an amino acid. Except for proline and its derivatives, all of 
the amino acids commonly found in proteins possess this type of structure. See this figure animated at 
www.cengage.com/login 


4.1 What Are the Structures and Properties of Amino Acids? 


— Two amino acids 








Removal of a 
water molecule... 





Peptide bond 


\ 


Amino end Carboxyl end 


= ... formation of 
the CO— NH bond 
de 


CENGAGENOW” ANIMATED FIGURE 4.2 The a-COOH and a-NH3;* groups of two amino acids can react with 
the resulting loss of a water molecule to form a covalent amide bond. (Illustration: Irving Geis. Rights owned by 
Howard Hughes Medical Institute. Not to be reproduced without permission.) See this figure animated at www 
.cengage.com/login 


and proteins, is typically referred to as a peptide bond. The equilibrium for this re- 
action in aqueous solution favors peptide bond hydrolysis. For this reason, biologi- 
cal systems as well as peptide chemists in the laboratory must couple peptide bond 
formation in an indirect manner or with energy input. 

Repetition of the reaction shown in Figure 4.2 produces polypeptides and proteins. 
The remarkable properties of proteins, which we shall discover and come to appreci- 
ate in later chapters, all depend in one way or another on the unique properties and 
chemical diversity of the 20 common amino acids found in proteins. 


There Are 20 Common Amino Acids 


The structures and abbreviations for the 20 amino acids commonly found in pro- 
teins are shown in Figure 4.3. All the amino acids except proline have both free 
a-amino and free a-carboxyl groups (Figure 4.1). There are several ways to classify 
the common amino acids. The most useful of these classifications is based on the 
polarity of the side chains. Thus, the structures shown in Figure 4.3 are grouped 
into the following categories: (1) nonpolar or hydrophobic amino acids, (2) neu- 
tral (uncharged) but polar amino acids, (3) acidic amino acids (which have a net 
negative charge at pH 7.0), and (4) basic amino acids (which have a net positive 
charge at neutral pH). In later chapters, the importance of this classification sys- 
tem for predicting protein properties becomes clear. Also shown in Figure 4.3 are 
the three-letter and one-letter codes used to represent the amino acids. These 
codes are useful when displaying and comparing the sequences of proteins in 
shorthand form. (Note that several of the one-letter abbreviations are phonetic 
in origin: arginine = “Rginine” = R, phenylalanine = “Fenylalanine” = F, aspartic 
acid = “asparDic” = D.) 
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FIGURE 4.3 The 20 amino acids that are the building blocks of most proteins can be classified as (a) nonpolar (hydrophobic); (b) polar, neutral; (c) acidic; or (d) basic. 
(Illustration: Irving Geis. Rights owned by Howard Hughes Medical Institute. Not to be produced without permission.) 
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FIGURE 4.3 continued 
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CENGAGENOW* Go to CengageNOW at www 
.cengage.com/login and click Biochemistrylnteractive 
to find out how many amino acids you can recognize 
and name. 


Nonpolar Amino Acids The nonpolar amino acids (Figure 4.3a) are critically im- 
portant for the processes that drive protein chains to “fold,” that is to form their nat- 
ural (and functional) structures, as shown in Chapter 6. Amino acids termed nonpolar 
include all those with alkyl chain R groups (alanine, valine, leucine, and isoleucine) ; 
as well as proline (with its unusual cyclic structure); methionine (one of the two sulfur- 
containing amino acids); and two aromatic amino acids, phenylalanine and trypto- 
phan. Tryptophan is sometimes considered a borderline member of this group be- 
cause it can interact favorably with water via the N—H moiety of the indole ring. Pro- 
line, strictly speaking, is not an amino acid but rather an a-imino acid. 


Polar, Uncharged Amino Acids The polar, uncharged amino acids (Figure 4.3b), 
except for glycine, contain R groups that can (1) form hydrogen bonds with water, 
and (2) play a variety of nucleophilic roles in enzyme reactions. These amino acids 
are usually more soluble in water than the nonpolar amino acids. The amide groups 
of asparagine and glutamine; the hydroxyl groups of tyrosine, threonine, and serine; 
and the sulfhydryl group of cysteine are all good hydrogen bond-forming moieties. 
Glycine, the simplest amino acid, has only a single hydrogen for an R group, and this 
hydrogen is not a good hydrogen bond former. Glycine’s solubility properties are 
mainly influenced by its polar amino and carboxyl groups, and thus glycine is best 
considered a member of the polar, uncharged group. It should be noted that tyrosine 
has significant nonpolar characteristics due to its aromatic ring and could arguably 
be placed in the nonpolar group. However, with a pK, of 10.1, tyrosine’s phenolic hy- 
droxyl is a charged, polar entity at high pH. 


Acidic Amino Acids There are two acidic amino acids—aspartic acid and glutamic 
acid—whose R groups contain a carboxyl group (Figure 4.3c). These side-chain car- 
boxyl groups are weaker acids than the a-COOH group but are sufficiently acidic to 
exist as —COO° at neutral pH. Aspartic acid and glutamic acid thus have a net neg- 
ative charge at pH 7. These forms are appropriately referred to as aspartate and glu- 
tamate. These negatively charged amino acids play several important roles in pro- 
teins. Many proteins that bind metal ions for structural or functional purposes 
possess metal-binding sites containing one or more aspartate and glutamate side 
chains. The acid-base chemistry of such groups is considered in detail in Section 4.2. 


Basic Amino Acids Three of the common amino acids have side chains with net pos- 
itive charges at neutral pH: histidine, arginine, and lysine (Figure 4.3d). Histidine 
contains an imidazole group, arginine contains a guanidino group, and lysine con- 
tains a protonated alkyl amino group. The side chains of the latter two amino acids 
are fully protonated at pH 7, but histidine, with a side-chain pK, of 6.0, is only 10% 
protonated at pH 7. With a pK, near neutrality, histidine side chains play important 
roles as proton donors and acceptors in many enzyme reactions. Histidine-containing 
peptides are important biological buffers, as discussed in Chapter 2. Arginine and ly- 
sine side chains, which are protonated under physiological conditions, participate in 
electrostatic interactions in proteins. 


Are There Other Ways to Classify Amino Acids? 


There are alternative ways to classify the 20 common amino acids. For example, it 
would be reasonable to imagine that the amino acids could be described as hydro- 
phobic, hydrophilic, or amphipathic: 


Hydrophobic: Hydrophilic: Amphipathic: 
Alanine Proline Arginine Glutamine Lysine 
Glycine Valine Asparagine Histidine Methionine 
Isoleucine Aspartic acid Serine Tryptophan 
Leucine Cysteine Threonine Tyrosine 


Phenylalanine Glutamic acid 
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Lysine can be considered amphipathic, because its R group consists of an aliphatic 
side chain, which can interact with hydrophobic amino acids in proteins, and an 
amino group, which is normally charged at neutral pH. Methionine is the least po- 
lar of the amphipathic amino acids, but its thioether sulfur can be an effective 
metal ligand in proteins. Cysteine can deprotonate at pH values greater than 7, 
and the thiolate anion is the most potent nucleophile that can be generated 
among the 20 common acids. The imidazole ring of histidine has two nitrogen 
atoms, each with an H. The pK for dissociation of the first of these two H is around 
6. However, once one N-H has dissociated, the pK value for the other becomes 
greater than 10. 


Amino Acids 21 and 22—and More? 


Although uncommon, natural amino acids beyond the well-known 20 actually do oc- 
cur. Selenocysteine (Figure 4.4a) was first identified in 1986 (see Chapter 30, page 
954), and it has since been found in a variety of organisms. 

More recently, Joseph Krzycki and his colleagues at Ohio State University have dis- 
covered a lysine derivative—pyrrolysine—in several archaeal species, including 
Methanosarcina barker, found as a bottom-dwelling microbe of freshwater lakes. 
Pyrrolysine (Figure 4.4a) and selenocysteine both are incorporated naturally into pro- 
teins thanks to specially adapted RNA molecules. 

Both selenocysteine and pyrrolysine bring novel structural and chemical features 
to the proteins that contain them. How many more unusual amino acids might be in- 
corporated in proteins in a similar manner? 


(a) (b) 
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FIGURE 4.4 The structures of several amino acids that are less common but nevertheless found in certain pro- 
teins. Hydroxylysine and hydroxyproline are found in connective-tissue proteins; pyroglutamic acid is found in 
bacteriorhodopsin (a protein in Halobacterium halobium). Epinephrine, histamine, and serotonin, although not 
amino acids, are derived from and closely related to amino acids. 
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Several Amino Acids Occur Only Rarely in Proteins 


There are several amino acids that occur only rarely in proteins and are produced by 
modifications of one of the 20 amino acids already incorporated into a protein (Fig- 
ure 4.4b), including hydroxylysine and hydroxyproline, which are found mainly in the 
collagen and gelatin proteins, pyroglutamic acid, which is found in a light-driven pro- 
ton-pumping protein called bacteriorhodopsin, and y-carboxyglutamic acid, which is 
found in calcium-binding proteins. 

Certain amino acids and their derivatives, although not found in proteins, 
nonetheless are biochemically important. A few of the more notable examples are 
shown in Figure 4.4c. y-Aminobutyric acid, or GABA, is produced by the decar- 
boxylation of glutamic acid and is a potent neurotransmitter. Histamine, which is 
synthesized by decarboxylation of histidine, and serotonin, which is derived from 
tryptophan, similarly function as neurotransmitters and regulators. Epinephrine 
(also known as adrenaline), derived from tyrosine, is an important hormone. 


4:2) What Are the Acid-Base Properties of Amino Acids? 
Amino Acids Are Weak Polyprotic Acids 


From a chemical point of view, the common amino acids are all weak polyprotic 
acids. The ionizable groups are not strongly dissociating ones, and the degree of dis- 
sociation thus depends on the pH of the medium. All the amino acids contain at 
least two dissociable hydrogens. 

Consider the acid-base behavior of glycine, the simplest amino acid. At low pH, 
both the amino and carboxyl groups are protonated and the molecule has a net 
positive charge. If the counterion in solution is a chloride ion, this form is referred 
to as glycine hydrochloride. If the pH is increased, the carboxyl group is the first to 
dissociate, yielding the neutral zwitterionic species Gly? (Figure 4.5). A further 
increase in pH eventually results in dissociation of the amino group to yield the 
negatively charged glycinate. If we denote these three forms as Gly*, Gly’, and Gly", 
we can write the first dissociation of Gly” as 


Gly* + H,O == Gly? TF H;0* 


and the dissociation constant K; as 


[Gly°] [H,0*] 
o [Gly*] 
pH 1 Net charge +1 pH7 Net charge 0 pH 13 Net charge -1 
a a 

COOH Ht COO Ht COO 

g l Y% yl ih | 
H;N—C—H 4§& __H3N—C—H ————— _ HN—C—H 

| | | 

R R R 
Cationic form Zwitterion (neutral) Anionic form 


CENGAGENOW” ANIMATED FIGURE 4.5 The ionic forms of the amino acids, shown without consideration of 
any ionizations on the side chain. The cationic form is the low pH form, and the titration of the cationic species 
with base yields the zwitterion and finally the anionic form. (Illustration: Irving Geis. Rights owned by Howard Hughes 
Medical Institute. Not to be reproduced without permission.) See this figure animated at www.cengage.com/ 
login 
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TABLE 4.1 pK Values of Common ; o Acid: 

Amino Acid a-COOH pk, a-NH3* pKa R group pKa 
Alanine 2.4 9.7 

Arginine 2.2 9.0 12:5 
Asparagine 2.0 8.8 

Aspartic acid 2.1 9.8 3.9 
Cysteine 1.7 10.8 8.3 
Glutamic acid 2.2 9.7 4.3 
Glutamine 2.2 9.1 

Glycine 2.3 9.6 

Histidine 1.8 9.2 6.0 
Isoleucine 2.4 9.7 

Leucine 2.4 9.6 

Lysine 2.2 9.0 10.5 
Methionine 2.3 9.2 

Phenylalanine 1.8 9.1 

Proline 2.1 10.6 

Serine 2.2 9.2 =13 
Threonine 2.6 10.4 13 
Tryptophan 2.4 9.4 

Tyrosine 2.2 9.1 10.1 
Valine 2.3 9.6 











Values for K, for the common amino acids are typically 0.4 to 1.0 X 107° M, so that 
typical values of pK, center on values of 2.0 to 2.4 (Table 4.1). In a similar manner, 
we can write the second dissociation reaction as 
and the dissociation constant K as 
[Gly] [H3;07] 
í [Gly"] 





Typical values for pK are in the range of 9.0 to 9.8. At physiological pH, the 
a-carboxyl group of a simple amino acid (with no ionizable side chains) is com- 
pletely dissociated, whereas the a-amino group has not really begun its dissociation. 
The titration curve for such an amino acid is shown in Figure 4.6. 





|EXAMPLE | What is the pH of a glycine solution in which the a-NH3* group is 
one-third dissociated? 


Answer 
The appropriate Henderson—Hasselbalch equation is 
[Gly] 
H = pk, +1 ae 
P P 0810 [Gly°] 
If the a-amino group is one-third dissociated, there is 1 part Gly” for every 2 parts 
Gly®. The important pK, is the pK, for the amino group. The glycine a-amino 
group has a pK, of 9.6. The result is 


pH =9.6+ logo (1/2) 
pH = 9.3 
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FIGURE 4.6 Titration of glycine, a simple amino acid. The isoelectric point, pl, the pH where glycine has a net 
charge of 0, can be calculated as (pk; + pk2)/2. 


Note that the dissociation constants of both the a-carboxyl and a-amino groups are 
affected by the presence of the other group. The adjacent a-amino group makes 
the a-COOH group more acidic (that is, it lowers the pK,), so it gives up a proton 
more readily than simple alkyl carboxylic acids. Thus, the pK, of 2.0 to 2.1 for 
a-carboxyl groups of amino acids is substantially lower than that of acetic acid 
(pK, = 4.76), for example. What is the chemical basis for the low pK, of the 
a-COOH group of amino acids? The a-NH3* (ammonium) group is strongly 
electron-withdrawing, and the positive charge of the amino group exerts a strong 
field effect and stabilizes the carboxylate anion. (The effect of the a-COO™ group on 
the pK, of the a-NH;* group is the basis for problem 4 at the end of this chapter.) 


Side Chains of Amino Acids Undergo Characteristic lonizations 


As we have seen, the side chains of several of the amino acids also contain disso- 
ciable groups. Thus, aspartic and glutamic acids contain an additional carboxyl 
function, and lysine possesses an aliphatic amino function. Histidine contains an 
ionizable imidazolium proton, and arginine carries a guanidinium function. Typ- 
ical pK, values of these groups are shown in Table 4.1. The 6-carboxyl group of as- 
partic acid and the y-carboxyl side chain of glutamic acid exhibit pK, values in- 
termediate to the a-COOH on one hand and typical aliphatic carboxyl groups on 
the other hand. In a similar fashion, the e-amino group of lysine exhibits a pK, 
that is higher than that of the a-amino group but similar to that for a typical 
aliphatic amino group. These intermediate side-chain pK, values reflect the 
slightly diminished effect of the a-carbon dissociable groups that lie several car- 
bons removed from the side-chain functional groups. Figure 4.7 shows typical 
titration curves for glutamic acid and lysine, along with the ionic species that pre- 
dominate at various points in the titration. The only other side-chain groups that 
exhibit any significant degree of dissociation are the para-OH group of tyrosine 
and the —SH group of cysteine. The pK, of the cysteine sulfhydryl is 8.32, so it is 
about 5% dissociated at pH 7. The tyrosine para-OH group is a very weakly acidic 
group, with a pK, of about 10.1. This group is essentially fully protonated and un- 
charged at pH 7. 
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CENGAGENOW” ACTIVE FIGURE 4.7 Titrations of glutamic acid and lysine. Test yourself on the concepts in 
this figure at www.cengage.com/login 


It is important to note that side-chain pK, values for amino acids in proteins can 
be different from the values shown in Table 4.1. On average, values for side chains in 
proteins are one pH unit closer to neutrality compared to the free amino acid values. 
Moreover, environmental effects in the protein can change pK, values dramatically. 


4:3 What Reactions Do Amino Acids Undergo? 


A number of reactions of amino acids are noteworthy because they are essential to | CENGAGENOW” Go to CengageNOW at www 
the degradation, sequencing, and chemical synthesis of peptides and proteins. One cengage.com/login and click Biochemistryinteractive 
: : : A i to explore the titration behavior of amino acids. 

of these, the reaction with phenylisothiocyanate, or Edman reagent, involves nucle- 

ophilic attack by the amino acid a-amino nitrogen, followed by cyclization, to yield 

a phenylthiohydantoin (PTH) derivative of the amino acid (Figure 4.8a). PTH- 

amino acids can be easily identified and quantified, as shown in Section 4.6. An im- 

portant amino acid side-chain reaction is formation of disulfide bonds via reaction 

between two cysteines. In proteins, cysteine residues form disulfide linkages that sta- 

bilize protein structure (Figure 4.8b). Related reactions are discussed in Chapter 5. 


4:45 What Are the Optical and Stereochemical 
Properties of Amino Acids? 


Amino Acids Are Chiral Molecules 


Except for glycine, all of the amino acids isolated from proteins have four different 
groups attached to the a-carbon atom. In such a case, the a-carbon is said to be 
asymmetric or chiral (from the Greek cheir, meaning “hand”), and the two possible 
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| || — FIGURE 4.8 Some reactions of amino acids. (a) Edman reagent, 
H O Disulfide phenylisothiocyanate, reacts with the a-amino group of an amino acid 
Cys residues in or peptide to produce a phenylthiohydantoin (PTH) derivative. 
two peptide chains (b) Cysteines react to form disulfides. 
configurations for the a-carbon constitute nonsuperimposable mirror-image iso- 
mers, or enantiomers (Figure 4.9). Enantiomeric molecules display a special prop- 
erty called optical activity—the ability to rotate the plane of polarization of plane- 
polarized light. Clockwise rotation of incident light is referred to as dextrorotatory 
behavior, and counterclockwise rotation is called levorotatory behavior. The mag- 
nitude and direction of the optical rotation depend on the nature of the amino acid 
side chain. Some protein-derived amino acids at a given pH are dextrorotatory and 
others are levorotatory, even though all of them are of the L-configuration. The di- 
rection of optical rotation can be specified in the name by using a (+) for dextro- 
Ww w rotatory compounds and a (—) for levorotatory compounds, as in L(+)-leucine. 
x= Cc — 7 
: Chiral Molecules Are Described by the p,t and R,5 Naming Conventions 





The discoveries of optical activity and enantiomeric structures (see Critical De- 
velopments in Biochemistry, page 84) made it important to develop suitable 
nomenclature for chiral molecules. Two systems are in common use today: the so- 


Perspective drawing 





| w called D,L system and the (R,S) system. 

X Z Z X In the D,L system of nomenclature, the (+) and (—) isomers of glyceraldehyde 
| are denoted as D-glyceraldehyde and L-glyceraldehyde, respectively (see Critical 
Y Y Developments in Biochemistry, page 84). Absolute configurations of all other 


Fischer projections carbon-based molecules are referenced to D- and L-glyceraldehyde. When suffi- 


cient care is taken to avoid racemization of the amino acids during hydrolysis of 
proteins, it is found that all of the amino acids derived from natural proteins are 

, i of the L-configuration. Amino acids of the D-configuration are nonetheless found 
molecules based on a chiral carbon atom. Enantiomers A t iall ts of tai üd ëbióti R li 
arenonsupeninposablemitorimagësot acimother: in nature, especially as components of certain peptide antibiotics, such as vali- 
See this figure animated at www.cengage.com/ nomycin, gramicidin, and actinomycin D, and in the cell walls of certain micro- 
login organisms. 











CENGAGENOW” ANIMATED FIGURE 4.9 Enantiomeric 
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CRITICAL DEVELOPMENTS IN BIOCHEMISTRY 





Green Fluorescent Protein—The “Light Fantastic” from Jellyfish to Gene Expression 


Aquorea victoria, a species of jellyfish found in the northwest Pacific 
Ocean, contains a green fluorescent protein (GFP) that works 
together with another protein, aequorin, to provide a defense 
mechanism for the jellyfish. When the jellyfish is attacked or 
shaken, aequorin produces a blue light. This light energy is cap- 
tured by GFP, which then emits a bright green flash that presum- 
ably blinds or startles the attacker. Remarkably, the fluorescence of 
GFP occurs without the assistance of a prosthetic group—a “helper 
molecule” that would mediate GFP’s fluorescence. Instead, the 
light-transducing capability of GFP is the result of a reaction be- 
tween three amino acids in the protein itself. As shown below, ad- 
jacent serine, tyrosine, and glycine in the sequence of the protein 
react to form the pigment complex—termed a chromophore. No 
enzymes are required; the reaction is autocatalytic. 

Because the light-transducing talents of GFP depend only on 
the protein itself (upper photo, chromophore highlighted), GFP 
has quickly become a darling of genetic engineering laboratories. 
The promoter of any gene whose cellular expression is of interest 
can be fused to the DNA sequence coding for GFP. Telltale green 
fluorescence tells the researcher when this fused gene has been 
expressed (see lower photo and also Chapter 12). 





< Amino acid substi- 
tutions in GFP can 
tune the color of 
emitted light; exam- 
ples include YFP, 
CEP, and BFP (yel- 
low, cyan, and blue 
fluorescent protein). 
Shown here is an 
image of African 
green monkey 


P \ 
O b SNS Gin 
Phe-Ser-Tyr-Gly-Val-Gln ———> N Val 
64 69 Nw/ 7 
HO O 
H 


\N OL kidney cells express- 
i H ing YFP fused to 
_Phe a-tubulin, a major 


cytoskeletal protein. 
(Image courtesy of 
Michelle E. King and 
George S. Bloom, University 
of Virginia.) 











Despite its widespread acceptance, problems exist with the D,L system of nomen- 
clature. For example, this system can be ambiguous for molecules with two or more 
chiral centers. To address such problems, the (R,S) system of nomenclature for chi- 
ral molecules was proposed in 1956 by Robert Cahn, Sir Christopher Ingold, and 
Vladimir Prelog. In this more versatile system, priorities are assigned to each of the 
groups attached to a chiral center on the basis of atomic number, atoms with higher 
atomic numbers having higher priorities. 

The newer (R,S) system of nomenclature is superior to the older D,L system in one 
important way: The configuration of molecules with more than one chiral center can 
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CRITICAL DEVELOPMENTS IN BIOCHEMISTRY 


The optical activity of quartz and certain other materials was first 
discovered by Jean-Baptiste Biot in 1815 in France, and in 1848 a 
young chemist in Paris named Louis Pasteur made a related and 
remarkable discovery. Pasteur noticed that preparations of opti- 
cally inactive sodium ammonium tartrate contained two visibly 
different kinds of crystals that were mirror images of each other. 
Pasteur carefully separated the two types of crystals, dissolved 
them each in water, and found that each solution was optically ac- 
tive. Even more intriguing, the specific rotations of these two so- 
lutions were equal in magnitude and of opposite sign. Because 
these differences in optical rotation were apparent properties of 


Discovery of Optically Active Molecules and Determination of Absolute Configuration 


J. M. Bijvoet in Utrecht, the Netherlands, used a new X-ray dif- 
fraction technique to show that Emil Fischer’s arbitrary guess 
60 years earlier had been correct. 

It was M. A. Rosanoff, a chemist and instructor at New York Uni- 
versity, who first proposed (in 1906) that the isomers of glycer- 
aldehyde be the standards for denoting the stereochemistry of sug- 
ars and other molecules. Later, when experiments showed that the 
configuration of (+)-glyceraldehyde was related to (+)-glucose, 
(+)-glyceraldehyde was given the designation D. Emil Fischer re- 
jected the Rosanoff convention, but it was universally accepted. 
Ironically, this nomenclature system is often mistakenly referred to 


the dissolved molecules, Pasteur eventually proposed that the as the Fischer convention. 
molecules themselves were mirror images of each other, just like 
their respective crystals. Based on this and other related evi- 
dence, van’t Hoff and LeBel proposed the tetrahedral arrange- 


ment of valence bonds to carbon. 





In 1888, Emil Fischer decided that it should be possible to de- CRO 
termine the relative configuration of (+)-glucose, a six-carbon sugar H—C—oH 
with four asymmetric centers (see figure). Because each of the four 
C could be either of two configurations, glucose conceivably could HO—C—H 
exist in any one of 16 possible isomeric structures. It took 3 years to 
complete the solution of an elaborate chemical and logical puzzle. H—C—OH 
By 1891, Fischer had reduced his puzzle to a choice between two 
enantiomeric structures. (Methods for determining absolute config- H—C— OH 
uration were not yet available, so Fischer made a simple guess, se- 
lecting the structure shown in the figure.) For this remarkable feat, CHOH 


Fischer received the Nobel Prize in Chemistry in 1902. In 1951, A The absolute configuration of (+)-glucose. 











be more easily, completely, and unambiguously described with (R,S) notation. Sev- 
eral amino acids, including isoleucine, threonine, hydroxyproline, and hydroxyly- 
sine, have two chiral centers. In the (R,S) system, L-threonine is (25,3R)-threonine. 


4:5» What Are the Spectroscopic Properties of Amino Acids? 


One of the most important and exciting advances in modern biochemistry has 
been the application of spectroscopic methods, which measure the absorption and 
emission of energy of different frequencies by molecules and atoms. Spectroscopic 
studies of proteins, nucleic acids, and other biomolecules are providing many new 
insights into the structure and dynamic processes in these molecules. 


Phenylalanine, Tyrosine, and Tryptophan Absorb Ultraviolet Light 


Many details of the structure and chemistry of the amino acids have been elucidated 
or at least confirmed by spectroscopic measurements. None of the amino acids ab- 
sorbs light in the visible region of the electromagnetic spectrum. Several of the 
amino acids, however, do absorb ultraviolet radiation, and all absorb in the infrared 
region. The absorption of energy by electrons as they rise to higher-energy states oc- 
curs in the ultraviolet/visible region of the energy spectrum. Only the aromatic 
amino acids phenylalanine, tyrosine, and tryptophan exhibit significant ultraviolet 
absorption above 250 nm, as shown in Figure 4.10. These strong absorptions can be 
used for spectroscopic determinations of protein concentration. The aromatic 
amino acids also exhibit relatively weak fluorescence, and it has recently been 
shown that tryptophan can exhibit phosphorescence—a relatively long-lived emission 
of light. These fluorescence and phosphorescence properties are especially useful 
in the study of protein structure and dynamics. 
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FIGURE 4.10 The ultraviolet absorption spectra of the aromatic amino acids at pH 6. (From Wetlaufer, D. B., 1962. 
Ultraviolet spectra of proteins and amino acids. Advances in Protein Chemistry 17:303-390.) 


Amino Acids Can Be Characterized by Nuclear Magnetic Resonance 


The development in the 1950s of nuclear magnetic resonance (NMR), a spectro- 
scopic technique that involves the absorption of radio frequency energy by certain 
nuclei in the presence of a magnetic field, played an important part in the chemical 
characterization of amino acids and proteins. Several important principles emerged 
from these studies. First, the chemical shift! of amino acid protons depends on their 


A DEEPER LOOK 





The Murchison Meteorite—Discovery of Extraterrestrial Handedness 





The predominance of L-amino acids in biological systems is one of 
life’s intriguing features. Prebiotic syntheses of amino acids would 
be expected to produce equal amounts of L- and D-enantiomers. 
Some kind of enantiomeric selection process must have intervened 
to select L-amino acids over their D-counterparts as the constituents 
of proteins. Was it random chance that chose L- over D-isomers? 
Analysis of carbon compounds—even amino acids—from ex- 
traterrestrial sources might provide deeper insights into this mys- 
tery. John Cronin and Sandra Pizzarello have examined the enan- 
tiomeric distribution of unusual amino acids obtained from the 
Murchison meteorite, which struck the earth on September 28, 
1969, near Murchison, Australia. (By selecting unusual amino 








NH,* NH,* 
CH; — CHə— CH — C — COOH CH; — CHy— C — COOH CHs 
| 
CH; CH; CH3 
2-Amino-2,3-dimethylpentanoic acid“ Isovaline 


*The four stereoisomers of this amino acid include the D- and L-forms of 
a-methylisoleucine and a-methylalloisoleucine. 

Cronin, J. R., and Pizzarello, S., 1997. Enantiomeric excesses in meteoritic 
amino acids. Science 275:951-955. 


acids for their studies, Cronin and Pizzarello ensured that they 
were examining materials that were native to the meteorite and 
not earth-derived contaminants.) Four a-dialkyl amino acids— 
a-methylisoleucine, a-methylalloisoleucine, a-methylnorvaline, 
and isovaline—were found to have an L-enantiomeric excess of 
2% to 9%. 

This may be the first demonstration that a natural L-enantiomer 
enrichment occurs in certain cosmological environments. Could 
these observations be relevant to the emergence of L-enantiomers as 
the dominant amino acids on the earth? And, if so, could there be 
life elsewhere in the universe that is based upon the same amino 
acid handedness? 





NH3t 
CH» — CHy— C — COOH 
CH; ; : : ; 
~ <“ Amino acids found in the Murchison 
a-Methylnorvaline meteorite. 





1The chemical shift for any NMR signal is the difference in resonant frequency between the ob- 
served signal and a suitable reference signal. If two nuclei are magnetically coupled, the NMR sig- 
nals of these nuclei split, and the separation between such split signals, known as the coupling con- 
stant, is likewise dependent on the structural relationship between the two nuclei. 
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CRITICAL DEVELOPMENTS IN BIOCHEMISTRY 


Naming a chiral center in the (R,S) system is accomplished by 
viewing the molecule from the chiral center to the atom with the 
lowest priority. If the other three atoms facing the viewer then de- 


Rules for Description of Chiral Centers in the (R,S) System 


For such purposes, the priorities of certain functional groups 
found in amino acids and related molecules are in the following 
order: 


crease in priority in a clockwise direction, the center is said to 
have the (R) configuration (where R is from the Latin rectus, 
meaning “right”). If the three atoms in question decrease in pri- 
ority in a counterclockwise fashion, the chiral center is of the (S) 
configuration (where S is from the Latin sinistrus, meaning 
“left”). If two of the atoms coordinated to a chiral center are iden- 
tical, the atoms bound to these two are considered for priorities. 


SH > OH > NH: > COOH > CHO > CHOH > CH; 


From this, it is clear that D-glyceraldehyde is (R)-glyceraldehyde 
and L-alanine is (S)-alanine (see figure). Interestingly, the a-carbon 
configuration of all the L-amino acids except for cysteine is (S). Cys- 
teine, by virtue of its thiol group, is in fact (R)-cysteine. 


OH OH 


HOH,C*\— "CHO 


ne 


HO 


HO—C—=H — 


OHC —7"CH,OH 


HO 


O 


CHOH CH,OH 
L-Glyceraldehyde (S)-Glyceraldehyde p-Glyceraldehyde (R)-Glyceraldehyde 
+ 
COOH PE 
+ A 

insi ——> A 
CH; ~OOC ~7 CH; 

t-Alanine (S)-Alanine 


à The assignment of (R) and (S) notation for glyceraldehyde and L-alanine. 











particular chemical environment and thus on the state of ionization of the amino 
acid. Second, the change in electron density during a titration is transmitted 
throughout the carbon chain in the aliphatic amino acids and the aliphatic portions 
of aromatic amino acids, as evidenced by changes in the chemical shifts of relevant 
protons. Finally, the magnitude of the coupling constants between protons on adja- 
cent carbons depends in some cases on the ionization state of the amino acid. This 
apparently reflects differences in the preferred conformations in different ionization 
states. Proton NMR spectra of two amino acids are shown in Figure 4.11. Because 
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FIGURE 4.11 Proton NMR spectra of several amino acids. Zero on the chemical shift scale is defined by the res- 
onance of tetramethylsilane (TMS). (The large resonance at approximately 5 ppm is due to the normal HDO 
impurity in the D20 solvent.) (Adapted from Aldrich Library of NMR Spectra.) 


4.6 How Are Amino Acid Mixtures Separated and Analyzed? 
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FIGURE 4.12 A plot of chemical shifts versus pH for the carbons of lysine. Changes in chemical shift are most 
pronounced for atoms near the titrating groups. Note the correspondence between the pK, values and the par- 
ticular chemical shift changes. All chemical shifts are defined relative to tetramethylsilane (TMS). (From Suprenant, 
H, et al., 1980. Carbon-13 NMR studies of amino acids: Chemical shifts, protonation shifts, microscopic protonation behavior. Jour- 
nal of Magnetic Resonance 40:231-243.) 


they are highly sensitive to their environment, the chemical shifts of individual NMR 
signals can detect the pH-dependent ionizations of amino acids. Figure 4.12 shows 
the °C chemical shifts occurring in a titration of lysine. Note that the chemical shifts 
of the carboxyl C, C,, and Cg carbons of lysine are sensitive to dissociation of the 
nearby a-COOH and a-NH3;* protons (with pK, values of about 2 and 9, respec- 
tively), whereas the Cs and Ce carbons are sensitive to dissociation of the e-NH3~* 
group. Such measurements have been very useful for studies of the ionization be- 
havior of amino acid residues in proteins. More sophisticated NMR measurements at 
very high magnetic fields are also used to determine the three-dimensional struc- 
tures of peptides and proteins. 


4:6) How Are Amino Acid Mixtures Separated and Analyzed? 
Amino Acids Can Be Separated by Chromatography 


A wide variety of methods is available for the separation and analysis of amino acids 
(and other biological molecules and macromolecules). All of these methods take 
advantage of the relative differences in the physical and chemical characteristics of 
amino acids, particularly ionization behavior and solubility characteristics. Separa- 
tions of amino acids are usually based on partition properties (the tendency to as- 
sociate with one solvent or phase over another) and separations based on electrical 
charge. In all of the partition methods discussed here, the molecules of interest are 
allowed (or forced) to flow through a medium consisting of two phases—solid— 
liquid, liquid-liquid, or gas-liquid. The molecules partition, or distribute them- 
selves, between the two phases in a manner based on their particular properties and 
their consequent preference for associating with one or the other phase. 

In 1903, a separation technique based on repeated partitioning between phases 
was developed by Mikhail Tswett for the separation of plant pigments (carotenes and 
chlorophylls). Due to the colorful nature of the pigments thus separated, Tswett 
called his technique chromatography. This term is now applied to a wide variety of 
separation methods, regardless of whether the products are colored. The success of 
all chromatography techniques depends on the repeated microscopic partitioning of 
a solute mixture between the available phases. The more frequently this partitioning 
can be made to occur within a given time span or over a given volume, the more ef- 
ficient is the resulting separation. Chromatographic methods have advanced rapidly 
in recent years, due in part to the development of sophisticated new solid-phase 
materials. Methods important for amino acid separations include ion exchange 
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FIGURE 4.13 Gradient separation of common PTH-amino acids, which absorb UV light. Absorbance was moni- 
tored at 269 nm. PTH peaks are identified by single-letter notation for amino acid residues and by other abbre- 
viations. D, Asp; CMC, carboxymethyl Cys; E, Glu; N, Asn; S, Ser; Q, Gln; H, His; T, Thr; G, Gly; R, Arg; MO2, Met sulfox- 
ide; A, Ala; Y, Tyr; M, Met; V, Val; P, Pro; W, Trp; K, Lys; F, Phe; |, lle; L, Leu. See Figure 4.8a for PTH derivatization. (Adapted 
from Persson, B., and Eaker, D., 1990. An optimized procedure for the separation of amino acid phenylthiohydantoins by re- 
versed phase HPLC. Journal of Biochemical and Biophysical Methods 21:341-350.) 


chromatography, gas chromatography (GC), and high-performance liquid chroma- 
tography (HPLC). 

A typical HPLC chromatogram using precolumn modification of amino acids to 
form phenylthiohydantoin (PTH) derivatives is shown in Figure 4.13. HPLC is the 
chromatographic technique of choice for most modern biochemists. The very high 
resolution, excellent sensitivity, and high speed of this technique usually outweigh 
the disadvantage of relatively low capacity. 


47+ What Is the Fundamental Structural Pattern in Proteins? 


Chemically, proteins are unbranched polymers of amino acids linked head to tail, 
from carboxyl group to amino group, through formation of covalent peptide 
bonds, a type of amide linkage (Figure 4.14). 

Peptide bond formation results in the release of HO. The peptide “backbone” 
of a protein consists of the repeated sequence —N—C,—C,—, where the N repre- 


-~> 


H,O 





Amino acid 1 Amino acid 2 Dipeptide 


CENGAGENOW” ANIMATED FIGURE 4.14 Peptide formation is the creation of an amide bond between the 
carboxyl group of one amino acid and the amino group of another amino acid. See this figure animated at 
www.cengage.com/login 
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CENGAGENOW” ANIMATED FIGURE 4.15 The peptide bond is shown in its usual trans conformation of car- 
bonyl O and amide H.The C, atoms are the a-carbons of two adjacent amino acids joined in peptide linkage. 
The dimensions and angles are the average values observed by crystallographic analysis of amino acids and 
small peptides. The peptide bond is the light-colored bond between C and N. (Adapted from Ramachandran, G.N, 
et al, 1974. The mean geometry of the peptide unit from crystal structure data. Biochimica et Biophysica Acta 359:298-302.) See 
this figure animated at www.cengage.com/login 


sents the amide nitrogen, the C, is the a-carbon atom of an amino acid in the poly- 
mer chain, and the final C, is the carbonyl carbon of the amino acid, which in turn 
is linked to the amide N of the next amino acid down the line. The geometry of the 
peptide backbone is shown in Figure 4.15. Note that the carbonyl oxygen and the 
amide hydrogen are trans to each other in this figure. This conformation is favored 
energetically because it results in less steric hindrance between nonbonded atoms 
in neighboring amino acids. Because the a-carbon atom of the amino acid is a chi- 
ral center (in all amino acids except glycine), the polypeptide chain is inherently 
asymmetric. Only L-amino acids are found in proteins. 


The Peptide Bond Has Partial Double-Bond Character 


The peptide linkage is usually portrayed by a single bond between the carbonyl car- 
bon and the amide nitrogen (Figure 4.16a). Therefore, in principle, rotation may 
occur about any covalent bond in the polypeptide backbone because all three 
kinds of bonds (N—C,, C,—C,, and the C,—N peptide bond) are single bonds. In 
this representation, the C, and N atoms of the peptide grouping are both in pla- 
nar sp* hybridization and the C, and O atoms are linked by a 7 bond, leaving the 
nitrogen with a lone pair of electrons in a 2p orbital. However, another resonance 
form for the peptide bond is feasible in which the C, and N atoms participate in a 
m bond, leaving a lone e pair on the oxygen (Figure 4.16b). This structure pre- 
vents free rotation about the C,—N peptide bond because it becomes a double 
bond. The real nature of the peptide bond lies somewhere between these ex- 
tremes; that is, it has partial double-bond character, as represented by the inter- 
mediate form shown in Figure 4.16c. 

Peptide bond resonance has several important consequences. First, it restricts 
free rotation around the peptide bond and leaves the peptide backbone with only 
two degrees of freedom per amino acid group: rotation around the N—C, bond 
and rotation around the C,—C, bond.' Second, the six atoms composing the pep- 
tide bond group tend to be coplanar, forming the so-called amide plane of the 
polypeptide backbone (Figure 4.17). Third, the C,—N bond length is 0.133 nm, 
which is shorter than normal C—N bond lengths (for example, the C,—N bond 


'The angle of rotation about the N—C, bond is designated ¢, phi, whereas the C,—C, angle of ro- 
tation is designated yw, psi. 
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FIGURE 4.17 The coplanar relationship of the atoms in 
the amide group is highlighted as an imaginary shaded 
plane lying between two successive a-carbon atoms in 
the peptide backbone. (Illustration: Irving Geis. Rights owned 
by Howard Hughes Medical Institute. Not to be reproduced 
without permission.) 
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keep the amide group planar. 


CENGAGENOW” ACTIVE FIGURE 4.16 The partial double-bond character of the peptide bond. Resonance 
interactions among the carbon, oxygen, and nitrogen atoms of the peptide group can be represented by two 
resonance extremes (a and b). (a) The usual way the peptide atoms are drawn. (b) In an equally feasible form, 
the peptide bond is now a double bond; the amide N bears a positive charge and the carbonyl O has a nega- 
tive charge. (c) The actual peptide bond is best described as a resonance hybrid of the forms in (a) and (b). 
Significantly, all of the atoms associated with the peptide group are coplanar, rotation about Ca—N is 
restricted, and the peptide is distinctly polar. (Illustration: Irving Geis. Rights owned by Howard Hughes Medical Institute. 
Not to be reproduced without permission.) Test yourself on the concepts in this figure at www.cengage 
.com/login 
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of 0.145 nm) but longer than typical C=N bonds (0.125 nm). The peptide bond 
is estimated to have 40% double-bond character. 


The Polypeptide Backbone Is Relatively Polar 


Peptide bond resonance also causes the peptide backbone to be relatively polar. As 
shown in Figure 4.16b, the amide nitrogen is in a protonated or positively charged 
form, and the carbonyl oxygen is a negatively charged atom in this double-bonded 
resonance state. In actuality, the hybrid state of the partially double-bonded peptide 
arrangement gives a net positive charge of 0.28 on the amide N and an equivalent 
net negative charge of 0.28 on the carbonyl O. The presence of these partial 
charges means that the peptide bond has a permanent dipole. Nevertheless, the 
peptide backbone is relatively unreactive chemically, and protons are gained or lost 
by the peptide groups only at extreme pH conditions. 


Peptides Can Be Classified According to How Many 
Amino Acids They Contain 


Peptide is the name assigned to short polymers of amino acids. Peptides are classi- 
fied according to the number of amino acid units in the chain. Each unit is called an 
amino acid residue, the word residue denoting what is left after the release of HO 
when an amino acid forms a peptide link upon joining the peptide chain. Dipeptides 
have two amino acid residues, tripeptides have three, tetrapeptides four, and so on. 
After about 12 residues, this terminology becomes cumbersome, so peptide chains 
of more than 12 and less than about 20 amino acid residues are usually referred to 
as oligopeptides, and when the chain exceeds several dozen amino acids in length, 
the term polypeptide is used. The distinctions in this terminology are not precise. 


Proteins Are Composed of One or More Polypeptide Chains 


The terms polypeptide and protein are used interchangeably in discussing single 
polypeptide chains. The term protein broadly defines molecules composed of one 
or more polypeptide chains. Proteins with one polypeptide chain are monomeric 
proteins. Proteins composed of more than one polypeptide chain are multimeric 
proteins. Multimeric proteins may contain only one kind of polypeptide, in which 
case they are homomultimeric, or they may be composed of several different kinds 
of polypeptide chains, in which instance they are heteromultimeric. Greek letters 
and subscripts are used to denote the polypeptide composition of multimeric pro- 
teins. Thus, an &œytype protein is a dimer of identical polypeptide subunits, or a 
homodimer. Hemoglobin (Table 4.2) consists of four polypeptides of two different 
kinds; it is an a 9B, heteromultimer. 

Polypeptide chains of proteins typically range in length from about 100 amino 
acids to around 2000, the number found in each of the two polypeptide chains of 
myosin, the contractile protein of muscle. However, exceptions abound, including 
human cardiac muscle titin, which has 26,926 amino acid residues and a molecular 
weight of 2,993,497. The average molecular weight of polypeptide chains in eu- 
karyotic cells is about 31,700, corresponding to about 270 amino acid residues. 
Table 4.2 is a representative list of proteins according to size. The molecular weights 
(M,) of proteins can be estimated by a number of physicochemical methods such as 
polyacrylamide gel electrophoresis or ultracentrifugation (see Appendix to Chap- 
ter 5). Precise determinations of protein molecular masses can be obtained by sim- 
ple calculations based on knowledge of their amino acid sequence, which is often 
available in genome databases. No simple generalizations correlate the size of pro- 
teins with their functions. For instance, the same function may be fulfilled in dif- 
ferent cells by proteins of different molecular weight. The Escherichia coli enzyme 
responsible for glutamine synthesis (a protein known as glutamine synthetase) has a 
molecular weight of 600,000, whereas the analogous enzyme in brain tissue has 
a molecular weight of 380,000. 
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TABLE 4.2 











Insulin 
Cytochrome c 





Hemoglobin 











Ribonuclease 





Immunoglobulin 


Protein M, Number of Residues per Chain Subunit Organization 
Insulin (bovine) 5,733 21 (A) aß 
30 (B) 
Cytochrome c (equine) 12,500 104 a 
Ribonuclease A (bovine pancreas) 12,640 124 a 
Lysozyme (egg white) 13,930 129 æ 
Myoglobin (horse) 16,980 153 a 
Chymotrypsin (bovine pancreas) 22,600 13 (a) aBy 
132 (B) 
97 (y) 
Hemoglobin (human) 64,500 141 (a) Abo 
146 (8) 
Serum albumin (human) 68,500 550 Qy 
Hexokinase (yeast) 96,000 200 a4 
y-Globulin (horse) 149,900 214 (a) AsB5 
446 (B) 
Glutamate dehydrogenase (liver) 332,694 500 a 
Myosin (rabbit) 470,000 2,000 (heavy, h) haa Bo 
190 (a) 
149 (a’) 
160 (8) 
Ribulose bisphosphate carboxylase (spinach) 560,000 475 (a) asBs 
123 (8) 
Glutamine synthetase (E. coli) 600,000 468 Q192 











Glutamine synthetase 








*lllustrations of selected proteins listed in Table 4.2 are drawn to constant scale. 
Adapted from Goodsell, D. S., and Olson, A. J., 1993. Soluble proteins: Size, shape and function. Trends in Biochemical Sciences 18:65-68. 
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SUMMARY 


4.1 What Are the Structures and Properties of Amino Acids? The 
central tetrahedral alpha (æ) carbon (C,) atom of typical amino acids is 
linked covalently to both the amino group and the carboxyl group. Also 
bonded to this a-carbon are a hydrogen and a variable side chain. It is 
the side chain, the so-called R group, that gives each amino acid its iden- 
tity. In neutral solution (pH 7), the carboxyl group exists as —COO- 
and the amino group as —NH;*. The amino and carboxyl groups of 
amino acids can react in a head-to-tail fashion, eliminating a water mol- 
ecule and forming a covalent amide linkage, which, in the case of pep- 
tides and proteins, is typically referred to as a peptide bond. Amino 
acids are also chiral molecules. With four different groups attached to 
it, the a-carbon is said to be asymmetric. The two possible configura- 
tions for the a-carbon constitute nonidentical mirror-image isomers or 
enantiomers. The structures of the 20 common amino acids are 
grouped into the following categories: (1) nonpolar or hydrophobic 
amino acids, (2) neutral (uncharged) but polar amino acids, (3) acidic 
amino acids (which have a net negative charge at pH 7.0), and (4) ba- 
sic amino acids (which have a net positive charge at neutral pH). 


4.2 What Are the Acid-Base Properties of Amino Acids? The com- 
mon amino acids are all weak polyprotic acids. The ionizable groups are 
not strongly dissociating ones, and the degree of dissociation thus de- 
pends on the pH of the medium. All the amino acids contain at least two 
dissociable hydrogens. The side chains of several of the amino acids also 
contain dissociable groups. Thus, aspartic and glutamic acids contain an 
additional carboxyl function, and lysine possesses an aliphatic amino 
function. Histidine contains an ionizable imidazolium proton, and argi- 
nine carries a guanidinium function. 


4.3 What Reactions Do Amino Acids Undergo? The reactivities of 
amino acids are essential to the degradation, sequencing, and chemical 
synthesis of peptides and proteins. Reaction with phenylisthiocyanate 
(Edman reagent) forms PTH derivatives of amino acids, which can be 
easily identified and quantified. 


4.4 What Are the Optical and Stereochemical Properties of Amino 
Acids? Except for glycine, all of the amino acids isolated from proteins 


are said to be asymmetric or chiral (from the Greek chein meaning 
“hand”), and the two possible configurations for the a-carbon constitute 
nonsuperimposable mirror-image isomers, or enantiomers. Enantiomeric 
molecules display a special property called optical activity—the ability to 
rotate the plane of polarization of plane-polarized light. The magnitude 
and direction of the optical rotation depend on the nature of the amino 
acid side chain. 


4.5 What Are the Spectroscopic Properties of Amino Acids? Many de- 
tails of the structure and chemistry of the amino acids have been eluci- 
dated or at least confirmed by spectroscopic measurements. None of 
the amino acids absorbs light in the visible region of the electromag- 
netic spectrum. Several of the amino acids, however, do absorb ultravi- 
olet radiation, and all absorb in the infrared region. Proton NMR spec- 
tra of amino acids are highly sensitive to their environment, and the 
chemical shifts of individual NMR signals can detect the pH-dependent 
ionizations of amino acids. 


4.6 How Are Amino Acid Mixtures Separated and Analyzed? Separa- 
tion can be achieved on the basis of the relative differences in the phys- 
ical and chemical characteristics of amino acids, particularly ionization 
behavior and solubility characteristics. The methods important for 
amino acids include separations based on partition properties and sep- 
arations based on electrical charge. HPLC is the chromatographic tech- 
nique of choice for most modern biochemists. The very high resolution, 
excellent sensitivity, and high speed of this technique usually outweigh 
the disadvantage of relatively low capacity. 


4.7 What Is the Fundamental Structural Pattern in Proteins? Proteins 
are linear polymers joined by peptide bonds. The defining characteris- 
tic of a protein is the amino acid sequence. The partial double-bonded 
character of the peptide bond has profound influences on protein con- 
formation. Proteins are also classified according to the length of their 
polypeptide chains (how many amino acid residues they contain) and 
the number and kinds of polypeptide chains. 


PROBLEMS 


CENGAGENOW*” Preparing for an exam? Create you own study path for this 
chapter at www.cengage.com/login 


1. Without consulting chapter figures, draw Fischer projection for- 
mulas for glycine, aspartate, leucine, isoleucine, methionine, and 
threonine. 

2. Without reference to the text, give the one-letter and three-letter 
abbreviations for asparagine, arginine, cysteine, lysine, proline, ty- 
rosine, and tryptophan. 

3. Write equations for the ionic dissociations of alanine, glutamate, 
histidine, lysine, and phenylalanine. 

4. How is the pK, of the a-NH3* group affected by the presence on an 
amino acid of the a-COO™? 

5. (Integrates with Chapter 2.) Draw an appropriate titration curve for 
aspartic acid, labeling the axes and indicating the equivalence 
points and the pK, values. 

6. (Integrates with Chapter 2.) Calculate the concentrations of all 
ionic species in a 0.25 M solution of histidine at pH 2, pH 6.4, and 
pH 9.3. 

7. (Integrates with Chapter 2.) Calculate the pH at which the y-carboxyl 
group of glutamic acid is two-thirds dissociated. 

8. (Integrates with Chapter 2.) Calculate the pH at which the e-amino 
group of lysine is 20% dissociated. 


9. (Integrates with Chapter 2.) Calculate the pH of a 0.3 M solution of 
(a) leucine hydrochloride, (b) sodium leucinate, and (c) isoelectric 
leucine. 

10. Absolute configurations of the amino acids are referenced to D- and 
L-glyceraldehyde on the basis of chemical transformations that can 
convert the molecule of interest to either of these reference iso- 
meric structures. In such reactions, the stereochemical conse- 
quences for the asymmetric centers must be understood for each re- 
action step. Propose a sequence of reactions that would demonstrate 
that L(—)-serine is stereochemically related to L(—)-glyceraldehyde. 

11. Describe the stereochemical aspects of the structure of cystine, the 
structure that is a disulfide-linked pair of cysteines. 


12. Draw a simple mechanism for the reaction of a cysteine sulfhydryl 
group with iodoacetamide. 


13. A previously unknown protein has been isolated in your laboratory. 
Others in your lab have determined that the protein sequence con- 
tains 172 amino acids. They have also determined that this protein 
has no tryptophan and no phenylalanine. You have been asked to 
determine the possible tyrosine content of this protein. You know 
from your study of this chapter that there is a relatively easy way to 
do this. You prepare a pure 50 uM solution of the protein, and you 
place it in a sample cell with a l-cm path length, and you measure 
the absorbance of this sample at 280 nm in a UV-visible spectro- 
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photometer. The absorbance of the solution is 0.372. Are there ty- 
rosines in this protein? How many? (Hint: You will need to use 
Beer’s Law, which is described in any good general chemistry or 
physical chemistry textbook. You will also find it useful to know that 
the units of molar absorptivity are M~'cm7!.) 

14. The simple average molecular weight of the 20 common amino 
acids is 138, but most biochemists use 110 when estimating the num- 
ber of amino acids in a protein of known molecular weight. Why do 
you suppose this is? (Hint: there are two contributing factors to the 
answer. One of them will be apparent from a brief consideration of 
the amino acid compositions of common proteins. See for example 
Figure 5.16 of this text.) 

15. The artificial sweeteners Equal and Nutrasweet contain aspartame, 
which has the structure: 











CO; 
| 
CHo i CHo 1 
+ | | 
H3N— CH— C— NH— CH— C— OCH, 
Aspartame 


What are the two amino acids that are components of aspartame? 
What kind of bond links these amino acids? What do you suppose 


might happen if a solution of aspartame was heated for several hours 
at a pH near neutrality? Suppose you wanted to make hot chocolate 
sweetened only with aspartame, and you stored it in a thermos for 
several hours before drinking it. What might it taste like? 

16. Individuals with phenylketonuria must avoid dietary phenylalanine 
because they are unable to convert phenylalanine to tyrosine. Look 
up this condition and find out what happens if phenylalanine accu- 
mulates in the body. Would you advise a person with phenylke- 
tonuria to consume foods sweetened with aspartame? Why or 
why not? 

17. In this chapter, the concept of prochirality was discussed. Citrate 
(see Figure 19.2) is a prochiral molecule. Describe the process by 
which you would distinguish between the (R—) and (S—) portions 
of this molecule and how an enzyme could discriminate between 
similar but distinct moieties. 

18. Amino acids are frequently used as buffers. Describe the pH range 
of acceptable buffering behavior for the amino acids alanine, histi- 
dine, aspartic acid, and lysine. 


Preparing for the MCAT Exam 

19. Although the other common amino acids are used as buffers, cys- 
teine is rarely used for this purpose. Why? 

20. Draw all the possible isomers of threonine and assign (R,S) nomen- 
clature to each. 
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Proteins: Their Primary 
Structure and Biological 
Functions 





ESSENTIAL QUESTIONS 


Proteins are polymers composed of hundreds or even thousands of amino acids 
linked in series by peptide bonds. 


What structural forms do these polypeptide chains assume, how can the se- 
quence of amino acids in a protein be determined, and what are the biological 
roles played by proteins? 


Proteins are a diverse and abundant class of biomolecules, constituting more than 
50% of the dry weight of cells. Their diversity and abundance reflect the central 
role of proteins in virtually all aspects of cell structure and function. An extraordi- 
nary diversity of cellular activity is possible only because of the versatility inherent 
in proteins, each of which is specifically tailored to its biological role. The pattern 
by which each is tailored resides within the genetic information of cells, encoded 
in a specific sequence of nucleotide bases in DNA. Each such segment of encoded 
information defines a gene, and expression of the gene leads to synthesis of the 
specific protein encoded by it, endowing the cell with the functions unique to that 
particular protein. Proteins are the agents of biological function; they are also the 
expressions of genetic information. 


Sala What Architectural Arrangements Characterize 
Protein Structure? 


Proteins Fall into Three Basic Classes According to Shape and Solubility 


As a first approximation, proteins can be assigned to one of three global classes on 
the basis of shape and solubility: fibrous, globular, or membrane (Figure 5.1). Fibrous 
proteins tend to have relatively simple, regular linear structures. These proteins often 
serve structural roles in cells. Typically, they are insoluble in water or in dilute salt so- 
lutions. In contrast, globular proteins are roughly spherical in shape. The polypeptide 
chain is compactly folded so that hydrophobic amino acid side chains are in the in- 
terior of the molecule and the hydrophilic side chains are on the outside exposed to 
the solvent, water. Consequently, globular proteins are usually very soluble in aqueous 
solutions. Most soluble proteins of the cell, such as the cytosolic enzymes, are globu- 
lar in shape. Membrane proteins are found in association with the various membrane 
systems of cells. For interaction with the nonpolar phase within membranes, mem- 
brane proteins have hydrophobic amino acid side chains oriented outward. As such, 
membrane proteins are insoluble in aqueous solutions but can be solubilized in so- 
lutions of detergents. Membrane proteins characteristically have fewer hydrophilic 
amino acids than cytosolic proteins. 


Protein Structure Is Described in Terms of Four Levels of Organization 


The architecture of protein molecules is quite complex. Nevertheless, this com- 
plexity can be resolved by defining various levels of structural organization. 


Primary Structure The amino acid sequence is, by definition, the primary (1°) 
structure of a protein, such as that for bovine pancreatic RNase in Figure 5.2, for 
example. 


© Jan Halaska/Photo Researchers, Inc. 





Although helices sometimes appear as decorative or 
utilitarian motifs in manmade structures, they are a com- 
mon structural theme in biological macromolecules— 
proteins, nucleic acids, and even polysaccharides. 





...by small and simple things are great things 
brought to pass. 


ALMA 37.6 
The Book of Mormon 





KEY QUESTIONS 


5.1 What Architectural Arrangements 
Characterize Protein Structure? 

















5.2 ow Are Proteins Isolated and Purified from 
Cells? 

5.3 ow Is the Amino Acid Analysis of Proteins 
Performed? 

5.4 ow Is the Primary Structure of a Protein 
Determined? 

5.5 What Is the Nature of Amino Acid 
Sequences? 

5.6 Can Polypeptides Be Synthesized in the 
Laboratory? 

5.7 Do Proteins Have Chemical Groups Other 
Than Amino Acids? 

5.8 What Are the Many Biological Functions of 


Proteins? 
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(a) 


Collagen, 
a fibrous protein 





Myoglobin, a globular protein Bacteriorhodopsin, a membrane protein 


FIGURE 5.1 (a) Proteins having structural roles in cells are typically fibrous and often water insoluble. (b) Myoglo- 
bin is a globular protein. (c) Membrane proteins fold so that hydrophobic amino acid side chains are exposed in 
their membrane-associated regions. Bacteriorhodopsin binds the light-absorbing pigment, cis-retinal, shown here 
in blue. 





FIGURE 5.2 Bovine pancreatic ribonuclease A contains 124 amino acid residues, none of which are tryptophan. 
Four intrachain disulfide bridges (S—S) form crosslinks in this polypeptide between Cys% and Cys, Cys"? and 
Cys”, Cys*®* and Cys!!°, and Cys® and Cys’. 
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Secondary Structure Through hydrogen-bonding interactions between adjacent 
amino acid residues (discussed in detail in Chapter 6), the polypeptide chain can 
arrange itself into characteristic helical or pleated segments. These segments con- 
stitute structural conformities, so-called regular structures, which extend along one 
dimension, like the coils of a spring. Such architectural features of a protein are 
designated secondary (2°) structures (Figure 5.3). Secondary structures are just one 
of the higher levels of structure that represent the three-dimensional arrangement 
of the polypeptide in space. 


Tertiary Structure When the polypeptide chains of protein molecules bend and 
fold in order to assume a more compact three-dimensional shape, the tertiary (3°) 
level of structure is generated (Figure 5.4). It is by virtue of their tertiary structure 
that proteins adopt a globular shape. A globular conformation gives the lowest sur- 
face-to-volume ratio, minimizing interaction of the protein with the surrounding 
environment. 


Quaternary Structure Many proteins consist of two or more interacting poly- 
peptide chains of characteristic tertiary structure, each of which is commonly re- 
ferred to as a subunit of the protein. Subunit organization constitutes another level 
in the hierarchy of protein structure, defined as the protein’s quaternary (4°) struc- 
ture (Figure 5.5). Questions of quaternary structure address the various kinds of 
subunits within a protein molecule, the number of each, and the ways in which they 
interact with one another. 


a-Helix B-Strand 

Only the N — Cg — C backbone The N — Cg— Co backbone as well 
is represented. The vertical line as the Cg of R groups are represented 
is the helix axis. here. Note that the amide planes 


are perpendicular to the page. 





“Shorthand” a-helix “Shorthand” B-strand 
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FIGURE 5.3 The a-helix and the B-pleated strand are 
the two principal secondary structures found in protein. 
Simple representations of these structures are the flat, 
helical ribbon for the a-helix and the flat, wide arrow for 
B-structures. 
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(a) Chymotrypsin tertiary structure (b) 
ey. 
LY, 
ONS Oo 
\; 
P j 








a-Chains 


FIGURE 5.5 Hemoglobin is a tetramer consisting of two 
æ and two B polypeptide chains. 


Chymotrypsin ribbon 





Chymotrypsin space-filling model 


FIGURE 5.4 Folding of the polypeptide chain into a compact, roughly spherical conformation creates the ter- 
tiary level of protein structure. Shown here are (a) a tracing showing the position of all of the C, carbon atoms, 
(b) a ribbon diagram that shows the three-dimensional track of the polypeptide chain, and (c) a space-filling 
representation of the atoms as spheres. The protein is chymotrypsin. 


Noncovalent Forces Drive Formation of the Higher Orders 
of Protein Structure 


Whereas the primary structure of a protein is determined by the covalently linked 
amino acid residues in the polypeptide backbone, secondary and higher orders of 
structure are determined principally by noncovalent forces such as hydrogen bonds 
and ionic, van der Waals, and hydrophobic interactions. It is important to empha- 
size that all the information necessary for a protein molecule to achieve its intricate architec- 
ture is contained within its 1° structure, that is, within the amino acid sequence of its 
polypeptide chain(s). Chapter 6 presents a detailed discussion of the 2°, 3°, and 4° 
structure of protein molecules. 


A Protein’s Conformation Can Be Described as Its Overall 
Three-Dimensional Structure 


The overall three-dimensional architecture of a protein is generally referred to as 
its conformation. This term is not to be confused with configuration, which denotes 
the geometric possibilities for a particular set of atoms (Figure 5.6). In going from 
one configuration to another, covalent bonds must be broken and rearranged. In 
contrast, the conformational possibilities of a molecule are achieved without breaking 
any covalent bonds. In proteins, rotations about each of the single bonds along the 
peptide backbone have the potential to alter the course of the polypeptide chain in 
three-dimensional space. These rotational possibilities create many possible orien- 
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FIGURE 5.6 Configuration and conformation are not synonymous. (a) Rearrangements between 
configurational alternatives of a molecule can be achieved only by breaking and remaking 
bonds, as in the transformation between the D- and L-configurations of glyceraldehyde. (b) The 
intrinsic free rotation around single covalent bonds creates a great variety of three-dimensional 
conformations, even for relatively simple molecules, such as 1,2-dichloroethane. (c) Imagine the 
conformational possibilities for a protein in which two of every three bonds along its backbone 
are freely rotating single bonds. (Illustration: Irving Geis. Rights owned by Howard Hughes Medical Institute. 
Not to be reproduced without permission.) 


tations for the protein chain, referred to as its conformational possibilities. Of the 
great number of theoretical conformations a given protein might adopt, only a very 
few are favored energetically under physiological conditions. At this time, the rules 
that direct the folding of protein chains into energetically favorable conformations 
are still not entirely clear; accordingly, they are the subject of intensive contempo- 
rary research. 


5:2» How Are Proteins Isolated and Purified from Cells? 


Cells contain thousands of different proteins. A major problem for protein chemists 
is to purify a chosen protein so that they can study its specific properties in the ab- 
sence of other proteins. Proteins can be separated and purified on the basis of their 
two prominent physical properties: size and electrical charge. A more direct approach 
is to use affinity purification strategies that take advantage of the biological function 
or specific recognition properties of a protein (see Chapter Appendix). 


A Number of Protein Separation Methods Exploit Differences 
in Size and Charge 


Separation methods based on size include size exclusion chromatography, ultrafil- 
tration, and ultracentrifugation (see Chapter Appendix). The ionic properties of 
peptides and proteins are determined principally by their complement of amino 
acid side chains. Furthermore, the ionization of these groups is pH-dependent. 

A variety of procedures have been designed to exploit the electrical charges 
on a protein as a means to separate proteins in a mixture. These procedures in- 
clude ion exchange chromatography, electrophoresis (see Chapter Appendix), 
and solubility. Proteins tend to be least soluble at their isoelectric point, the pH 
value at which the sum of their positive and negative electrical charges is zero. At 
this pH, electrostatic repulsion between protein molecules is minimal and they 
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Estimation of Protein Concentrations in Solutions of Biological Origin 


Biochemists are often interested in knowing the protein concen- 
tration in various preparations of biological origin. Such quantita- 
tive analysis is not straightforward. Cell extracts are complex mix- 
tures that typically contain protein molecules of many different 
molecular weights, so the results of protein estimations cannot be 
expressed on a molar basis. Also, aside from the rather unreactive 


color upon noncovalent binding to proteins. The binding is quan- 
titative and less sensitive to variations in the protein's amino acid 
composition. The color change is easily measured by a spec- 
trophotometer. A similar, very sensitive method capable of quanti- 
fying nanogram amounts of protein is based on the shift in color 
of colloidal gold upon binding to proteins. 


repeating peptide backbone, little common chemical identity is 
seen among the many proteins found in cells that might be readi- 
ly exploited for exact chemical analysis. Most of their chemical 
properties vary with their amino acid composition, for example, 
nitrogen or sulfur content or the presence of aromatic, hydroxyl, 
or other functional groups. 

Several methods rely on the reduction of Cu®* ions to Cu* by 
readily oxidizable protein components, such as cysteine or the 
phenols and indoles of tyrosine and tryptophan. For example, 
bicinchoninic acid (BCA) forms a purple complex with Cu* in alka- 
line solution, and the amount of this product can be easily mea- 
sured spectrophotometrically to provide an estimate of protein 
concentration. 

Other assays are based on dye binding by proteins. The Brad- 
ford assay is a rapid and reliable technique that uses a dye called 
Coomassie Brilliant Blue G-250, which undergoes a change in its 


-OOC 


Cu’ + BCA —> 





O complex 








are more likely to coalesce and precipitate out of solution. Ionic strength also 
profoundly influences protein solubility. Most globular proteins tend to become 
increasingly soluble as the ionic strength is raised. This phenomenon, the 
salting-in of proteins, is attributed to the diminishment of electrostatic attrac- 
tions between protein molecules by the presence of abundant salt ions. Such 
electrostatic interactions between the protein molecules would otherwise lead to 
precipitation. However, as the salt concentration reaches high levels (greater 
than 1 M), the effect may reverse so that the protein is salted out of solution. In 
such cases, the numerous salt ions begin to compete with the protein for waters 
of solvation, and as they win out, the protein becomes insoluble. The solubility 
properties of a typical protein are shown in Figure 5.7. 

Although the side chains of nonpolar amino acids in soluble proteins are 
usually buried in the interior of the protein away from contact with the aqueous 
solvent, a portion of them may be exposed at the protein’s surface, giving it a 
partially hydrophobic character. Hydrophobic interaction chromatography is a 
protein purification technique that exploits this hydrophobicity (see Chapter 
Appendix). 


oo 


Solubility, milligrams of protein per milliliter 
no 
Ee ES ST Py 
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A Typical Protein Purification Scheme Uses a Series 
of Separation Methods 


Most purification procedures for a particular protein are developed in an empir- 
ical manner, the overriding principle being purification of the protein to a 
homogeneous state with acceptable yield. Table 5.1 presents a summary of a pu- 
rification scheme for a desired enzyme. Note that the specific activity of the enzyme 
in the immunoaffinity purified fraction (fraction 5) has been increased 
152/0.108, or 1407 times the specific activity in the crude extract (fraction 1). 
Thus, the concentration of this protein has been enriched more than 1400-fold by 
the purification procedure. 





© 
: 


FIGURE 5.7 The solubility of most globular proteins is 
markedly influenced by pH and ionic strength. This figure 
shows the solubility of a typical protein as a function of 
pH and various salt concentrations. 
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TABLE 5.1 | 


Volume Total Total Specific Percent 





Fraction (mL) Protein (mg) Activity* Activityt Recovery* 
1. Crude extract 3,800 22,800 2,460 0.108 100 
2. Salt precipitate 165 2,800 1,190 0.425 48 
3. Ion exchange chromatography 65 100 720 7.2 29 
4. Molecular sieve chromatography 40 14.5 555 38.3 23 
5. Immunoaffinity chromatography’ 6 1.8 275 152 ll 








*The relative enzymatic activity of each fraction is cited as arbitrarily defined units. 

‘The specific activity is the total activity of the fraction divided by the total protein in the fraction. This value gives an indication of the increase in purity attained during the course of the 
purification as the samples become enriched for the enzyme. 

+The percent recovery of total activity is a measure of the yield of the desired enzyme. 

SThe last step in the procedure is an affinity method in which antibodies specific for the enzyme are covalently coupled to a chromatography matrix and packed into a glass tube to make a 
chromatographic column through which fraction 4 is passed. The enzyme is bound by this immunoaffinity matrix while other proteins pass freely out. The enzyme is then recovered by 
passing a strong salt solution through the column, which dissociates the enzyme-antibody complex. 
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Acid Hydrolysis Liberates the Amino Acids of a Protein 


Peptide bonds of proteins are hydrolyzed by either strong acid or strong base. Acid 
hydrolysis is the method of choice for analysis of the amino acid composition of pro- 
teins and polypeptides because it proceeds without racemization and with less de- 
struction of certain amino acids (Ser, Thr, Arg, and Cys). Typically, samples of a pro- 
tein are hydrolyzed with 6 NHC] at 110°C. Tryptophan is destroyed by acid and must 
be estimated by other means to determine its contribution to the total amino acid 
composition. The OH-containing amino acids serine and threonine are slowly de- 
stroyed. In contrast, peptide bonds involving hydrophobic residues such as valine and 
isoleucine are only slowly hydrolyzed in acid. Another complication arises because the 
B- and y-amide linkages in asparagine (Asn) and glutamine (Gln) are acid labile. The 
amino nitrogen is released as free ammonium, and all of the Asn and Gln residues of 
the protein are converted to aspartic acid (Asp) and glutamic acid (Glu), respectively. 
The amount of ammonium released during acid hydrolysis gives an estimate of the to- 
tal number of Asn and Gln residues in the original protein, but not the amounts of 
either. 


Chromatographic Methods Are Used to Separate the Amino Acids 


The complex amino acid mixture in the hydrolysate obtained after digestion of a 
protein in 6 NHCI can be separated into the component amino acids by using either 
ion exchange chromatography or reversed-phase high-pressure liquid chromatogra- 
phy (HPLC) (see Chapter Appendix). The amount of each amino acid can then be 
determined. These methods of separation and analysis are fully automated in in- 
struments called amino acid analyzers. Analysis of the amino acid composition of 
a 30-kD protein by these methods requires less than 1 hour and only 6 wg (0.2 nmol) 
of the protein. 


The Amino Acid Compositions of Different Proteins Are Different 


Amino acids almost never occur in equimolar ratios in proteins, indicating that pro- 
teins are not composed of repeating arrays of amino acids. There are a few excep- 
tions to this rule. Collagen, for example, contains large proportions of glycine and 
proline, and much of its structure is composed of (Gly-x-Pro) repeating units, where 
x is any amino acid. Other proteins show unusual abundances of various amino 
acids. For example, histones are rich in positively charged amino acids such as argi- 
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FIGURE 5.8 The hormone insulin consists of two 
polypeptide chains, A and B, held together by two disul- 
fide cross-bridges (S—S).The A chain has 21 amino acid 
residues and an intrachain disulfide; the B polypeptide 
contains 30 amino acids. The sequence shown is for 
bovine insulin. (Illustration: Irving Geis. Rights owned by 
Howard Hughes Medical Institute. Not to be reproduced with- 
out permission.) 


nine and lysine. Histones are a class of proteins found associated with the anionic 
phosphate groups of eukaryotic DNA. 

Amino acid analysis itself does not directly give the number of residues of each 
amino acid in a polypeptide, but if the molecular weight and the exact amount of the 
protein analyzed are known (or the number of amino acid residues per molecule is 
known), the molar ratios of amino acids in the protein can be calculated. Amino acid 
analysis provides no information on the order or sequence of amino acid residues in 
the polypeptide chain. 
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The Sequence of Amino Acids in a Protein Is Distinctive 


The unique characteristic of each protein is the distinctive sequence of amino acid 
residues in its polypeptide chain(s). Indeed, it is the amino acid sequence of pro- 
teins that is encoded by the nucleotide sequence of DNA. This amino acid se- 
quence, then, is a form of genetic information. Because polypeptide chains are un- 
branched, a polypeptide chain has only two ends, an amino-terminal, or N-terminal, 
end and a carboxy-terminal, or C-terminal, end. By convention, the amino acid se- 
quence is read from the N-terminal end of the polypeptide chain through to the 
C-terminal end. As an example, every molecule of ribonuclease A from bovine 
pancreas has the same amino acid sequence, beginning with N-terminal lysine at 
position 1 and ending with C-terminal valine at position 124 (Figure 5.2). Given 
the possibility of any of the 20 amino acids at each position, the number of unique 
amino acid sequences is astronomically large. The astounding sequence variation 
possible within polypeptide chains provides a key insight into the incredible func- 
tional diversity of protein molecules in biological systems discussed later in this 
chapter. 


Sanger Was the First to Determine the Sequence of a Protein 


In 1953, Frederick Sanger of Cambridge University in England reported the 
amino acid sequences of the two polypeptide chains composing the protein in- 
sulin (Figure 5.8). Not only was this a remarkable achievement in analytical chem- 
istry, but it helped demystify speculation about the chemical nature of proteins. 
Sanger’s results clearly established that all of the molecules of a given protein 
have a fixed amino acid composition, a defined amino acid sequence, and there- 
fore an invariant molecular weight. In short, proteins are well defined chemically. 
Today, the amino acid sequences of hundreds of thousands of proteins are known. 
Although many sequences have been determined from application of the princi- 
ples first established by Sanger, most are now deduced from knowledge of the nu- 
cleotide sequence of the gene that encodes the protein. In addition, in recent 
years, the application of mass spectrometry to the sequence analysis of proteins 
has largely superseded the protocols based on chemical and enzymatic degrada- 
tion of polypeptides that Sanger pioneered. 


Both Chemical and Enzymatic Methodologies Are Used 
in Protein Sequencing 


The chemical strategy for determining the amino acid sequence of a protein in- 
volves six basic steps: 


1. If the protein contains more than one polypeptide chain, the chains are sepa- 
rated and purified. 

2. Intrachain S—S (disulfide) cross-bridges between cysteine residues in the poly- 
peptide chain are cleaved. (If these disulfides are interchain linkages, then step 
2 precedes step 1.) 

3. The N-terminal and C-terminal residues are identified. 


A DEEPER LOOK 
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The Virtually Limitless Number of Different Amino Acid Sequences 


Given 20 different amino acids, a polypeptide chain of n residues 
can have any one of 20” possible sequence arrangements. To por- 
tray this, consider the number of tripeptides possible if there were 
only three different amino acids, A, B, and C (tripeptide = 3 = n; 


For a polypeptide chain of 100 residues in length, a rather modest 
size, the number of possible sequences is 20!°, or because 20 = 
101%, 10'*° unique possibilities. These numbers are more than as- 
tronomical! Because an average protein molecule of 100 residues 


3% = "3? = 27): would have a mass of 12,000 daltons (assuming the average molec- 


ular mass of an amino acid residue = 120), 10!°° such molecules 


foe er on would have a mass of 1.2 x 104 daltons. The mass of the ob- 
AAC BBC CCB servable universe is estimated to be 10% proton masses (about 10% 
ABA BAB CBC daltons). Thus, the universe lacks enough material to make just 
ACA BCB CAC one molecule of each possible polypeptide sequence for a protein 
ABC BAA CBA only 100 residues in length. 

ACB BCC CAB 

ABB BAC CBB 

ACC BCA CAA 








4. Each polypeptide chain is cleaved into smaller fragments, and the amino acid 
composition and sequence of each fragment are determined. 

5. Step 4 is repeated, using a different cleavage procedure to generate a different 
and therefore overlapping set of peptide fragments. 

6. The overall amino acid sequence of the protein is reconstructed from the 
sequences in overlapping fragments. 


Each of these steps is discussed in greater detail in the following sections. 


Step 1. Separation of Polypeptide Chains 


If the protein of interest is a heteromultimer (composed of more than one type of 
polypeptide chain), then the protein must be dissociated into its component 
polypeptide chains, which then must be separated from one another and se- 
quenced individually. Because subunits in multimeric proteins typically associate 
through noncovalent interactions, most multimeric proteins can be dissociated by 
exposure to pH extremes, 8 M urea, 6 M guanidinium hydrochloride, or high salt 
concentrations. (All of these treatments disrupt polar interactions such as hydrogen 
bonds both within the protein molecule and between the protein and the aqueous 
solvent.) Once dissociated, the individual polypeptides can be isolated from one an- 
other on the basis of differences in size and/or charge. Occasionally, heteromulti- 
mers are linked together by interchain S—S bridges. In such instances, these 
crosslinks must be cleaved before dissociation and isolation of the individual chains. 
The methods described under step 2 are applicable for this purpose. 


Step 2. Cleavage of Disulfide Bridges 


A number of methods exist for cleaving disulfides. An important consideration is to 
carry out these cleavages so that the original or even new S—S links do not form. Ox- 
idation of a disulfide by performic acid results in the formation of two equivalents of 
cysteic acid (Figure 5.9a). Because these cysteic acid side chains are ionized SO3~ 
groups, electrostatic repulsion (as well as altered chemistry) prevents S—S recombi- 
nation. Alternatively, sulfhydryl compounds such as 2-mercaptoethanol or dithiothre- 
itol (DTT) readily reduce S—S bridges to regenerate two cysteine—SH side chains, as 
in a reversal of the reaction shown in Figure 4.8b. However, these SH groups recom- 
bine to re-form either the original disulfide link or, if other free Cys—SHs are 
available, new disulfide links. To prevent this, S—S reduction must be followed by 
treatment with alkylating agents such as iodoacetate or 3-bromopropylamine, which 
modify the SH groups and block disulfide bridge formation (Figure 5.9b). 
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FIGURE 5.9 Methods for cleavage of disulfide bonds in proteins. (a) Oxidative cleavage by reaction with per- 
formic acid. (b) Disulfide bridges can be broken by reduction with sulfhydryl agents such as B-mercaptoethanol 
or dithiothreitol. Because reaction between the newly reduced —SH groups to reestablish disulfide bonds is a 
likelihood, S—S reduction must be followed by —SH modification: (1) alkylation with iodoacetate (ICH;COOH) 
or (2) modification with 3-bromopropylamine (Br—(CH)3—NH)). 

















Step 3. 


A. N-Terminal Analysis The amino acid residing at the N-terminal end of a pro- 
tein can be identified in a number of ways; one method, Edman degradation, has 
become the procedure of choice. This method is preferable because it allows the se- 
quential identification of a series of residues beginning at the N-terminus. In weakly 
basic solutions, phenylisothiocyanate, or Edman reagent (phenyl—N==C=S), com- 
bines with the free amino terminus of a protein (see Figure 4.8a), which can be ex- 
cised from the end of the polypeptide chain and recovered as a PTH derivative. 
Chromatographic methods can be used to identify this PTH derivative. Importantly, 
in this procedure, the rest of the polypeptide chain remains intact and can be sub- 
jected to further rounds of Edman degradation to identify successive amino acid 
residues in the chain. Often, the carboxyl terminus of the polypeptide under analy- 
sis is coupled to an insoluble matrix, allowing the polypeptide to be easily recovered 
by filtration or centrifugation following each round of Edman reaction. Thus, the 
Edman reaction not only identifies the N-terminal residue of proteins but through 
successive reaction cycles can reveal further information about sequence. Auto- 
mated instruments (so-called Edman sequenators) have been designed to carry out 
repeated rounds of the Edman procedure. In practical terms, as many as 50 cycles 
of reaction can be accomplished on 50 pmol (about 0.1 wg) of a polypeptide 100 to 
200 residues long, revealing the sequential order of the first 50 amino acid residues 
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in the protein. The efficiency with larger proteins is less; a typical 2000—amino acid 
protein provides only 10 to 20 cycles of reaction. 


B. C-Terminal Analysis For the identification of the C-terminal residue of polypep- 
tides, an enzymatic approach is commonly used. Carboxypeptidases are enzymes that 
cleave amino acid residues from the C-termini of polypeptides in a successive fashion. 
Four carboxypeptidases are in general use: A, B, C, and Y. Carboxypeptidase A (from 
bovine pancreas) works well in hydrolyzing the C-terminal peptide bond of all residues 
except proline, arginine, and lysine. The analogous enzyme from hog pancreas, car- 
boxypeptidase B, is effective only when Arg or Lys are the C-terminal residues. Carboxy- 
peptidase C from citrus leaves and carboxypeptidase Y from yeast act on any C-terminal 
residue. Because the nature of the amino acid residue at the end often determines the 
rate at which it is cleaved and because these enzymes remove residues successively, care 
must be taken in interpreting results. Carboxypeptidase Y cleavage has been adapted 
to an automated protocol analogous to that used in Edman sequenators. 


Steps 4 and 5. Fragmentation of the Polypeptide Chain 


The aim at this step is to produce fragments useful for sequence analysis. The cleav- 
age methods employed are usually enzymatic, but proteins can also be fragmented by 
specific or nonspecific chemical means (such as partial acid hydrolysis). Proteolytic 
enzymes offer an advantage in that many hydrolyze only specific peptide bonds, and 
this specificity immediately gives information about the peptide products. As a first 
approximation, fragments produced upon cleavage should be small enough to yield 
their sequences through end-group analysis and Edman degradation, yet not so small 
that an overabundance of products must be resolved before analysis. 


A.Trypsin The digestive enzyme trypsin is the most commonly used reagent for 
specific proteolysis. Trypsin will only hydrolyze peptide bonds in which the carbonyl 
function is contributed by an arginine or a lysine residue. That is, trypsin cleaves on 
the C-side of Arg or Lys, generating a set of peptide fragments having Arg or Lys at 
their C-termini. The number of smaller peptides resulting from trypsin action is 
equal to the total number of Arg and Lys residues in the protein plus one—the pro- 
tein’s C-terminal peptide fragment (Figure 5.10). 


B. Chymotrypsin Chymotrypsin shows a strong preference for hydrolyzing pep- 
tide bonds formed by the carboxyl groups of the aromatic amino acids, phen- 
ylalanine, tyrosine, and tryptophan. However, over time, chymotrypsin also hy- 
drolyzes amide bonds involving amino acids other than Phe, Tyr, or Trp. For 
instance, peptide bonds having leucine-donated carboxyls are also susceptible. 
Thus, the specificity of chymotrypsin is only relative. Because chymotrypsin pro- 
duces a very different set of products than trypsin, treatment of separate samples 
of a protein with these two enzymes generates fragments whose sequences over- 
lap. Resolution of the order of amino acid residues in the fragments yields the 
amino acid sequence in the original protein. 


C. Other Endopeptidases A number of other endopeptidases (proteases that cleave 
peptide bonds within the interior of a polypeptide chain) are also used in sequence 
investigations. These include clostripain, which acts only at Arg residues; endopepti- 
dase Lys-C, which cleaves only at Lys residues; and staphylococcal protease, which acts 
at the acidic residues, Asp and Glu. Other, relatively nonspecific endopeptidases are 
handy for digesting large tryptic or chymotryptic fragments. Pepsin, papain, subtil- 
isin, thermolysin, and elastase are some examples. Papain is the active ingredient in 
meat tenderizer, soft contact lens cleaner, and some laundry detergents. 


D. Cyanogen Bromide Several highly specific chemical methods of proteolysis are 
available, the most widely used being cyanogen bromide (CNBr) cleavage. CNBr acts 
upon methionine residues (Figure 5.11). The nucleophilic sulfur atom of Met reacts 
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CENGAGENOW” ANIMATED FIGURE 5.10 (a) Trypsin is a 
proteolytic enzyme, or protease, that specifically cleaves 
only those peptide bonds in which arginine or lysine 
contributes the carbonyl function. (b) The products of 
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CENGAGENOW” ANIMATED FIGURE 5.11 Cyanogen 
bromide (CNBr) is a highly selective reagent for cleavage 
of peptides only at methionine residues. (1) Nucleophilic 
attack of the Met S atom on the —C=N carbon atom, 
with displacement of Br. (2) Nucleophilic attack by the 
Met carbonyl oxygen atom on the R group. The cyclic 
derivative is unstable in aqueous solution. (3) Hydrolysis 
cleaves the Met peptide bond. C-terminal homoserine 
residues occur where Met residues once were. See 

this figure animated at www.cengage.com/ 

login 
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E Jypeptide Cleavage Procedures Used in Sequence Analysis 

Peptide Bond on 
Carboxyl (C) or Amino (N) Susceptible 

Method Side of Susceptible Residue Residue(s) 

Proteolytic enzymes* 

Trypsin C Arg or Lys 

Chymotrypsin C Phe, Trp, or Tyr; Leu 

Clostripain C Arg 

Staphylococcal protease C Asp or Glu 

Chemical methods 

Cyanogen bromide C Met 

NHOH Asn-Gly bonds 

pH 2.5, 40°C Asp-Pro bonds 











*Some proteolytic enzymes, including trypsin and chymotrypsin, will not cleave peptide bonds where proline is the 
amino acid contributing the N-atom. 


with CNBr, yielding a sulfonium ion that undergoes a rapid intramolecular re- 
arrangement to form a cyclic iminolactone. Water readily hydrolyzes this iminolac- 
tone, cleaving the polypeptide and generating peptide fragments having C-terminal 
homoserine lactone residues at the former Met positions. 


E. Other Chemical Methods of Fragmentation A number of other chemical 
methods give specific fragmentation of polypeptides, including cleavage at 
asparagine-glycine bonds by hydroxylamine (NHOH) at pH 9 and selective hy- 
drolysis at aspartyl-prolyl bonds under mildly acidic conditions. Table 5.2 summa- 
rizes the various procedures described here for polypeptide cleavage. These meth- 
ods are only a partial list of the arsenal of reactions available to protein chemists. 
Cleavage products generated by these procedures must be isolated and individually 
sequenced to accumulate the information necessary to reconstruct the protein’s 
complete amino acid sequence. Peptide sequencing today is most commonly done 
by Edman degradation of relatively large peptides or by mass spectrometry (see fol- 
lowing discussion). 


Step 6. Reconstruction of the Overall Amino Acid Sequence 


The sequences obtained for the sets of fragments derived from two or more cleav- 
age procedures are now compared, with the objective being to find overlaps that es- 
tablish continuity of the overall amino acid sequence of the polypeptide chain. The 
strategy is illustrated by the example shown in Figure 5.12. Peptides generated from 
specific fragmentation of the polypeptide can be aligned to reveal the overall amino 
acid sequence. Such comparisons are also useful in eliminating errors and validat- 
ing the accuracy of the sequences determined for the individual fragments. 


The Amino Acid Sequence of a Protein Can Be Determined 
by Mass Spectrometry 


Mass spectrometers exploit the difference in the mass-to-charge (m/z) ratio of ion- 
ized atoms or molecules to separate them from each other. The m/z ratio of a mol- 
ecule is also a highly characteristic property that can be used to acquire chemical 
and structural information. Furthermore, molecules can be fragmented in distinc- 
tive ways in mass spectrometers, and the fragments that arise also provide quite spe- 
cific structural information about the molecule. The basic operation of a mass spec- 
trometer is to (1) evaporate and ionize molecules in a vacuum, creating gas-phase 
ions; (2) separate the ions in space and/or time based on their m/z ratios; and 
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CENGAGENOW” ANIMATED FIGURE 5.12 Summary of 
the sequence analysis of catrocollastatin-C, a 23.6-kD 
protein found in the venom of the western diamond- 
back rattlesnake Crotalus atrox. Sequences shown are 
given in the one-letter amino acid code. The overall 
amino acid sequence (216 amino acid residues long) for 
catrocollastatin-C as deduced from the overlapping 
sequences of peptide fragments is shown on the lines 
headed CAT-C. The other lines report the various 
sequences used to obtain the overlaps. These sequences 
were obtained from (a) N-term: Edman degradation of 
the intact protein in an automated Edman sequenator; 
(b) M: proteolytic fragments generated by CNBr cleav- 
age, followed by Edman sequencing of the individual 
fragments (numbers denote fragments M1 through M5); 
(c) K: proteolytic fragments from endopeptidase Lys-C 
cleavage, followed by Edman sequencing (only frag- 
ments K3 through K6 are shown); (d) E: proteolytic frag- 
ments from Staphylococcus protease digestion of catrocol- 
lastatin sequenced in the Edman sequenator (only E13 
through E15 are shown). (Adapted from Shimokawa, K, et al, 
1997. Sequence and biological activity of catrocollastatin-C: A disin- 





tegrin-like/cysteine-rich two-domain protein from Crotalus atrox 
venom. Archives of Biochemistry and Biophysics 343:35-43.) See 
this figure animated at www.cengage.com/ 

login 


1 10 20 30 40 50 60 
CAT-C LGTDIISPPVCGNELLEVGEECDCGTPENCQNECCDAATCKLKSGSQCGHGDCCEQCKFS 
N-Term LGTDIUSPPVCGNELLEVGEECDCGTPENCQNECCDAAT 
M1 LGTDIISPPVCGNELLEVGEECDCGTPENCQNECCDAATCKLKSGSQCGHGDCCEQCK 


K3 SGSQCGHGDCCEQCK 

K4 FS 
70 80 90 100 110 120 

CAT-C KSGTECRASMSECDPAEHCTGQSSECPADVFHKNGQPCLDNYGYCYNGNCPIMYHQCYDL 

M2 SECDPAEHCTGQSSECPADVFHKNGQPCLDNYGYCY 

M3 YHQCYDL 

K4 K 

K5 SGTECRASMSECDPAEHCTGQSSECPADVF 

K6 NGQPCLDNYGYCYNGNCPIMYHQCYDL 
130 140 150 160 170 180 

CAT-C FGADVYEAEDSCFERNQKGNYYGYCRKENGNKIPCCAPEDVKCGRLYCKDNSPGQNNPCKM 





M3 FGADVYEAEDSCF-RNQKGNYYGYCRKENGNKIPCCAPEDVKCGRLYCKDN-PGQN- PCK 
K6 FGA 

E13 -SCFERNQKGN 

E15 DVKCGRLYCKDNSPGQNNPCKM 


190 200 210 
CAT-C FYSNEDEHKGMVLPGTKCADGKVCSNGHCVDVATAY 
M4 FYSNEDEHKGM 
M5 VLPGTKCADGKVCSNGHCVDVATAY 
E15 FYSNEDEHKGMVLPGTKCADGKVC 


(3) measure the amount of ions with specific m/z ratios. Because proteins (as well 
as nucleic acids and carbohydrates) decompose upon heating, rather than evapo- 
rating, methods to ionize such molecules for mass spectrometry (MS) analysis re- 
quire innovative approaches. The two most prominent MS modes for protein analy- 
sis are summarized in Table 5.3. 

Figure 5.13 illustrates the basic features of electrospray mass spectrometry (ESI 
MS). In this technique, the high voltage at the electrode causes proteins to pick up 





Electrospray lonization (ESI-MS) 








A solution of macromolecules is sprayed in the form of fine droplets from a glass 
capillary under the influence of a strong electrical field. The droplets pick up positive 
charges as they exit the capillary; evaporation of the solvent leaves multiply charged 
molecules. The typical 20-kD protein molecule will pick up 10 to 30 positive charges. 
The MS spectrum of this protein reveals all of the differently charged species as a 
series of sharp peaks whose consecutive m/z values differ by the charge and mass of a 
single proton (see Figure 5.14). Note that decreasing m/z values signify increasing 
number of charges per molecule, z. Tandem mass spectrometers downstream from the 
ESI source (ESI-MS/MS) can analyze complex protein mixtures (such as tryptic 
digests of proteins or chromatographically separated proteins emerging from a liquid 
chromatography column), selecting a single m/z species for collision-induced 
dissociation and acquisition of amino acid sequence information. 


Matrix-Assisted Laser Desorption lonization-Time of Flight (MALDI-TOF MS) 

The protein sample is mixed with a chemical matrix that includes a light-absorbing 
substance excitable by a laser. A laser pulse is used to excite the chemical matrix, creating 
a microplasma that transfers the energy to protein molecules in the sample, ionizing 
them and ejecting them into the gas phase. Among the products are protein molecules 
that have picked up a single proton. These positively charged species can be selected by 
the MS for mass analysis. MALDI-TOF MS is very sensitive and very accurate; as little as 
attomole (107 '* moles) quantities of a particular molecule can be detected at accuracies 
better than 0.001 atomic mass units (0.001 daltons). MALDI-TOF MS is best suited for 
very accurate mass measurements. 
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FIGURE 5.13 The three principal steps in electrospray ionization mass spectrometry (ESI-MS). (a) Small, highly 
charged droplets are formed by electrostatic dispersion of a protein solution through a glass capillary sub- 
jected to a high electric field; (b) protein ions are desorbed from the droplets into the gas phase (assisted by 
evaporation of the droplets in a stream of hot N; gas); and (c) the protein ions are separated in a mass spec- 
trometer and identified according to their m/z ratios. (Adapted from Figure 1 in Mann, M, and Wilm, M., 1995. Electro- 
spray mass spectrometry for protein characterization. Trends in Biochemical Sciences 20:219-224.) 


protons from the solvent, such that, on average, individual protein molecules ac- 
quire about one positive charge (proton) per kilodalton, leading to the spectrum of 
m/z ratios for a single protein species (Figure 5.14). Computer analysis can convert 
these data into a single spectrum that has a peak at the correct protein mass (Figure 
5.14, inset). 


Sequencing by Tandem Mass Spectrometry Tandem MS (or MS/MS) allows se- 
quencing of proteins by hooking two mass spectrometers in tandem. The first mass 
spectrometer is used as a filter to sort the oligopeptide fragments in a protein digest 
based on differences in their m/z ratios. Each of these oligopeptides can then be se- 
lected by the mass spectrometer for further analysis. A selected ionized oligopeptide 
is directed toward the second mass spectrometer; on the way, this oligopeptide is frag- 
mented by collision with helium or argon gas molecules (a process called collision- 
induced dissociation, or c.i.d.), and the fragments are analyzed by the second mass 
spectrometer (Figure 5.15). Fragmentation occurs primarily at the peptide bonds 
linking successive amino acids in the oligopeptide. Thus, the products include a se- 
ries of fragments that represent a nested set of peptides differing in size by one 
amino acid residue. The various members of this set of fragments differ in mass by 
56 atomic mass units [the mass of the peptide backbone atoms (NH—CH—CO)] 
plus the mass of the R group at each position, which ranges from 1 atomic mass unit 
(Gly) to 130 (Trp). MS sequencing has the advantages of very high sensitivity, fast 
sample processing, and the ability to work with mixtures of proteins. Subpicomoles 
(less than 107! moles) of peptide can be analyzed with these spectrometers. In prac- 
tice, tandem MS is limited to rather short sequences (no longer than 15 or so amino 
acid residues). Nevertheless, capillary HPLC-separated peptide mixtures from 
trypsin digests of proteins can be directly loaded into the tandem MS spectrometer. 
Furthermore, separation of a complex mixture of proteins from a whole-cell extract 
by two-dimensional gel electrophoresis (see Chapter Appendix), followed by trypsin 
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FIGURE 5.14 Electrospray ionization mass spectrum of the protein aerolysin K. The attachment of many pro- 
tons per protein molecule (from less than 30 to more than 50 here) leads to a series of m/z peaks for this sin- 
gle protein. The equation describing each m/z peak is: m/z = [M + n(mass of proton)]/n(charge on proton), 
where M = mass of the protein and n = number of positive charges per protein molecule. Thus, if the number 
of charges per protein molecule is known and m/z is known, M can be calculated. The inset shows a computer 
analysis of the data from this series of peaks that generates a single peak at the correct molecular mass of the 
protein. (Adapted from Figure 2 in Mann, M, and Wilm, M., 1995. Electrospray mass spectrometry for protein characterization. 
Trends in Biochemical Sciences 20:219-224.) 


digestion of a specific protein spot on the gel and injection of the digest into the 
HPLC/tandem MS, gives sequence information that can be used to identify specific 
proteins. Often, by comparing the mass of tryptic peptides from a protein digest 
with a database of all possible masses for tryptic peptides (based on all known pro- 
tein and DNA sequences), one can identify a protein of interest without actually 
sequencing it. 


Peptide Mass Fingerprinting Peptide mass fingerprinting is used to uniquely identify 
a protein based on the masses of its proteolytic fragments, usually produced by 
trypsin digestion. MALDI-TOF MS instruments are ideal for this purpose because 
they yield highly accurate mass data. The measured masses of the proteolytic frag- 
ments can be compared to databases (see following discussion) of peptide masses 
of known sequence. Such information is easily generated from genomic databases: 
Nucleotide sequence information can be translated into amino acid sequence in- 
formation, from which very accurate peptide mass compilations are readily calcu- 
lated. For example, the SWISS-PROT database lists 1197 proteins with a tryptic 
fragment of m/z = 1335.63 (+0.2 D), 16 proteins with tryptic fragments of m/z = 
1335.63 and m/z = 1405.60, but only a single protein (human tissue plasminogen 
activator [tPA]) with tryptic fragments of m/z = 1335.63, m/z = 1405.60, and m/z = 
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FIGURE 5.15 Tandem mass spectrometry. (a) Configuration used in tandem MS. (b) Schematic description of 
tandem MS:Tandem MS involves electrospray ionization of a protein digest (IS in this figure), followed by selec- 
tion of a single peptide ion mass for collision with inert gas molecules (He) and mass analysis of the fragment 
ions resulting from the collisions. (c) Fragmentation usually occurs at peptide bonds, as indicated. (Adapted from 
Yates, J. R., 1996. Protein structure analysis by mass spectrometry. Methods in Enzymology 271:35 1-376; and Gillece-Castro, B. L., 
and Stults, J.T., 1996. Peptide characterization by mass spectrometry. Methods in Enzymology 271:427-447.) 


1272.60.! Although the identities of many proteins revealed by genomic analysis re- 
main unknown, peptide mass fingerprinting can assign a particular protein exclu- 
sively to a specific gene in a genomic database. 


Sequence Databases Contain the Amino Acid Sequences 
of Millions of Different Proteins 


The first protein sequence databases were compiled by protein chemists using chem- 
ical sequencing methods. Today, the vast preponderance of protein sequence infor- 
mation has been derived from translating the nucleotide sequences of genes into 
codons and, thus, amino acid sequences (see Chapter 12). Sequencing the order of 
nucleotides in cloned genes is a more rapid, efficient, and informative process than 
determining the amino acid sequences of proteins by chemical methods. Several 
electronic databases containing continuously updated sequence information are ac- 
cessible by personal computer. Prominent among these is the SWISS-PROT protein 


'The tPA amino acid sequences corresponding to these masses are m/z = 1335.63: HEALSPFYSER; 
m/z = 1405.60: ATCYEDQGISYR; and m/z = 1272.60: DSKPWCYVFK. 
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sequence database on the ExPASy (Expert Protein Analysis System) Molecular Biology 
server at hitp://us.expasy.org and the PIR (Protein Identification Resource Protein 
Sequence Database) at hitp://pir.georgetown.edu, as well as protein information from 
genomic sequences available in databases such as GenBank, accessible via the National 
Center for Biotechnology Information (NCBI) Web site located at hitp://www.ncbi.nlm 
.nih.gov. The protein sequence databases contain several hundred thousand entries, 
whereas the genomic databases list nearly 100 million nucleotide sequences cover- 
ing over 100 gigabases (100 billion bases) from over 165,000 organisms. The Protein 
Data Bank (PDB; hitp://www.rcsb.org/pdb) is a protein database that provides three- 
dimensional structure information on more than 50,000 proteins and nucleic acids. 


5:50 What Is the Nature of Amino Acid Sequences? 


Figure 5.16 illustrates the relative frequencies of the amino acids in proteins. It is 
very unusual for a globular protein to have an amino acid composition that deviates 
substantially from these values. Apparently, these abundances reflect a distribution 
of amino acid polarities that is optimal for protein stability in an aqueous milieu. 
Membrane proteins tend to have relatively more hydrophobic and fewer ionic 
amino acids, a condition consistent with their location. Fibrous proteins may show 
compositions that are atypical with respect to these norms, indicating an underly- 
ing relationship between the composition and the structure of these proteins. 
Proteins have unique amino acid sequences, and it is this uniqueness of sequence 
that ultimately gives each protein its own particular personality. Because the number 
of possible amino acid sequences in a protein is astronomically large, the probability 
that two proteins will, by chance, have similar amino acid sequences is negligible. 
Consequently, sequence similarities between proteins imply evolutionary relatedness. 
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FIGURE 5.16 Amino acid composition: frequencies of the various amino acids in proteins for all the proteins in 
the SWISS-PROT protein knowedgebase. These data are derived from the amino acid composition of more 
than 100,000 different proteins (representing more than 40,000,000 amino acid residues). The range is from 
leucine at 9.55% to tryptophan at 1.18% of all residues. 
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Homologous Proteins from Different Organisms Have Homologous 
Amino Acid Sequences 


Proteins sharing a significant degree of sequence similarity and structural resem- 
blance are said to be homologous. Proteins that perform the same function in differ- 
ent organisms are also referred to as homologous. For example, the oxygen transport 
protein hemoglobin serves a similar role and has a similar structure in all vertebrates. 
The study of the amino acid sequences of homologous proteins from different or- 
ganisms provides very strong evidence for their evolutionary origin within a common 
ancestor. Homologous proteins characteristically have polypeptide chains that are 
nearly identical in length, and their sequences share identity in direct correlation to 
the relatedness of the species from which they are derived. 

Homologous proteins can be further subdivided into orthologous and paralo- 
gous proteins. Orthologous proteins are proteins from different species that have 
homologous amino acid sequences (and often a similar function). Orthologous 
proteins arose from a common ancestral gene during evolution. Paralogous pro- 
teins are proteins found within a single species that have homologous amino acid 
sequences; paralogous proteins arose through gene duplication. For example, the 
a- and -globin chains of hemoglobin are paralogs. How is homology revealed? 


Computer Programs Can Align Sequences and Discover Homology 
between Proteins 


Protein and nucleic acid sequence databases (see page 110) provide enormous re- 
sources for sequence comparisons. If two proteins share homology, it can be re- 
vealed through alignment of their sequences using powerful computer programs. 
In such studies, a given amino acid sequence is used to query the databases for pro- 
teins with similar sequences. BLAST (Basic Local Alignment Search Tool) is one 
commonly used program for rapid searching of sequence databases. The BLAST 
program detects local as well as global alignments where sequences are in close 
agreement. Even regions of similarity shared between otherwise unrelated proteins 
can be detected. Discovery of sequence similarities between proteins can be an im- 
portant clue to the function of uncharacterized proteins. Similarities are also useful 
in assigning related proteins to protein families. 

The process of sequence alignment is an operation akin to sliding one sequence 
along another in a search for regions where the two sequences show a good match. 
Positive scores are assigned everywhere the amino acid in one sequence is similar to 
or identical with the amino acid in the other; the greater the overall score, the bet- 
ter the match between the two protein sequences. Sometimes two sequences match 
well at several places along their lengths, but, in one of the proteins, the matching 
segments are interrupted by a sequence that is dissimilar. When such an interrup- 
tion is found by the computer program, it inserts a gap in the uninterrupted se- 
quence to bring the matching segments of the two sequences into better alignment 
(Figure 5.17). Because any two sequences would show similarity if a sufficient num- 
ber of gaps were introduced, a gap penalty is imposed for each gap. Gap penalties 
are negative numbers that lower the overall similarity score. Gaps arise naturally 
during evolution through insertion and deletion mutations socalled indels, which 


S. acidocaldarius FPIAKGGTAAIPGPFGSGKTVTLQSLAKWSAAK--—-VVIYVGCGERGNEMTD 
E. coli CPFAKGGKVGLFGGAGVGKTVNMMELIRNIAIEHSGYSVFAGVGERTREGND 


FIGURE 5.17 Alignment of the amino acid sequences of two protein homologs using gaps. Shown are parts of 
the amino acid sequences of the catalytic subunits from the major ATP-synthesizing enzyme (ATP synthase) in 
a representative archaea (Sulfolobus acidocaldarius) and a bacterium (Escherichia coli). These protein segments 
encompass the nucleotide-binding site of these enzymes. Identical residues in the two sequences are shown 
in red. Introduction of a three-residue-long gap in the archaeal sequence optimizes alignment of the two 
sequences. 
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add or remove residues in the gene and, consequently, the protein. The optimal se- 
quence alignment between two proteins is one that maximizes sequence alignments 
while minimizing gaps. 

Methods for alignment and comparison of protein sequences depend upon 
some quantitative measure of how similar any two sequences are. One way to mea- 
sure similarity is to use a matrix that assigns scores for all possible substitutions of 
one amino acid for another. BLOSUM62 is the substitution matrix most often 
used with BLAST. This matrix assigns a probability score for each position in an 
alignment based on the frequency with which that substitution occurs in the con- 
sensus sequences of related proteins. BLOSUM is an acronym for Blocks Substi- 
tution Matrix, a matrix that scores each position on the basis of observed fre- 
quencies of different amino acid substitutions within blocks of local alignments in 
related proteins. In the BLOSUM62 matrix, the most commonly used matrix, the 
scores are derived using sequences sharing no more than 62% identity (Figure 
5.18). BLOSUM substitution scores range from —4 (lowest probability of substi- 
tution) to 11 (highest probability of substitution). For example, to look up the 
value corresponding to the substitution of an asparagine (N) by a tryptophan 
(W), or vice versa, find the intersection of the “N” column with the “W” row in Fig- 
ure 5.18. The value —4 means that the substitution of N for W, or vice versa, is not 
very likely. On the other hand, the substitution of V for I, (BLOSUM score: 3) or 
vice versa, is very likely. Amino acids whose side chains have unique qualities (such 
as C, H, P, or W) have high BLOSUM62 scores, because replacing them with any 
other amino acid may change the protein significantly. Amino acids that are sim- 
ilar (such as R and K, or D and E, or A, V, L, and I) have low scores, since one can 
replace the other with less likelihood of serious change to the protein structure. 


Cytochrome c The electron transport protein cytochrome c, found in the mi- 
tochondria of all eukaryotic organisms, provides a well-studied example of or- 
thology. Amino acid sequencing of cytochrome c from more than 40 different 
species has revealed that there are 28 positions in the polypeptide chain where 
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FIGURE 5.18 The BLOSUM62 substitution matrix provides scores for all possible exchanges of one amino acid 


with another. (From Henikoff, S. and Henikoff, J.G., 1992. Amino acid substitution matrices from protein blocks. Proceedings of 
the National Academy of Sciences, USA 89:10915-10919.) 
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the same amino acid residues are always found (Figure 5.19). These invariant 
residues serve roles crucial to the biological function of this protein, and thus 
substitutions of other amino acids at these positions cannot be tolerated. The 
number of amino acid differences between two cytochrome c sequences is propor- 
tional to the phylogenetic difference between the species from which they are de- 
rived. Cytochrome c in humans and in chimpanzees is identical; human and 
another mammalian (sheep) cytochrome c differ at 10 residues. The human cyto- 
chrome c sequence has 14 variant residues from a reptile sequence (rattlesnake), 
18 from a fish (carp), 29 from a mollusc (snail), 31 from an insect (moth), and 
more than 40 from yeast or higher plants (cauliflower). 


The Phylogenetic Tree for Cytochrome c Figure 5.20 displays a phylogenetic tree 
(a diagram illustrating the evolutionary relationships among a group of organisms) 
constructed from the sequences of cytochrome c. The tips of the branches are occu- 
pied by contemporary species whose sequences have been determined. The tree has 
been deduced by computer analysis of these sequences to find the minimum num- 
ber of mutational changes connecting the branches. Other computer methods can 
be used to infer potential ancestral sequences represented by nodes, or branch 
points, in the tree. Such analysis ultimately suggests a primordial cytochrome c se- 
quence lying at the base of the tree. Evolutionary trees constructed in this manner, 
that is, solely on the basis of amino acid differences occurring in the primary se- 
quence of one selected protein, show remarkable agreement with phylogenetic re- 
lationships derived from more classic approaches and have given rise to the field of 
molecular evolution. 


Related Proteins Share a Common Evolutionary Origin 


Amino acid sequence analysis reveals that proteins with related functions often 
show a high degree of sequence similarity. Such findings suggest a common 
ancestry for these proteins. 


Oxygen-Binding Heme Proteins Myoglobin and the a- and #-globin chains of 
hemoglobin constitute a set of paralogous proteins. Myoglobin, the oxygen-binding 
heme protein of muscle, consists of a single polypeptide chain of 153 residues. 
Hemoglobin, the oxygen transport protein of erythrocytes, is a tetramer composed 
of two a-chains (141 residues each) and two -chains (146 residues each). These glo- 
bin paralogs—myoglobin, a-globin, and f-globin—share a strong degree of 
sequence homology (Figure 5.21). Human myoglobin and the human a-globin 
chain show 38 amino acid identities, whereas human a-globin and human f-globin 
have 64 residues in common. The relatedness suggests an evolutionary sequence of 
events in which chance mutations led to amino acid substitutions and divergence in 
primary structure. The ancestral myoglobin gene diverged first, after duplication of 
a primordial globin gene had given rise to its progenitor and an ancestral hemo- 
globin gene (Figure 5.22). Subsequently, the ancestral hemoglobin gene duplicated 
to generate the progenitors of the present-day a-globin and -globin genes. The 
ability to bind Oy» via a heme prosthetic group is retained by all three of these 
polypeptides. 


Serine Proteases Whereas the globins provide an example of gene duplication 
giving rise to a set of proteins in which the biological function has been highly con- 
served, other sets of proteins united by strong sequence homology show more di- 
vergent biological functions. Trypsin, chymotrypsin (see Chapter 14), and elastase 
are members of a class of proteolytic enzymes called serine proteases because of the 
central role played by specific serine residues in their catalytic activity. Thrombin, 
an essential enzyme in blood clotting, is also a serine protease. These enzymes show 
sufficient sequence homology to conclude that they arose via duplication of a prog- 
enitor serine protease gene, even though their substrate preferences are now quite 
different. 

















FIGURE 5.19 The sequence of cyto- 
chrome c from more than 40 different 
species reveals that 28 residues are in- 
variant. These invariant residues are scat- 
tered irregularly along the polypeptide 
chain, except for a cluster between 
residues 70 and 80. All cytochrome c 
polypeptide chains have a cysteine 
residue at position 17, and all but one 
have another Cys at position 14. These 
Cys residues serve to link the heme 
prosthetic group of cytochrome c to the 
protein. 
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FIGURE 5.20 This phylogenetic tree depicts the evolutionary relationships among organisms as determined 
by the similarity of their cytochrome c amino acid sequences. The numbers along the branches give the 
amino acid changes between a species and a hypothetical progenitor. Note that extant species are located 
only at the tips of branches. Below, the sequence of human cytochrome c is compared with an inferred an- 
cestral sequence represented by the base of the tree. Uncertainties are denoted by question marks. (Adapted 
from Creighton, T. E., 1983. Proteins: Structure and Molecular Properties. San Francisco: W. H. Freeman.) 
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FIGURE 5.21 The amino acid sequences of the globin chains of human hemoglobin and myoglobin show a 
strong degree of homology. The æ- and B-globin chains share 64 residues of their approximately 140 residues 
in common. Myoglobin and the a-globin chain have 38 amino acid sequence identities. This homology is fur- 
ther reflected in these proteins’ tertiary structure. 


Myoglobin 





FIGURE 5.22 This evolutionary tree is inferred from 
the homology between the amino acid sequences 
of the a-globin, B-globin, and myoglobin chains. 
Duplication of an ancestral globin gene allowed 
the divergence of the myoglobin and ancestral he- 
moglobin genes. Another gene duplication event 
subsequently gave rise to ancestral a and £ forms, 
as indicated. Gene duplication is an important evo- 
lutionary force in creating diversity. 
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FIGURE 5.23 The tertiary structures of hen egg white 
lysozyme and human a-lactalbumin are very similar. 
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Human milk a-lactalbumin Hen egg white lysozyme 


Apparently Different Proteins May Share a Common Ancestry 


A more remarkable example of evolutionary relatedness is inferred from sequence 
homology between hen egg white lysozyme and human milk æ-lactalbumin, pro- 
teins of different biological activity and origin. Lysozyme (129 residues) and 
a-lactalbumin (123 residues) are identical at 48 positions. Lysozyme hydrolyzes the 
polysaccharide wall of bacterial cells, whereas a-lactalbumin regulates milk sugar 
(lactose) synthesis in the mammary gland. Although both proteins act in reactions 
involving carbohydrates, their functions show little similarity otherwise. Neverthe- 
less, their tertiary structures are strikingly similar (Figure 5.23). It is conceivable 
that many proteins are related in this way, but time and the course of evolutionary 
change erased most evidence of their common ancestry. In contrast to this case, the 
proteins G-actin and hexokinase (Figure 5.24) share essentially no sequence homol- 





(b) 


FIGURE 5.24 The tertiary structures of (a) hexokinase and (b) actin; ADP is bound to both proteins (purple). 
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TABLE 5.4 ] 

Abnormal Hemoglobin* Normal Residue and Position Substitution 
a-chain 

Torino Phenylalanine 43 Valine 
MbBoston Histidine 58 Tyrosine 
Chesapeake Arginine 92 Leucine 
Georgia Proline 95 Leucine 
Tarrant Aspartate 126 Asparagine 
Suresnes Arginine 141 Histidine 
B-chain 

S Glutamate 6 Valine 
Riverdale—Bronx Glycine 24 Arginine 
Genova Leucine 28 Proline 
Zurich Histidine 63 Arginine 
Mmilwaukee Valine 67 Glutamate 
Muyde Park Histidine 92 Tyrosine 
Yoshizuka Asparagine 108 Aspartate 
Hiroshima Histidine 146 Aspartate 











*Hemoglobin variants are often given the geographical name of their origin. 
Adapted from Dickerson, R. E., and Geis, l., 1983. Hemoglobin: Structure, Function, Evolution and Pathology. Menlo Park, CA: 
Benjamin/Cummings. 


ogy, yet they have strikingly similar three-dimensional structures, even though their 
biological roles and physical properties are very different. Actin forms a filamentous 
polymer that is a principal component of the contractile apparatus in muscle; hex- 
okinase is a cytosolic enzyme that catalyzes the first reaction in glucose catabolism. 


A Mutant Protein Is a Protein with a Slightly Different 
Amino Acid Sequence 


Given a large population of individuals, a considerable number of sequence variants 
can be found for a protein. These variants are a consequence of mutations in a gene 
(base substitutions in DNA) that have arisen naturally within the population. Gene mu- 
tations lead to mutant forms of the protein in which the amino acid sequence is altered 
at one or more positions. Many of these mutant forms are “neutral” in that the func- 
tional properties of the protein are unaffected by the amino acid substitution. Others 
may be nonfunctional (if loss of function is not lethal to the individual), and still oth- 
ers may display a range of aberrations between these two extremes. The severity of the 
effects on function depends on the nature of the amino acid substitution and its role 
in the protein. These conclusions are exemplified by the hundreds of human hemo- 
globin variants that have been discovered to date. Some of these are listed in Table 5.4. 

A variety of effects on the hemoglobin molecule are seen in these mutants, in- 
cluding alterations in oxygen affinity, heme affinity, stability, solubility, and subunit 
interactions between the a-globin and 6-globin polypeptide chains. Some variants 
show no apparent changes, whereas others, such as HbS, sickle-cell hemoglobin 
(see Chapter 15), result in serious illness. This diversity of response indicates that 
some amino acid changes are relatively unimportant, whereas others drastically al- 
ter one or more functions of a protein. 


5:6: Can Polypeptides Be Synthesized in the Laboratory? 


Chemical synthesis of peptides and polypeptides of defined sequence can be car- 
ried out in the laboratory. Formation of peptide bonds linking amino acids together 
is not a chemically complex process, but making a specific peptide can be chal- 
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5.7 Do Proteins Have Chemical Groups Other Than Amino Acids? 


lenging because various functional groups present on side chains of amino acids 
may also react under the conditions used to form peptide bonds. Furthermore, if 
correct sequences are to be synthesized, the a-COOH group of residue x must be 
linked to the a-NHg, group of neighboring residue yin a way that prevents reaction 
of the amino group of x with the carboxyl group of y. In essence, any functional 
groups to be protected from reaction must be blocked while the desired coupling 
reactions proceed. Also, the blocking groups must be removable later under condi- 
tions in which the newly formed peptide bonds are stable. An ingenious synthetic 
strategy to circumvent these technical problems is orthogonal synthesis. An orthogo- 
nal system is defined as a set of distinctly different blocking groups—one for side- 
chain protection, another for a-amino protection, and a third for a-carboxyl pro- 
tection or anchoring to a solid support (see following discussion). Ideally, any of the 
three classes of protecting groups can be removed in any order and in the presence 
of the other two, because the reaction chemistries of the three classes are suffi- 
ciently different from one another. In peptide synthesis, all reactions must proceed 
with high yield if peptide recoveries are to be acceptable. Peptide formation be- 
tween amino and carboxyl groups is not spontaneous under normal conditions (see 
Chapter 4), so one or the other of these groups must be activated to facilitate the 
reaction. Despite these difficulties, biologically active peptides and polypeptides 
have been recreated by synthetic organic chemistry. Milestones include the pio- 
neering synthesis of the nonapeptide posterior pituitary hormones oxytocin and va- 
sopressin by Vincent du Vigneaud in 1953 and, in later years, larger proteins such 
as insulin (21 A-chain and 30 B-chain residues), ribonuclease A (124 residues), and 
HIV protease (99 residues). 


Solid-Phase Methods Are Very Useful in Peptide Synthesis 


Bruce Merrifield and his collaborators pioneered a clever solution to the problem 
of recovering intermediate products in the course of a synthesis. The carboxyl- 
terminal residues of synthesized peptide chains are covalently anchored to an in- 
soluble resin (polystyrene particles) that can be removed from reaction mixtures 
simply by filtration. After each new residue is added successively at the free amino- 
terminus, the elongated product is recovered by filtration and readied for the 
next synthetic step. Because the growing peptide chain is coupled to an insoluble 
resin bead, the method is called solid-phase synthesis. The procedure is detailed 
in Figure 5.25. This cyclic process is automated and computer controlled so that 
the reactions take place in a small cup with reagents being pumped in and re- 
moved as programmed. 


S/a Do Proteins Have Chemical Groups Other 
Than Amino Acids? 


Many proteins consist of only amino acids and contain no other chemical groups. 
The enzyme ribonuclease and the contractile protein actin are two such examples. 
Such proteins are called simple proteins. However, many other proteins contain var- 
ious chemical constituents as an integral part of their structure. Some of these con- 
stituents arise through covalent modification of amino acid side chains in proteins 
after the protein has been synthesized. Such alterations are called post-translational 
modifications. For example, the reaction of two cysteine residues in a protein to 
form a disulfide linkage (Figure 4.8b) is a post-translational modification. Many of 
the prominent post-translational modifications, such as those listed in Table 5.5, can 
act as “on-off switches” that regulate the function or cellular location of the pro- 
tein. Approximately 500 post-translational modifications are listed in the RESID 
database accessible through the National Cancer Institute Frederick Advanced Bio- 
medical Computing Center at hitp://www.ncifcrf. gou/RESID. 

A common form of post-translational modification not to be found in such a 
database or in Table 5.5 is the removal of amino acids from the protein by prote- 
olytic cleavage. Many proteins localized in specific subcellular compartments have 
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Proteome is the complete catalog of proteins 
encoded by a genome; in cell-specific terms, a 
proteome is the complete set of proteins found 
in a particular cell type at a particular time. 





UES Some P 





Amino Acid Side 
Name Nonprotein Part Chain Modified Examples 
Phosphorylation —PO;7- S, T, Y Hormone receptors, 
regulatory enzymes 
Acetylation —CH,COO- K Histones 
Methylation —CH3 K,R Histones 
Acylation Palmitic acid C G-protein-coupled 
receptors 
Prenylation Prenyl group C Ras p21 
ADP-ribosylation ADP-ribose H, R G proteins, eukaryotic 
elongation factors 
Adenylylation AMP Y Glutamine synthetase 











N-terminal signal sequences that stipulate their proper destination. Such signal se- 
quences typically are clipped off during their journey. Other proteins, such as some 
hormones or potentially destructive proteases, are synthesized in an inactive form 
and converted into an active form through proteolytic removal of some of their 
amino acids. 

The general term for proteins containing nonprotein constituents is conjugated 
proteins (Table 5.6). Because association of the protein with the conjugated group 
does not occur until the protein has been synthesized, these associations are post- 
translational as well, although such terminology is usually not applied to these pro- 
teins (with the possible exception of glycoproteins). As Table 5.6 indicates, conju- 
gated proteins are typically classified according to the chemistry of the nonprotein 
part. If the nonprotein part participates in the protein’s function, it is referred to as 
a prosthetic group. Conjugation of proteins with these different nonprotein con- 
stituents dramatically enhances the repertoire of functionalities available to proteins. 


5:8: What Are the Many Biological Functions of Proteins? 


Proteins are the agents of biological function. Virtually every cellular activity is dependent 
on one or more particular proteins. Thus, a convenient way to classify the enormous 
number of proteins is to group them according to the biological roles they serve. 
Figure 5.26 summarizes the classification of proteins found in the human proteome 
according to their function. 

Proteins fill essentially every biological role, with the exception of information 
storage. The ability to bind other molecules (ligands) is common to many proteins. 
Binding proteins typically interact noncovalently with their specific ligands. Trans- 
port proteins are one class of binding proteins. Transport proteins include mem- 





TABLE 5.6 


Nonprotein Part 





Metalloproteins and 
metal-activated proteins 


Hemoproteins 


Flavoproteins 





Name Association Examples 

Lipoproteins Lipids Noncovalent Blood lipoprotein complexes (HDL, LDL) 
Nucleoproteins RNA, DNA Noncovalent Ribosomes, chromosomes 

Glycoproteins Carbohydrate groups Covalent Immunoglobulins, LDL receptor 


Ca, Kt, Fe?t, Zn2*, 


Co?t, others 
Heme group 


FMN, FAD 


Covalent to noncovalent Metabolic enzymes, kinases, phosphatases, 


among others 
Covalent or noncovalent Hemoglobin, cytochromes 


Covalent or noncovalent Electron transfer enzymes 








5.8 What Are the Many Biological Functions of Proteins? 


brane proteins that transport substances across membranes, as well as soluble pro- 
teins that deliver specific nutrients or waste products throughout the body. Scaffold 
proteins are a class of binding proteins that uses protein-protein interactions to re- 
cruit other proteins into multimeric assemblies whose purpose is to mediate and co- 
ordinate the flow of information in cells. Catalytic proteins (enzymes) mediate al- 
most every metabolic reaction. Regulatory proteins that bind to specific nucleotide 
sequences within DNA control gene expression. Hormones are another kind of reg- 
ulatory protein in that they convey information about the environment and deliver 
this information to cells when they bind to specific receptors. Switch proteins such 
as G-proteins can switch between two conformational states—an “on” state and an 
“off” state—and act via this conformational switching, as regulatory proteins. Struc- 
tural proteins give form to cells and subcellular structures. The great diversity in 
function that characterizes biological systems is based on attributes that proteins 
possess. 


All Proteins Function through Specific Recognition and Binding of Some Target 
Molecule Although the classification of proteins according to function has ad- 
vantages, many proteins are not assigned readily to one of the traditional groupings. 
Further, classification can be somewhat arbitrary, because many proteins fit more 
than one category. However, for all categories, the protein always functions through 


Cell adhesion (577, 1.9%) 
Miscellaneous (1318, 4.3%) 























Chaperone (159, 0.5%) 
Viral protein (100, 0.3%) 
Transfer/carrier protein (203, 0.7%) 


Transcription factor (1850, 6.0%) 


Nucleic acid enzyme (2308, 7.5%) 


Signaling molecule (376, 1.2%) 


Receptor (1543, 5.0%) 


Kinase (868, 2.8%) 


WOON psuer peusis 


Select regulatory 
molecule (988, 3.2%) 


Transferase (610, 2.0%) 
Synthase and synthetase 
(313, 1.0%) 
Oxidoreductase (656, 2.1%) 
Lyase (117, 0.4%) 

Ligase (56, 0.2%) 

Isomerase (163, 0.5%) 

Hydrolase (1227, 4.0%) 


FIGURE 5.26 Proteins of the human genome grouped according to their molecular function. The numbers and 
percentages within each functional category are enclosed in parentheses. Note that the function of more than 
40% of the proteins encoded by the human genome remains unknown. Considering those of known function, 
enzymes (including kinases and nucleic acid enzymes) account for about 20% of the total number of proteins; 
nucleic acid-binding proteins of various kinds, about 14%, among which almost half are gene-regulatory proteins 
(transcription factors). Transport proteins collectively constitute about 5% of the total; and structural proteins, 
another 5%. (Adapted from Figure 15 in Venter, J.C, et al, 2001. The sequence of the human genome. Science 291:1304-1351.) 


Cytoskeletal structural protein (876, 2.8%) 

Extracellular matrix (437, 1.4%) 

Immunoglobulin (264, 0.9%) 

Jon channel (406, 1.3%) 

Motor (376, 1.2%) 

Structural protein of muscle (296, 1.0%) 


Protooncogene (902, 2.9%) 


wt Select calcium-binding protein (34, 0.1%) 
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Transporter (533, 1.7%) 


Molecular function unknown (12809, 41.7%) 
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FIGURE 5.27 Saturation curve or binding isotherm. 


specific recognition and binding of some other molecule, although for structural 
proteins, it is usually self-recognition and assembly into stable multimeric arrays. 
Protein behavior provides the cardinal example of molecular recognition through struc- 
tural complementarity, a fundamental principle of biochemistry that was presented in 
Chapter 1. 


Protein Binding The interaction of a protein with its target usually can be de- 
scribed in simple quantitative terms. Let’s explore the simplest situation in which a 
protein has a single binding site for the molecule it binds (its ligand; Chapter 1). If 
we treat the interaction between the protein (P) and the ligand (L) as a dissociation 
reaction: 





PL P+L. 





The equilibrium constant for the reaction as written, 
Keq = [P][L]/[PL], 


is a dissociation constant, because it describes the dissociation of the ligand from 
the protein. Biochemists typically use dissociation constants (Kp) to describe bind- 
ing phenomena. Because brackets ([ ]) denote molar concentrations, dissociation 
constants have the units of M. 

Typically, the ligand concentration is much greater than the protein concentra- 
tion. Under such conditions, a plot of the moles of ligand bound per mole of pro- 
tein (defined as [PL]/([P] + [PL])) versus [L] yields a hyperbolic curve known as 
a saturation curve or binding isotherm (Figure 5.27). 

If we define the fractional saturation of P with L, [PL]/([P] + [PL]), as v, a lit- 
tle algebra yields 


v = [L]/ (K + [L]). 
Thus, when v = 0.5, 
[L] = Kp 


That is, the concentration of L where half the protein has L bound is equal to the 
value of Kp. The smaller this number is, the better the ligand binds to the protein; 
that is, a small Kp means that the protein is half-saturated with L at a low con- 
centration of L. In other words, if Kp is small, the protein binds the ligand avidly. 
Typical Kp values fall in a range from 107 M to 107}? M. 


The Ligand-Binding Site Ligand binding occurs through noncovalent interac- 
tions between the protein and ligand. The lack of covalent interactions means that 
binding is readily reversible. Proteins display specificity in ligand binding because 
they possess a specific site, the binding site, within their structure that is comple- 
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mentary to the structure of the ligand, its charge distribution, and any H-bond 
donors or acceptors it might have. Structural complementarity within the binding 
site is achieved because part of the three-dimensional structure of the protein pro- 
vides an ensemble of amino acid side chains (and polypeptide backbone atoms) 
that establish an interactive cavity complementary to the ligand molecule. When a 
ligand binds to the protein, the protein usually undergoes a conformational 
change. This new protein conformation provides an even better fit with the ligand 
than before. Such changes are called ligand-induced conformational changes, and 
the result is an even more stable interaction between the protein and its ligand. 
Thus, in a general sense, most proteins are binding proteins because ligand 
binding is a hallmark of protein function. Catalytic proteins (enzymes) bind sub- 
strates; regulatory proteins bind hormones or other proteins or regulatory se- 
quences in genes; structural proteins bind to and interact with each other; and the 
many types of transport proteins bind ligands, facilitating their movement from one 
place to another. Many proteins accomplish their function through the binding of 
other protein molecules, a phenomenon called protein-protein interaction. Some 
proteins engage in protein-protein interactions with proteins that are similar or 
identical to themselves so that an oligomeric structure is formed, as in hemoglobin. 
Other proteins engage in protein-protein interactions with proteins that are very 
different from themselves, as in the anchoring proteins or the scaffolding proteins 
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SUMMARY 


The primary structure (the amino acid sequence) of a protein is en- 
coded in DNA in the form of a nucleotide sequence. Expression of this 
genetic information is realized when the polypeptide chain is synthe- 
sized and assumes its functional, three-dimensional architecture. Pro- 
teins are the agents of biological function. 


5.1 What Architectural Arrangements Characterize Protein Structure? 
Proteins are generally grouped into three fundamental structural 
classes—soluble, fibrous, and membrane—based on their shape and sol- 
ubility. In more detail, protein structure is described in terms of a hier- 
archy of organization: 


Primary (1°) structure—the protein’s amino acid sequence 


Secondary (2°) structure—regular elements of structure (helices, 
sheets) within the protein created by hydrogen bonds 


Tertiary (3°) structure—the folding of the polypeptide chain in 
three-dimensional space 


Quaternary (4°) structure—the subunit organization of multimeric 
proteins 


The three higher levels of protein structure form and are maintained 
exclusively through noncovalent interactions. 


5.2 How Are Proteins Isolated and Purified from Cells? Cells contain 
thousands of different proteins. A protein of choice can be isolated and 
purified from such complex mixtures by exploiting two prominent phys- 
ical properties: size and electrical charge. A more direct approach is to 
employ affinity purification strategies that take advantage of the biolog- 
ical function or specific recognition properties of a protein. A typical 
protein purification strategy will use a series of separation methods to 
obtain a pure preparation of the desired protein. 


5.3 How Is the Amino Acid Analysis of Proteins Performed? Acid 
treatment of a protein hydrolyzes all of the peptide bonds, yielding a 
mixture of amino acids. Chromatographic analysis of this hydrolysate 
reveals the amino acid composition of the protein. Proteins vary in their 
amino acid composition, but most proteins contain at least one of each 
of the 20 common amino acids. To a very rough approximation, pro- 
teins contain about 30% charged amino acids and about 30% hydro- 


phobic amino acids (when aromatic amino acids are included in this 
number), the remaining being polar, uncharged amino acids. 


5.4 How Is the Primary Structure of a Protein Determined? The pri- 
mary structure (amino acid sequence) of a protein can be determined 
by a variety of chemical and enzymatic methods. Alternatively, mass 
spectroscopic methods can also be used. In the chemical and enzymatic 
protocols, a pure polypeptide chain whose disulfide linkages have been 
broken is the starting material. Methods that identify the N-terminal 
and C-terminal residues of the chain are used to determine which 
amino acids are at the ends, and then the protein is cleaved into defined 
sets of smaller fragments using enzymes such as trypsin or chymotrypsin 
or chemical cleavage by agents such as cyanogen bromide. The se- 
quences of these products can be obtained by Edman degradation. Ed- 
man degradation is a powerful method for stepwise release and se- 
quential identification of amino acids from the N-terminus of the 
polypeptide. The amino acid sequence of the entire protein can be re- 
constructed once the sequences of overlapping sets of peptide frag- 
ments are known. In mass spectrometry, an ionized protein chain is bro- 
ken into an array of overlapping fragments. Small differences in the 
masses of the individual amino acids lead to small differences in the 
masses of the fragments, and the ability of mass spectrometry to mea- 
sure mass-to-charge ratios very accurately allows computer devolution of 
the data into an amino acid sequence. The amino acid sequences of 
about a million different proteins are known. The vast majority of these 
amino acid sequences were deduced from nucleotide sequences avail- 
able in genomic databases. 


5.5 What Is the Nature of Amino Acid Sequences? Proteins have 
unique amino acid sequences, and similarity in sequence between pro- 
teins implies evolutionary relatedness. Homologous proteins share se- 
quence similarity and show structural resemblance. These relationships 
can be used to trace evolutionary histories of proteins and the organisms 
that contain them, and the study of such relationships has given rise to 
the field of molecular evolution. Related proteins, such as the oxygen- 
binding proteins of myoglobin and hemoglobin or the serine proteases, 
share a common evolutionary origin. Sequence variation within a protein 
arises from mutations that result in amino acid substitution, and the op- 
eration of natural selection on these sequence variants is the basis of evo- 
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lutionary change. Occasionally, a sequence variant with a novel biological 
function may appear, upon which selection can operate. 


5.6 Can Polypeptides Be Synthesized in the Laboratory? It is possi- 
ble, although difficult, to synthesize proteins in the laboratory. The ma- 
jor obstacles involve joining desired amino acids to a growing chain us- 
ing chemical methods that avoid side reactions and the creation of 
undesired products, such as the modification of side chains or the ad- 
dition of more than one residue at a time. Solid-state techniques along 
with orthogonal protection methods circumvent many of these prob- 
lems, and polypeptide chains having more than 100 amino acid residues 
have been artificially created. 


5.7 Do Proteins Have Chemical Groups Other Than Amino Acids? Al- 
though many proteins are composed of just amino acids, other proteins 
undergo post-translational modifications to certain amino acid side 
chains. These modifications often regulate the function of the proteins. 
In addition, many proteins are conjugated with various other chemical 
components, including carbohydrates, lipids, nucleic acids, metal and 
other inorganic ions, and a host of novel structures such as heme or 
flavin. Association with these nonprotein substances dramatically ex- 


tends the physical and chemical properties that proteins possess, in turn 
creating a much greater repertoire of functional possibilities. 


5.8 What Are the Many Biological Functions of Proteins? Proteins are 
the agents of biological function. Their ability to bind various ligands is 
intimately related to their function and thus forms the basis of most clas- 
sification schemes. Transport proteins bind molecules destined for 
transport across membranes or around the body. Enzymes bind the re- 
actants unique to the reactions they catalyze. Regulatory proteins are of 
two general sorts: those that bind small molecules that are physiological 
or environmental cues, such as hormone receptors, or those that bind 
to DNA and regulate gene expression, such as transcription activators. 
These are just a few prominent examples. Indeed, the great diversity in 
function that characterizes biological systems is based on the attributes 
that proteins possess. Proteins usually interact noncovalently with their 
ligands, and often the interaction can be defined in simple quantitative 
terms by a protein-ligand dissociation constant. Proteins display speci- 
ficity in ligand binding because the structure of the protein’s ligand- 
binding site is complementary to the structure of the ligand. Some pro- 
teins act through binding other proteins. Such protein-protein 
interactions lie at the heart of many biological functions. 


PROBLEMS 


CENGAGENOW" Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login 


1. The element molybdenum (atomic weight 95.95) constitutes 0.08% 
of the weight of nitrate reductase. If the molecular weight of nitrate 
reductase is 240,000, what is its likely quaternary structure? 


2. Amino acid analysis of an oligopeptide 7 residues long gave 
Asp Leu Lys Met Phe Tyr 


The following facts were observed: 
a. Trypsin treatment had no apparent effect. 
b. The phenylthiohydantoin released by Edman degradation was 
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c. Brief chymotrypsin treatment yielded several products, including 
a dipeptide and a tetrapeptide. The amino acid composition of 
the tetrapeptide was Leu, Lys, and Met. 


d. Cyanogen bromide treatment yielded a dipeptide, a tetrapeptide, 
and free Lys. 


What is the amino acid sequence of this heptapeptide? 
3. Amino acid analysis of another heptapeptide gave 
Asp Glu Leu Lys 
Met Tyr Trp NH4* 
(NH,°* is released by acid hydrolysis of N and/or Q amides.) 
The following facts were observed: 


a. Trypsin had no effect. 
b. The phenylthiohydantoin released by Edman degradation was 
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c. Brief chymotrypsin treatment yielded several products, including 


a dipeptide and a tetrapeptide. The amino acid composition of 
the tetrapeptide was Glx, Leu, Lys, and Met. 


d. Cyanogen bromide treatment yielded a tetrapeptide that had a 
net positive charge at pH 7 and a tripeptide that had a zero net 
charge at pH 7. 

What is the amino acid sequence of this heptapeptide? 


4. Amino acid analysis of a decapeptide revealed the presence of the 
following products: 


NH,* Asp Glu Tyr Arg 
Met Pro Lys Ser Phe 


The following facts were observed: 

a. Neither carboxypeptidase A or B treatment of the decapeptide 
had any effect. 

b. Trypsin treatment yielded two tetrapeptides and free Lys. 

c. Clostripain treatment yielded a tetrapeptide and a hexapeptide. 

d. Cyanogen bromide treatment yielded an octapeptide and a dipep- 
tide of sequence NP (using the one-letter codes). 

e. Chymotrypsin treatment yielded two tripeptides and a tetrapep- 
tide. The N-terminal chymotryptic peptide had a net charge of —1 
at neutral pH and a net charge of —3 at pH 12. 

f. One cycle of Edman degradation gave the PTH derivative 
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What is the amino acid sequence of this decapeptide? 

5. Analysis of the blood of a catatonic football fan revealed large con- 
centrations of a psychotoxic octapeptide. Amino acid analysis of this 
octapeptide gave the following results: 


2Ala 1l1Arg 1lAsp 1Met 2Tyr 1Va 1 NH,* 


The following facts were observed: 
a. Partial acid hydrolysis of the octapeptide yielded a dipeptide of 
the structure 





B,C CH; 
ai, 9 CH 
H; 'N—C—C—N—C—COOH 


b. Chymotrypsin treatment of the octapeptide yielded two tetra- 
peptides, each containing an alanine residue. 

c. Trypsin treatment of one of the tetrapeptides yielded two dipep- 
tides. 

d. Cyanogen bromide treatment of another sample of the same 
tetrapeptide yielded a tripeptide and free Tyr. 

e. End-group analysis of the other tetrapeptide gave Asp. 


What is the amino acid sequence of this octapeptide? 


6. Amino acid analysis of an octapeptide revealed the following 


composition: 
2 Arg 1Gly 1Met 1Trp 1 Tyr 1 Phe 1 Lys 


The following facts were observed: 
a. Edman degradation gave 
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b. CNBr treatment yielded a pentapeptide and a tripeptide contain- 
ing phenylalanine. 

c. Chymotrypsin treatment yielded a tetrapeptide containing a 
C-terminal indole amino acid and two dipeptides. 

d. Trypsin treatment yielded a tetrapeptide, a dipeptide, and free Lys 
and Phe. 

e. Clostripain yielded a pentapeptide, a dipeptide, and free Phe. 


What is the amino acid sequence of this octapeptide? 


7. Amino acid analysis of an octapeptide gave the following results: 


1Ala 1Arg 1Asp 1Gly 3Ile 1Va 1NH,* 


The following facts were observed: 

a. Trypsin treatment yielded a pentapeptide and a tripeptide. 

b. Chemical reduction of the free a-COOH and subsequent acid hy- 
drolysis yielded 2-aminopropanol. 

c. Partial acid hydrolysis of the tryptic pentapeptide yielded, 
among other products, two dipeptides, each of which contained 
C-terminal isoleucine. One of these dipeptides migrated as an 
anionic species upon electrophoresis at neutral pH. 

d. The tryptic tripeptide was degraded in an Edman sequenator, 
yielding first A, then B: 
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What is an amino acid sequence of the octapeptide? Four sequences 
are possible, but only one suits the authors. Why? 


8. An octapeptide consisting of 2 Gly, 1 Lys, 1 Met, 1 Pro, 1 Arg, 1 Trp, 


and 1 Tyr was subjected to sequence studies. The following was found: 
a. Edman degradation yielded 
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b. Upon treatment with carboxypeptidases A, B, and C, only car- 
boxypeptidase C had any effect. 

c. Trypsin treatment gave two tripeptides and a dipeptide. 

d. Chymotrypsin treatment gave two tripeptides and a dipeptide. 
Acid hydrolysis of the dipeptide yielded only Gly. 

e. Cyanogen bromide treatment yielded two tetrapeptides. 

f. Clostripain treatment gave a pentapeptide and a tripeptide. 


What is the amino acid sequence of this octapeptide? 


. Amino acid analysis of an oligopeptide containing nine residues 


revealed the presence of the following amino acids: 
Arg Cys Gly Leu Met Pro Tyr Val 


The following was found: 
a. Carboxypeptidase A treatment yielded no free amino acid. 
b. Edman analysis of the intact oligopeptide released 


a H H 
VA C— CH 
`c- R CH, 
f H 


c. Neither trypsin nor chymotrypsin treatment of the nonapeptide 
released smaller fragments. However, combined trypsin and chy- 
motrypsin treatment liberated free Arg. 

d. CNBr treatment of the 8-residue fragment left after combined 
trypsin and chymotrypsin action yielded a 6-residue fragment con- 
taining Cys, Gly, Pro, Tyr, and Val; and a dipeptide. 

e. Treatment of the 6-residue fragment with -mercaptoethanol 
yielded two tripeptides. Brief Edman analysis of the tripeptide 
mixture yielded only PTH-Cys. (The sequence of each tripeptide, 
as read from the N-terminal end, is alphabetical if the one-letter 
designation for amino acids is used.) 


What is the amino acid sequence of this nonapeptide? 


Describe the synthesis of the dipeptide Lys-Ala by Merrifield’s solid- 
phase chemical method of peptide synthesis. What pitfalls might be 
encountered if you attempted to add a leucine residue to Lys-Ala to 
make a tripeptide? 

Electrospray ionization mass spectrometry (ESI-MS) of the polypep- 
tide chain of myoglobin yielded a series of m/z peaks (similar to 
those shown in Figure 5.14 for aerolysin K). Two successive peaks 
had m/z values of 1304.7 and 1413.2, respectively. Calculate the 
mass of the myoglobin polypeptide chain from these data. 
Phosphoproteins are formed when a phosphate group is esterified 
to an —OH group of a Ser, Thr, or Tyr side chain. At typical cel- 
lular pH values, this phosphate group bears two negative charges 
—OPO;?-. Compare this side-chain modification to the 20 side 
chains of the common amino acids found in proteins and com- 
ment on the novel properties that it introduces into side-chain 
possibilities. 

A quantitative study of the interaction of a protein with its ligand 
yielded the following results: 


Ligand concentration 1 2 3 4 5 6 9 12 
(mM) 

v (moles of ligand 
bound per mole 
of protein) 


0.28 0.45 0.56 0.60 0.71 0.75 0.79 0.83 


Plot a graph of [L] versus v. Determine Kp, the dissociation constant 
for the interaction between the protein and its ligand, from the graph. 


Biochemistry on the Web 
14. 


The human insulin receptor substrate-1 (IRS-1) is designated pro- 
tein P35568 in the protein knowledge base on the ExPASy Web site 
(http://us.expasy.org/). Go to the PeptideMass tool on this Web site 
and use it to see the results of trypsin digestion of IRS-1. How many 
amino acids does IRS-1 have? What is the average molecular mass 
of IRS-1? What is the amino acid sequence of the tryptic peptide of 
IRS-1 that has a mass of 1741.9629? 
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Preparing for the MCAT Exam 

15. Proteases such as trypsin and chymotrypsin cleave proteins at dif- 
ferent sites, but both use the same reaction mechanism. Based on 
your knowledge of organic chemistry, suggest a “universal” protease 
reaction mechanism for hydrolysis of the peptide bond. 

16. Table 5.4 presents some of the many known mutations in the genes 
encoding the a- and -globin subunits of hemoglobin. 
a. Some of these mutations affect subunit interactions between the 

subunits. In an examination of the tertiary structure of globin 


chains, where would you expect to find amino acid changes in mu- 
tant globins that affect formation of the hemoglobin a2, quater- 
nary structure? 

b. Other mutations, such as the S form of the 6-globin chain, in- 
crease the tendency of hemoglobin tetramers to polymerize into 
very large structures. Where might you expect the amino acid sub- 
stitutions to be in these mutants? 
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APPENDIX TO CHAPTER 5 





Protein Techniques' 


Dialysis and Ultrafiltration 


If a solution of protein is separated from a bathing solution by a semipermeable 
membrane, small molecules and ions can pass through the semipermeable mem- 
brane to equilibrate between the protein solution and the bathing solution, called 
the dialysis bath or dialysate (Figure 5A.1). This method is useful for removing small 
molecules from macromolecular solutions or for altering the composition of the 
protein-containing solution. 

Ultrafiltration is an improvement on the dialysis principle. Filters with pore sizes 
over the range of biomolecular dimensions are used to filter solutions to select for 
molecules in a particular size range. Because the pore sizes in these filters are mi- 
croscopic, high pressures are often required to force the solution through the filter. 
This technique is useful for concentrating dilute solutions of macromolecules. The 
concentrated protein can then be diluted into the solution of choice. 


lon Exchange Chromatography Can Be Used to Separate Molecules 
on the Basis of Charge 


Charged molecules can be separated using ion exchange chromatography, a process in 
which the charged molecules of interest (ions) are exchanged for another ion (usually 
a salt ion) on a charged solid support. In a typical procedure, solutes in a liquid phase, 
usually water, are passed through a column filled with a porous solid phase composed 
of synthetic resin particles containing charged groups. Resins containing positively 
charged groups attract negatively charged solutes and are referred to as anion ex- 
change resins. Resins with negatively charged groups are cation exchangers. Figure 5A.2 
shows several typical anion and cation exchange resins. Weakly acidic or basic groups 
on ion exchange resins exhibit charges that are dependent on the pH of the bathing 
solution. Changing the pH will alter the ionic interaction between the resin groups 


Semipermeable bag 
containing protein solution 
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Stir bar 


Magnetic stirrer 
for mixing 


FIGURE 5A.1 A dialysis experiment. The solution of macromolecules to be dialyzed is placed in a semiperme- 
able membrane bag, and the bag is immersed in a bathing solution. A magnetic stirrer gently mixes the solu- 
tion to facilitate equilibrium of diffusible solutes between the dialysate and the solution contained in the bag. 


TAlthough this appendix is titled Protein Techniques, these methods are also applicable to other macro- 
molecules such as nucleic acids. 
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FIGURE 5A.2 Cation (a) and anion (b) exchange resins commonly used for biochemical separations. 


and the bound ions. In all cases, the bare charges on the resin particles must be coun- 
terbalanced by oppositely charged ions in solution (counterions); salt ions (e.g., Na* or 
Cl~) usually serve this purpose. The separation of a mixture of several amino acids on 
a column of cation exchange resin is illustrated in Figure 5A.3. Increasing the salt 
concentration in the solution passing through the column leads to competition be- 
tween the cationic amino acid bound to the column and the cations in the salt for 
binding to the column. Bound cationic amino acids that interact weakly with the 
charged groups on the resin wash out first, and those interacting strongly are washed 
out only at high salt concentrations. 


Size Exclusion Chromatography 


Size exclusion chromatography is also known as gel filtration chromatography or molecular 
sieve chromatography. In this method, fine, porous beads are packed into a chromatog- 
raphy column. The beads are composed of dextran polymers (Sephadex), agarose 
(Sepharose), or polyacrylamide (Sephacryl or BioGel P). The pore sizes of these beads ap- 
proximate the dimensions of macromolecules. The total bed volume (Figure 5A.4) of 
the packed chromatography column, V, is equal to the volume outside the porous 
beads (V,) plus the volume inside the beads (V) plus the volume actually occupied by 
the bead material (Vz): V= Yo + Vit Vy. (Ve is typically less than 1% of V, and can be 
conveniently ignored in most applications.) 

As a solution of molecules is passed through the column, the molecules passively 
distribute between V, and V, depending on their ability to enter the pores (that is, 
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ACTIVE FIGURE 5A.3 The separation of amino acids on a cation exchange column. Test yourself on the con- 
cepts in this figure at www.cengage.com/login 


their size). If a molecule is too large to enter at all, it is totally excluded from V; and 
emerges first from the column at an elution volume, V., equal to V, (Figure 5A.4). 
If a particular molecule can enter the pores in the gel, its distribution is given by the 
distribution coefficient, Kp: 


Kp = (Ve — Vo)/Vi 


where V, is the molecule’s characteristic elution volume (Figure 5A.4). The chro- 
matography run is complete when a volume of solvent equal to V, has passed 
through the column. 


Electrophoresis 


Electrophoretic techniques are based on the movement of ions in an electrical field. 
An ion of charge q experiences a force F given by F = Eq/d, where E is the voltage 
(or electrical potential) and d is the distance between the electrodes. In a vacuum, 
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FIGURE 5A.4 (a) A gel filtration chromatography column. Larger molecules are excluded from the gel beads 
and emerge from the column sooner than smaller molecules, whose migration is retarded because they can 
enter the beads. (b) An elution profile. 


Fwould cause the molecule to accelerate. In solution, the molecule experiences fric- 
tional drag, F;, due to the solvent: 


F; = 6rrnv 


where ris the radius of the charged molecule, y is the viscosity of the solution, and v 
is the velocity at which the charged molecule is moving. So, the velocity of the charged 
molecule is proportional to its charge q and the voltage E, but inversely proportional 
to the viscosity of the medium 7 and d, the distance between the electrodes. 
Generally, electrophoresis is carried out not in free solution but in a porous sup- 
port matrix such as polyacrylamide or agarose, which retards the movement of mol- 
ecules according to their dimensions relative to the size of the pores in the matrix. 


SDS-Polyacrylamide Gel Electrophoresis (SDS-PAGE) 


SDS is sodium dodecylsulfate (sodium lauryl sulfate) (Figure 5A.5). The hydro- 
phobic tail of dodecylsulfate interacts strongly with polypeptide chains. The num- 
ber of SDS molecules bound by a polypeptide is proportional to the length (num- 
ber of amino acid residues) of the polypeptide. Each dodecylsulfate contributes two 
negative charges. Collectively, these charges overwhelm any intrinsic charge that 
the protein might have. SDS is also a detergent that disrupts protein folding (pro- 


O 

SS “4 x 

Na 0=S=0. Ze 7 Cie „Ho _CHe _CHs 
| CH, `CH CH, “CH, CH, ~CHy 
o- 
Nat 


FIGURE 5A.5 The structure of sodium dodecylsulfate (SDS). 


tein 3° structure). SDS-PAGE is usually run in the presence of sulfhydryl-reducing 
agents such as -mercaptoethanol so that any disulfide links between polypeptide 
chains are broken. The electrophoretic mobility of proteins upon SDS-PAGE is in- 
versely proportional to the logarithm of the protein’s molecular weight (Figure 
5A.6). SDS-PAGE is often used to determine the molecular weight of a protein. 


Isoelectric Focusing 


Isoelectric focusing is an electrophoretic technique for separating proteins ac- 
cording to their isoelectric points (pIs). A solution of ampholytes (amphoteric elec- 
trolytes) is first electrophoresed through a gel, usually contained in a small tube. 
The migration of these substances in an electric field establishes a pH gradient 
in the tube. Then a protein mixture is applied to the gel, and electrophoresis is 
resumed. As the protein molecules move down the gel, they experience the pH 
gradient and migrate to a position corresponding to their respective pls. At its 
pl, a protein has no net charge and thus moves no farther. 


Two-Dimensional Gel Electrophoresis 


This separation technique uses isoelectric focusing in one dimension and SDS- 
PAGE in the second dimension to resolve protein mixtures. The proteins in a mix- 
ture are first separated according to pI by isoelectric focusing in a polyacrylamide 
gel in a tube. The gel is then removed and laid along the top of an SDS-PAGE slab, 
and the proteins are electrophoresed into the SDS polyacrylamide gel, where they 
are separated according to size (Figure 5A.7). The gel slab can then be stained to 
reveal the locations of the individual proteins. Using this powerful technique, re- 
searchers have the potential to visualize and construct catalogs of virtually all the 
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FIGURE 5A.6 A plot of the relative electrophoretic mo- 
bility of proteins in SDS-PAGE versus the log of the mol- 
ecular weights of the individual polypeptides. 


FIGURE 5A.7 A two-dimensional electrophoresis separa- 
tion. A mixture of macromolecules is first separated ac- 
cording to charge by isoelectric focusing in a tube gel. 
The gel containing separated molecules is then placed 
on top of an SDS-PAGE slab, and the molecules are elec- 
trophoresed into the SDS-PAGE gel, where they are sepa- 
rated according to size. 
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A protein interacts with a metabolite. The 
metabolite is thus a ligand that binds specifically 
to this protein 


G+ %9=@ 


Protein Metabolite 


The metabolite can be immobilized by covalently 
coupling it to an insoluble matrix such as an 
agarose polymer. Cell extracts containing many 
individual proteins may be passed through 

the matrix. 

















Specific protein binds to ligand. All other 
unbound material is washed out of the matrix. 

















Adding an excess of free metabolite that will 
compete for the bound protein dissociates the 
protein from the chromatographic matrix. The 
protein passes out of the column complexed with 
free metabolite. 

















Purifications of proteins as | 
much as 1000-fold or more are 
routinely achieved in a single 
affinity chromatographic step 

like this. 


FIGURE 5A.8 Diagram illustrating affinity 
chromatography. 


proteins present in particular cell types. The ExPASy server (hitp://us.expasy.org) 
provides access to a two-dimensional polyacrylamide gel electrophoresis database 
named SWISS-2DPAGE. This database contains information on proteins, identi- 
fied as spots on two-dimensional electrophoresis gels, from many different cell 
and tissue types. 


Hydrophobic Interaction Chromatography 


Hydrophobic interaction chromatography (HIC) exploits the hydrophobic nature of pro- 
teins in purifying them. Proteins are passed over a chromatographic column packed 
with a support matrix to which hydrophobic groups are covalently linked. Phenyl 
Sepharose, an agarose support matrix to which phenyl groups are affixed, is a prime 
example of such material. In the presence of high salt concentrations, proteins bind 
to the phenyl groups by virtue of hydrophobic interactions. Proteins in a mixture 
can be differentially eluted from the phenyl groups by lowering the salt concentra- 
tion or by adding solvents such as polyethylene glycol to the elution fluid. 


High-Performance Liquid Chromatography 


The principles exploited in high-performance (or high-pressure) liquid chromatography 
(HPLC) are the same as those used in the common chromatographic methods such 
as ion exchange chromatography or size exclusion chromatography. Very-high- 
resolution separations can be achieved quickly and with high sensitivity in HPLC using 
automated instrumentation. Reverse-phase HPLC is a widely used chromatographic pro- 
cedure for the separation of nonpolar solutes. In reverse-phase HPLC, a solution of 
nonpolar solutes is chromatographed on a column having a nonpolar liquid immobi- 
lized on an inert matrix; this nonpolar liquid serves as the stationary phase. A more po- 
lar liquid that serves as the mobile phase is passed over the matrix, and solute molecules 
are eluted in proportion to their solubility in this more polar liquid. 


Affinity Chromatography 


Affinity purification strategies for proteins exploit the biological function of the tar- 
get protein. In most instances, proteins carry out their biological activity through 
binding or complex formation with specific small biomolecules, or ligands, as in 
the case of an enzyme binding its substrate. If this small molecule can be immo- 
bilized through covalent attachment to an insoluble matrix, such as a chromato- 
graphic medium like cellulose or polyacrylamide, then the protein of interest, in 
displaying affinity for its ligand, becomes bound and immobilized itself. It can 
then be removed from contaminating proteins in the mixture by simple means 
such as filtration and washing the matrix. Finally, the protein is dissociated or 
eluted from the matrix by the addition of high concentrations of the free ligand 
in solution. Figure 5A.8 depicts the protocol for such an affinity chromatography 
scheme. Because this method of purification relies on the biological specificity of 
the protein of interest, it is a very efficient procedure and proteins can be puri- 
fied several thousand-fold in a single step. 


Ultracentrifugation 


Centrifugation methods separate macromolecules on the basis of their characteris- 
tic densities. Particles tend to “fall” through a solution if the density of the solution 
is less than the density of the particle. The velocity of the particle through the 
medium is proportional to the difference in density between the particle and the 
solution. The tendency of any particle to move through a solution under centrifu- 
gal force is given by the sedimentation coefficient, S: 


S= (Pp — Pm) V/f 


where pp is the density of the particle or macromolecule, pm is the density of the 
medium or solution, Vis the volume of the particle, and fis the frictional coeffi- 
cient, given by 


f=K/v 


where vis the velocity of the particle and /;is the frictional drag. Nonspherical mol- 
ecules have larger frictional coefficients and thus smaller sedimentation coeffi- 
cients. The smaller the particle and the more its shape deviates from spherical, the 
more slowly that particle sediments in a centrifuge. 

Centrifugation can be used either as a preparative technique for separating and 
purifying macromolecules and cellular components or as an analytical technique to 
characterize the hydrodynamic properties of macromolecules such as proteins and 
nucleic acids. 
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Like the Greek sea god Proteus, who could assume 
different forms, proteins act through changes in con- 
formation. Proteins (from the Greek proteios, meaning 
“orimary") are the primary agents of biological func- 
tion. (‘Proteus, Old Man of the Sea, Roman period mosaic, 
from Thessalonika, 1st century a.d. National Archaeologi- 
cal Museum, Athens/Ancient Art and Architecture Collec- 
tion Ltd./Bridgeman Art Library, London/New York) 


National Archaeological Museum, Athens, Greece/Bridgeman i 


Art Library 





Growing in size and complexity 
Living things, masses of atoms, DNA, protein 
Dancing a pattern ever more intricate. 
Out of the cradle onto the dry land 
Here it is standing 

Atoms with consciousness 

Matter with curiosity. 

Stands at the sea 

Wonders at wondering 

m 

A universe of atoms 

An atom in the universe. 


Richard P. Feynman (1915-1988) 


From “The Value of Science” in Edward Hutchings, 
Jr., ed. 1958. Frontiers of Science: A Survey. 
New York: Basic Books. 
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Proteins: Secondary, Tertiary, 
and Quaternary Structure 


ESSENTIAL QUESTION 


Linus Pauling received the Nobel Prize in Chemistry in 1954. The award cited “his 
research into the nature of the chemical bond and its application to the elucidation 
of the structure of complex substances.” Pauling pioneered the study of secondary 
structure in proteins. 


How do the forces of chemical bonding determine the formation, stability, 
and myriad functions of proteins? 


Nearly all biological processes involve the specialized functions of one or more pro- 
tein molecules. Proteins function to produce other proteins, control all aspects of 
cellular metabolism, regulate the movement of various molecular and ionic species 
across membranes, convert and store cellular energy, and carry out many other ac- 
tivities. Essentially all of the information required to initiate, conduct, and regulate 
each of these functions must be contained in the structure of the protein itself. The 
previous chapter described the details of protein primary structure. However, pro- 
teins do not normally exist as fully extended polypeptide chains but rather as com- 
pact structures that biochemists refer to as “folded.” The ability of a particular pro- 
tein to carry out its function in nature is normally determined by its overall 
three-dimensional shape, or conformation. 

This chapter reveals and elaborates upon the exquisite beauty of protein struc- 
tures. What will become apparent in this discussion is that the three-dimensional 
structure of proteins and their biological function are linked by several overarching 
principles: 


1. Function depends on structure. 

. Structure depends both on amino acid sequence and on weak, noncovalent 
forces. 

. The number of protein folding patterns is very large but finite. 

. The structures of globular proteins are marginally stable. 

. Marginal stability facilitates motion. 

. Motion enables function. 


QD Ot ® OO 


6xl= What Noncovalent Interactions Stabilize the Higher 
Levels of Protein Structure? 


The amino acid sequence (primary structure) of any protein is dictated by covalent 
bonds, but the higher levels of structure—secondary, tertiary, and quaternary—are 
formed and stabilized by weak, noncovalent interactions (Figure 6.1). Hydrogen 
bonds, hydrophobic interactions, electrostatic bonds, and van der Waals forces are all 
noncovalent in nature, yet they are extremely important influences on protein con- 
formation. The stabilization free energies afforded by each of these interactions may 
be highly dependent on the local environment within the protein, but certain gener- 
alizations can still be made. 


Hydrogen Bonds Are Formed Whenever Possible 


Hydrogen bonds are generally made wherever possible within a given protein 
structure. In most protein structures that have been examined to date, component 
atoms of the peptide backbone tend to form hydrogen bonds with one another. 
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FIGURE 6.1 An electrostatic interaction between the e-amino group of a lysine and the y-carboxyl group of a 
glutamate. The protein is IRAK-4 kinase, an enzyme that phosphorylates other proteins (pdb id = 2NRY).The 
interaction shown is between Lys?!? (left) and Glu?’ (right). 


Furthermore, side chains capable of forming H bonds are usually located on the 
protein surface and form such bonds either with the water solvent or with other 
surface residues. The strengths of hydrogen bonds depend to some extent on en- 
vironment. The difference in energy between a side chain hydrogen bonded to wa- 
ter and that same side chain hydrogen bonded to another side chain is usually 
quite small. On the other hand, a hydrogen bond in the protein interior, away 
from bulk solvent, can provide substantial stabilization energy to the protein. Al- 
though each hydrogen bond may contribute an average of only a few kilojoules per 
mole in stabilization energy for the protein structure, the number of H bonds 
formed in the typical protein is very large. For example, in a-helices, the C=O and 
N—H groups of every interior residue participate in H bonds. The importance of 
H bonds in protein structure cannot be overstated. 


Hydrophobic Interactions Drive Protein Folding 


Hydrophobic “bonds,” or, more accurately, interactions, form because nonpolar side 
chains of amino acids and other nonpolar solutes prefer to cluster in a nonpolar en- 
vironment rather than to intercalate in a polar solvent such as water. The forming 
of hydrophobic “bonds” minimizes the interaction of nonpolar residues with water 
and is therefore highly favorable. Such clustering is entropically driven, and it is in 
fact the principal impetus for protein folding. The side chains of the amino acids in 
the interior or core of the protein structure are almost exclusively hydrophobic. Po- 
lar amino acids are much less common in the interior of a protein, but the protein 
surface may consist of both polar and nonpolar residues. 


lonic Interactions Usually Occur on the Protein Surface 


Ionic interactions arise either as electrostatic attractions between opposite charges 
or repulsions between like charges. Chapter 4 discusses the ionization behavior of 
amino acids. Amino acid side chains can carry positive charges, as in the case of ly- 
sine, arginine, and histidine, or negative charges, as in aspartate and glutamate. In 
addition, the N-terminal and C-terminal residues of a protein or peptide chain usu- 
ally exist in ionized states and carry positive or negative charges, respectively. All of 
these may experience ionic interactions in a protein structure. Charged residues are 
normally located on the protein surface, where they may interact optimally with the 
water solvent. It is energetically unfavorable for an ionized residue to be located in 
the hydrophobic core of the protein. Ionic interactions between charged groups on 
a protein surface are often complicated by the presence of salts in the solution. For 
example, the ability of a positively charged lysine to attract a nearby negative gluta- 
mate may be weakened by dissolved salts such as NaCl (Figure 6.1). The Na” and 
Cl” ions are highly mobile, compact units of charge, compared to the amino acid 
side chains, and thus compete effectively for charged sites on the protein. In this 
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manner, ionic interactions among amino acid residues on protein surfaces may be 
damped out by high concentrations of salts. Nevertheless, these interactions are im- 
portant for protein stability. 


Van der Waals Interactions Are Ubiquitous 


Both attractive forces and repulsive forces are included in van der Waals interactions. 
The attractive forces are due primarily to instantaneous dipole-induced dipole inter- 
actions that arise because of fluctuations in the electron charge distributions of adja- 
cent nonbonded atoms. Individual van der Waals interactions are weak ones (with sta- 
bilization energies of 0.4 to 4.0 kJ/mol), but many such interactions occur in a typical 
protein, and by sheer force of numbers, they can represent a significant contribution 
to the stability of a protein. Peter Privalov and George Makhatadze have shown that, 
for pancreatic ribonuclease A, hen egg white lysozyme, horse heart cytochrome c, and 
sperm whale myoglobin, van der Waals interactions between tightly packed groups in 
the interior of the protein are a major contribution to protein stability. 


6:28 What Role Does the Amino Acid Sequence Play 
in Protein Structure? 


It can be inferred from the first section of this chapter that many different forces 
work together in a delicate balance to determine the overall three-dimensional struc- 
ture of a protein. These forces operate both within the protein structure itself and 
between the protein and the water solvent. How, then, does nature dictate the man- 
ner of protein folding to generate the three-dimensional structure that optimizes 
and balances these many forces? All of the information necessary for folding the peptide 
chain into its “native” structure is contained in the amino acid sequence of the peptide. 

Just how proteins recognize and interpret the information that is stored in the 
amino acid sequence is not yet well understood. Certain loci along the peptide 
chain may act as nucleation points, which initiate folding processes that eventually 
lead to the correct structures. Regardless of how this process operates, it must take 
the protein correctly to the final native structure. Along the way, local energy- 
minimum states different from the native state itself must be avoided. A long-range 
goal of many researchers in the protein structure field is the prediction of three- 
dimensional conformation from the amino acid sequence. As the details of sec- 
ondary and tertiary structure are described in this chapter, the complexity and 
immensity of such a prediction will be more fully appreciated. This area is one of 
the greatest uncharted frontiers remaining in molecular biology. 


6:38 What Are the Elements of Secondary Structure 
in Proteins, and How Are They Formed? 


Any discussion of protein folding and structure must begin with the peptide bond, the 
fundamental structural unit in all proteins. As we saw in Chapter 4, the resonance 
structures experienced by a peptide bond constrain six atoms—the oxygen, carbon, 
nitrogen, and hydrogen atoms of the peptide group, as well as the adjacent 
a-carbons—to lie in a plane. The resonance stabilization energy of this planar struc- 
ture is approximately 88 kJ/mol, and substantial energy is required to twist the 
structure about the C—N bond. A twist of 0 degrees involves a twist energy of 
88 sin?@ kJ/mol. 


All Protein Structure Is Based on the Amide Plane 


The planarity of the peptide bond means that there are only two degrees of free- 
dom per residue for the peptide chain. Rotation is allowed about the bond linking 
the a-carbon and the carbon of the peptide bond and also about the bond linking 
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the nitrogen of the peptide bond and the adjacent a-carbon. As shown in Figure 
6.2, each a-carbon is the joining point for two planes defined by peptide bonds. The 
angle about the C,—N bond is denoted by the Greek letter ¢ (phi), and that about 
the C,—C, is denoted by y (psi). For either of these bond angles, a value of 0° cor- 
responds to an orientation with the amide plane bisecting the H—C,—R (side- 
chain) angle and a cis conformation of the main chain around the rotating bond in 
question (Figure 6.3). 

The entire path of the peptide backbone in a protein is known if the @ and 
y rotation angles are all specified. Some values of ġ and y are not allowed due to 
steric interference between nonbonded atoms. As shown in Figure 6.3, values of 
ġ = 180° and y = 0° are not allowed because of the forbidden overlap of the N—H 
hydrogens. Similarly, $ = 0° and y = 180° are forbidden because of unfavorable 
overlap between the carbonyl oxygens. 

G. N. Ramachandran and his co-workers in Madras, India, demonstrated that it was 
convenient to plot ¢ values against y values to show the distribution of allowed values 
in a protein or in a family of proteins. A typical Ramachandran plot is shown in Fig- 
ure 6.4. Note the clustering of ¢ and w values in a few regions of the plot. Most com- 
binations of h and y are sterically forbidden, and the corresponding regions of the 
Ramachandran plot are sparsely populated. The combinations that are sterically al- 
lowed represent the subclasses of structure described in the remainder of this section. 


The Alpha-Helix Is a Key Secondary Structure 


As noted in Chapter 5, the term secondary structure describes local conformations of 
the polypeptide that are stabilized by hydrogen bonds. In nearly all proteins, the 
hydrogen bonds that make up secondary structures involve the amide proton of 
one peptide group and the carbonyl oxygen of another, as shown in Figure 6.5. 
These structures tend to form in cooperative fashion and involve substantial por- 
tions of the peptide chain. When a number of hydrogen bonds form between por- 
tions of the peptide chain in this manner, two basic types of structures can result: 
a-helices and B-pleated sheets. 
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(D) 


4 a-Carbon 


Side group 


Amide plane 


= 180°,  =180° 


FIGURE 6.2 The amide or peptide bond planes are 
joined by the tetrahedral bonds of the a-carbon. The 
rotation parameters are @ and wy. The conformation 
shown corresponds to @ = 180° and y = 180°. Note 
hat positive values of @ and correspond to clockwise 
rotation as viewed from Ca. Starting from 0°, a rotation 
of 180° in the clockwise direction (+ 180°) is equivalent 
o a rotation of 180° in the counterclockwise direction 
(—180°). (Illustration: Irving Geis. Rights owned by Howard 





Hughes Medical Institute. Not to be reproduced without 
permission.) 
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from the side chain 


CENGAGENOW” ACTIVE FIGURE 6.3 Many of the possible conformations about an a-carbon between two 
peptide planes are forbidden because of steric crowding. Several noteworthy examples are shown here. 

Note: The formal |UPAC-IUB Commission on Biochemical Nomenclature convention for the definition of the 
torsion angles œ and w in a polypeptide chain (Biochemistry 9:3471-3479, 1970) is different from that used here, where 
the C, atom serves as the point of reference for both rotations, but the result is the same. (Illustration: Irving Geis. Rights 
owned by Howard Hughes Medical Institute. Not to be reproduced without permission.) Test yourself on the concepts in 
this figure at www.cengage.com/login. 


group as far as possible 


= 0°, b= 0° 
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CENGAGENOW” ACTIVE FIGURE 6.4 A Ramachandran 
diagram showing the sterically reasonable values of the 
angles œ and ws. The shaded regions indicate particularly 
favorable values of these angles. Dots in purple indicate 
actual angles measured for 1000 residues (excluding 
glycine, for which a wider range of angles is permitted) 
in eight proteins. The lines running across the diagram 
(numbered +5 through 2 and —5 through —3) signify 
the number of amino acid residues per turn of the helix; 
“+"means right-handed helices; “—" means left-handed 
helices. (After Richardson, J. S., 1981. The anatomy and taxono- 
my of protein structure. Advances in Protein Chemistry 
34:167-339.) Test yourself on the concepts in this 
figure at www.cengage.com/login. 
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Knowing What the Right Hand and Left Hand Are Doing 


Certain conventions related to peptide bond angles and the “hand- 
edness” of biological structures are useful in any discussion of pro- 
tein structure. To determine the @ and y angles between peptide 
planes, viewers should imagine themselves at the C, carbon looking 
outward and should imagine starting from the ¢ = 0°, y = 0° con- 
formation. From this perspective, positive values of ¢ correspond to 
clockwise rotations about the C,—N bond of the plane that includes 
the adjacent N—H group. Similarly, positive values of y correspond 


to clockwise rotations about the C,—C bond of the plane that in- 
cludes the adjacent C=O group. 

Biological structures are often said to exhibit “right-hand” or 
“left-hand” twists. For all such structures, the sense of the twist can 
be ascertained by holding the structure in front of you and looking 
along the polymer backbone. If the twist is clockwise as one pro- 
ceeds outward and through the structure, it is said to be right- 
handed. If the twist is counterclockwise, it is said to be left-handed. 
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FIGURE 6.5 A hydrogen bond between the backbone C=O of Ala!?! and 
the backbone N—H of Ser'“” in the acetylcholine-binding protein of a snail, 
Lymnaea stagnalis (odb id = 1198). 





The earliest studies of protein secondary structure were those of William Ast- 
bury at the University of Leeds. Astbury carried out X-ray diffraction studies on 
wool and observed differences between unstretched wool fibers and stretched 
wool fibers. He proposed that the protein structure in unstretched fibers was a he- 
lix (which he called the alpha form). He also proposed that stretching caused the 
helical structures to uncoil, yielding an extended structure (which he called the 
beta form). Astbury was the first to propose that hydrogen bonds between peptide 
groups contributed to stabilizing these structures. 

In 1951, Linus Pauling, Robert Corey, and their colleagues at the California In- 
stitute of Technology summarized a large volume of crystallographic data in a set 
of dimensions for polypeptide chains. (A summary of data similar to what they 
reported is shown in Figure 4.15.) With these data in hand, Pauling, Corey, and 
their colleagues proposed a new model for a helical structure in proteins, which 
they called the a-helix. The report from Caltech was of particular interest to Max 
Perutz in Cambridge, England, a crystallographer who was also interested in pro- 
tein structure. By taking into account a critical but previously ignored feature of 
the X-ray data, Perutz realized that the a-helix existed in keratin, a protein from 
hair, and also in several other proteins. Since then, the a-helix has proved to be a 
fundamentally important peptide structure. Several representations of the a-helix 
are shown in Figure 6.6. One turn of the helix represents 3.6 amino acid residues. 
(A single turn of the a-helix involves 13 atoms from the O to the H of the H bond. 
For this reason, the a-helix is sometimes referred to as the 3.61 helix.) This is in 
fact the feature that most confused crystallographers before the Pauling and 
Corey a-helix model. Crystallographers were so accustomed to finding twofold, 
threefold, sixfold, and similar integral axes in simpler molecules that the notion 
of a nonintegral number of units per turn was never taken seriously before Paul- | CENGAGENOW" Go to CengageNOW at www 
ing and Corey’s work. .cengage.com/login and click BiochemistryInteractive 

Each amino acid residue extends 1.5 A (0.15 nm) along the helix axis. With se ene anatomy: thea hèl 
3.6 residues per turn, this amounts to 3.6 X 1.5 Å or 5.4 Å (0.54 nm) of travel along 
the helix axis per turn. This is referred to as the translation distance or the pitch of 
the helix. If one ignores side chains, the helix is about 6 Å in diameter. The side 
chains, extending outward from the core structure of the helix, are removed from 


140 Chapter6 Proteins: Secondary, Tertiary, and Quaternary Structure 





Side group 
(a) (b) (d) 
Hydrogen bonds stabilize The helix can be viewed 
the helix structure. as a stacked array of 
peptide planes hinged 


at the o-carbons and 
approximately parallel 
to the helix. 


FIGURE 6.6 Four different graphic representations of the a-helix. (a) A stick representation with H bonds as dotted 
lines, as originally conceptualized in Pauling’s 1960 The Nature of the Chemical Bond. (b) Showing the arrangement 
of peptide planes in the helix. (Illustration: Irving Geis. Rights owned by Howard Hughes Medical Institute. Not to be repro- 
duced without permission.) (c) A space-filling computer graphic presentation. (d) A “ribbon structure” with an inlaid 
stick figure, showing how the ribbon indicates the path of the polypeptide backbone. 


steric interference with the polypeptide backbone. As can be seen in Figure 6.6, 
each peptide carbonyl is hydrogen bonded to the peptide N—H group four residues farther up 
the chain. Note that all of the H bonds lie parallel to the helix axis and all of the car- 
bonyl groups are pointing in one direction along the helix axis while the N—H 
groups are pointing in the opposite direction. Recall that the entire path of the pep- 
tide backbone can be known if the @ and yf twist angles are specified for each 
residue. The a-helix is formed if the values of ¢ are approximately —60° and the val- 
ues of if are in the range of —45 to —50°. Figure 6.7 shows the structures of two pro- 
teins that contain a-helical segments. The number of residues involved in a given 
a-helix varies from helix to helix and from protein to protein. On average, there are 
about 10 residues per helix. Myoglobin, one of the first proteins in which a-helices 
were observed, has eight stretches of a-helix that form a box to contain the heme 
prosthetic group (see Figure 5.1). 

As shown in Figure 6.6, all of the hydrogen bonds point in the same direction 
along the a-helix axis. Each peptide bond possesses a dipole moment that arises 
from the polarities of the N—H and C=O groups, and because these groups are all 
aligned along the helix axis, the helix itself has a substantial dipole moment, with a 
partial positive charge at the N-terminus and a partial negative charge at the 
C-terminus (Figure 6.8). Negatively charged ligands (e.g., phosphates) frequently 
bind to proteins near the N-terminus of an a-helix. By contrast, positively charged 
ligands are only rarely found to bind near the C-terminus of an a-helix. 

In a typical a-helix of 12 (or n) residues, there are 8 (or n — 4) hydrogen bonds. 
As shown in Figure 6.9, the first 4 amide hydrogens and the last 4 carbonyl oxygens 
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B-Hemoglobin subunit Myohemerythrin 
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FIGURE 6.8 The arrangement of N—H and C=O 
groups (each with an individual dipole moment) 
along the helix axis creates a large net dipole for the 
helix. Numbers indicate fractional charges on re- 
spective atoms. 


CENGAGENOW” ANIMATED FIGURE 6.7 The three- 
dimensional structures of two proteins that contain sub- 
stantial amounts of a-helix in their structures. The helices 
are represented by the regularly coiled sections of the 
ribbon drawings. Myohemerythrin is the oxygen-carrying 
protein in certain invertebrates, including Sipunculids, a 
phylum of marine worm. B-Hemoglobin subunit: pdb 

id = 1HGA; myohemerythrin pdb id = 1A7D. (Jane 
Richardson.) See this figure animated at www.cengage 
.com/login. 






rane eee ((((( 





YS 





VV) dra eee (( 





3.6 
residues 





Petele 


ie) 
& 














FIGURE 6.9 Four N—H groups at the N-terminal end of 
an a-helix and four C=O groups at the C-terminal end 
lack partners for H-bond formation. The formation of 

H bonds with other nearby donor and acceptor groups 
is referred to as helix capping. Capping may also in- 
volve appropriate hydrophobic interactions that accom- 
modate nonpolar side chains at the ends of helical 
segments. 
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Amino Acid Helix Behavior* 

A Ala H 1) 

C Cys Variable 

D Asp Variable 

E Glu H 

F Phe H 

G Gly I (B) 

H His H (D 

I Ile H (C) 

K Lys Variable 

L Leu H 

M Met H 

N Asn C (I) 

P Pro B 

Q Gln H (1) 

R Arg H (1) 

S Ser C (B) 

T Thr Variable 

V Val Variable 

W Trp H (©) 

Y Tyr H (©) 
*H = helix former; | = indifferent; B = helix breaker; C = random coil; ( ) = secondary tendency. 


cannot participate in helix H bonds. Also, nonpolar residues situated near the he- 
lix termini can be exposed to solvent. Proteins frequently compensate for these 
problems by helix capping—providing H-bond partners for the otherwise bare 
N—H and C=O groups and folding other parts of the protein to foster hydropho- 
bic contacts with exposed nonpolar residues at the helix termini. 

Careful studies of the polyamino acids, polymers in which all the amino acids 
are identical, have shown that certain amino acids tend to occur in a-helices, 
whereas others are less likely to be found in them. Polyleucine and polyalanine, 
for example, readily form a-helical structures. In contrast, polyaspartic acid and 
polyglutamic acid, which are highly negatively charged at pH 7.0, form only ran- 
dom structures because of strong charge repulsion between the R groups along 
the peptide chain. At pH 1.5 to 2.5, however, where the side chains are protonated 
and thus uncharged, these latter species spontaneously form a-helical structures. 
In similar fashion, polylysine is a random coil at pH values below about 11, where 
repulsion of positive charges prevents helix formation. At pH 12, where polylysine 
is a neutral peptide chain, it readily forms an a-helix. 

The tendencies of various amino acids to stabilize or destabilize a-helices are dif- 
ferent in typical proteins than in polyamino acids. The occurrence of the common 
amino acids in helices is summarized in Table 6.1. Notably, proline (and hydrox- 
yproline) act as helix breakers due to their unique structure, which fixes the value 
of the C,—N—C bond angle. Helices can be formed from either D- or L-amino 
acids, but a given helix must be composed entirely of amino acids of one configu- 
ration. a-Helices cannot be formed from a mixed copolymer of D- and L-amino 
acids. An a-helix composed of D-amino acids is left-handed. 


The -Pleated Sheet Is a Core Structure in Proteins 


Another type of structure commonly observed in proteins also forms because of 
local, cooperative formation of hydrogen bonds. That is the pleated sheet, or 
f-structure, often called the B-pleated sheet. This structure was also first postulated 
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CRITICAL DEVELOPMENTS IN BIOCHEMISTRY 





In Bed with a Cold, Pauling Stumbles onto the a-Helix and a Nobel Prize* 


As high technology continues to transform the modern biochem- —putting in lengths and bond angles from memory....He knew that 
ical laboratory, it is interesting to reflect on Linus Pauling’s dis- the peptide bond, at the carbon-to-nitrogen link, was always rigid: 
covery of the a-helix. It involved only a piece of paper, a pencil, 

scissors, and a sick Linus Pauling, who had tired of reading detec- (b) 


tive novels. The story is told in the excellent book The Eighth Day 
of Creation by Horace Freeland Judson: 
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From the spring of 1948 through the spring of 1951...rivalry sput- aC N C aC N C aC 
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Bragg’s—over protein. The prize was to propose and verify in nature 
a general three-dimensional structure for the polypeptide chain. 
Pauling was working up from the simpler structures of components. 
In January 1948, he went to Oxford as a visiting professor for two 
terms, to lecture on the chemical bond and on molecular structure 
and biological specificity. “In Oxford, it was April, I believe, I caught 
cold. I went to bed, and read detective stories for a day, and got 
bored, and thought why don’t I have a crack at that problem of 
alpha keratin.” Confined, and still fingering the polypeptide chain 
in his mind, Pauling called for paper, pencil, and straightedge and 
attempted to reduce the problem to an almost Euclidean purity. “I 
took a sheet of paper—I still have this sheet of paper—and drew, 
rather roughly, the way that I thought a polypeptide chain would 
look if it were spread out into a plane.” The repetitious herringbone 
of the chain he could stretch across the paper as simply as this— 


And this meant that the chain could turn corners only at the alpha 
carbons....“I creased the paper in parallel creases through the alpha 
carbon atoms, so that I could bend it and make the bonds to the 
alpha carbons, along the chain, have tetrahedral value. And then I 
looked to see if I could form hydrogen bonds from one part of the 
chain to the next.” He saw that if he folded the strip like a chain of 
paper dolls into a helix, and if he got the pitch of the screw right, 
hydrogen bonds could be shown to form, N—H-::‘O—G, three or 
four knuckles apart along the backbone, holding the helix in shape. 
After several tries, changing the angle of the parallel creases in order 
to adjust the pitch of the helix, he found one where the hydrogen 
bonds would drop into place, connecting the turns, as straight lines 
of the right length. He had a model. 


(a) 


| *The discovery of the a-helix structure was only one of many achievements 
that led to Pauling’s Nobel Prize in Chemistry in 1954. The official citation 
for the prize was “for his research into the nature of the chemical bond 
| and its application to the elucidation of the structure of complex 
H substances.” 








by Pauling and Corey in 1951 and has now been observed in many natural proteins. | CENGAGENOW” Go to CengageNOW at www 
A B-pleated sheet can be visualized by laying thin, pleated strips of paper side by side —_-¢"9age.com/login and click Biochemistryinteractive 
to make a “pleated sheet” of paper (Figure 6.10). Each strip of paper can then be ii Sra ce tSPrincipal ac 
r > ondary structure in proteins. 
pictured as a single peptide strand in which the peptide backbone makes a zigzag 
pattern along the strip, with the a-carbons lying at the folds of the pleats. The 
pleated sheet can exist in both parallel and antiparallel forms. In the parallel 
B-pleated sheet, adjacent chains run in the same direction. In the antiparallel B- 
pleated sheet, adjacent strands run in opposite directions. 
Each single strand of the 6-sheet structure can be pictured as a twofold helix, 
that is, a helix with two residues per turn. The arrangement of successive amide 
planes has a pleated appearance due to the tetrahedral nature of the C, atom. It 
is important to note that the hydrogen bonds in this structure are essentially in- 
terstrand rather than intrastrand. Optimum formation of H bonds in the parallel 
pleated sheet results in a slightly less extended conformation than in the anti- 
parallel sheet. The H bonds thus formed in the parallel 6-sheet are bent signifi- 
cantly. The distance between residues is 0.347 nm for the antiparallel pleated 
sheet, but only 0.325 nm for the parallel pleated sheet. Note that the side chains 
in the pleated sheet are oriented perpendicular or normal to the plane of the 
sheet, extending out from the plane on alternating sides. 
Parallel B-sheets tend to be more regular than antiparallel 6-sheets. As can be 
seen in Figure 6.4, typical ¢,w values for a parallel -sheet are ¢ = —120°, y = 105°, 
and typical values for an anti-parallel 6-sheet are @¢ = —135°, y = 140°. However, 
the range of ¢ and y angles for the peptide bonds in parallel sheets is much smaller 
than that for antiparallel sheets. Parallel sheets are typically large structures; those 
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FIGURE 6.10 A “pleated sheet" of paper with an antiparallel B-sheet drawn on it. (Illustration: Irving Geis. Rights 
owned by Howard Hughes Medical Institute. Not to be reproduced without permission.) 


composed of less than five strands are rare. Antiparallel sheets, however, may con- 
sist of as few as two strands. Parallel sheets characteristically distribute hydrophobic 
side chains on both sides of the sheet, whereas antiparallel sheets are usually 
arranged with all their hydrophobic residues on one side of the sheet. This requires 
an alternation of hydrophilic and hydrophobic residues in the primary structure of 
peptides involved in antiparallel -sheets because every other side chain projects to 
the same side of the sheet (Figure 6.10). 

Antiparallel pleated sheets are the fundamental structure found in the fabric we 
know as silk, with the polypeptide chains forming the sheets running parallel to the 
silk fibers. The silk fibers thus formed have properties consistent with those of the 
B-sheets that form them. They are quite flexible but cannot be stretched or 
extended to any appreciable degree. 


Helix-Sheet Composites in Spider Silk 


Although the intricate designs of spider webs are eye- (and fly-) catching, it might 
be argued that the composition of web silk itself is even more remarkable. Spider 
silk (a form of keratin) is synthesized in special glands in the spider’s abdomen. The 
silk strands produced by these glands are both strong and elastic. Dragline silk (that 
from which the spider hangs) has a tensile strength of 200,000 psi (pounds per 
square inch)—stronger than steel and similar to Kevlar, the synthetic material used 
in bulletproof vests! This same silk fiber is also flexible enough to withstand strong 
winds and other natural stresses. This combination of strength and flexibility de- 
rives from the composite nature of spider silk. As keratin protein is extruded from the 
spider’s glands, it endures shearing forces that break the H bonds stabilizing keratin 
a-helices (Figure 6.11). These regions then form microcrystalline arrays of 6-sheets. 
These microcrystals are surrounded by the keratin strands, which adopt a highly 
disordered state composed of a-helices and random coil structures. The -sheet mi- 
crocrystals contribute strength, and the disordered array of helix and coil make the 
silk strand flexible. The resulting silk strand resembles modern human-engineered 
composite materials. Certain tennis racquets, for example, consist of fiberglass poly- 
mers impregnated with microcrystalline graphite. The fiberglass provides flexibility, 
and the graphite crystals contribute strength. 
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(a) Spider web (b) Radial strand 





(c) Ordered B-sheets surrounded 
by disordered a-helices and 
B-bends. 


FIGURE 6.11 Spider web silks are composites of a-helices and B-sheets. The radial strands of webs must be 
strong and rigid; silks in radial strands contain a higher percentage of B-sheets. The circumferential strands 
(termed capture silk) must be flexible (to absorb the impact of flying insects); capture silk contains a higher 
percentage of a-helices. Spiders typically have several different silk gland spinnerets, which secrete different 
silks of differing composition. Spiders have inhabited the earth for about 470 million years. 


-Turns Allow the Protein Strand to Change Direction 


Most proteins are globular structures. The polypeptide chain must therefore pos- 
sess the capacity to bend, turn, and reorient itself to produce the required compact, 
globular structures. A simple structure observed in many proteins is the B-turn (also 
known as the tight turn or B-bend), in which the peptide chain forms a tight loop with 
the carbonyl oxygen of one residue hydrogen bonded with the amide proton of the 
residue three positions down the chain. This H bond makes the -turn a relatively 
stable structure. As shown in Figure 6.12, the 6-turn allows the protein to reverse 
the direction of its peptide chain. This figure shows the two major types of B-turns, 
but a number of less common types are also found in protein structures. Because it 
lacks a side chain, glycine is sterically the most adaptable of the amino acids, and it 
accommodates conveniently to other steric constraints in the 6-turn. Proline, how- 

















(d) B-sheets impart strength and 
a-helices impart flexibility 
to the strand. 


CENGAGENOW” Go to CengageNOW at www 
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to discover the features of 8-turns and how they 
change the course of a polypeptide strand. 





FIGURE 6.12 The structures of two kinds of B-turns (also called tight turns or B-bends). Four residues are re- 
quired to form a B-turn. Left: Type |; right: Type II. (Illustration: Irving Geis. Rights owned by Howard Hughes Medical Insti- 
tute. Not to be reproduced without permission.) 
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ever, has a cyclic structure and a fixed ¢ angle, so, to some extent, it forces the for- 
mation of a 6-turn; in many cases this facilitates the turning of a polypeptide chain 
upon itself. Such bends promote formation of antiparallel 6-pleated sheets. Type I 
turns (Figure 6.12) are more common than type II. Proline fits best in the 3 posi- 
tion of the type I turn. In the type II turn, proline is preferred at the 2 position, 
whereas the 3 position prefers glycine or small polar residues. 


6:4: How Do Polypeptides Fold into Three-Dimensional 
Protein Structures? 


The arrangement of all atoms of a single polypeptide chain in three-dimensional 
space is referred to as its tertiary structure. As discussed in Section 6.2 all of the in- 
formation needed to fold the protein into its native tertiary structure is contained 
within the primary structure of the peptide chain itself. Sometimes proteins known as 
chaperones assist in the process of protein folding in the cell, but proteins in dilute 
solution can be unfolded and refolded without the assistance of such chaperones. 

The first determinations of the tertiary structure of a protein were by John Kendrew 
and Max Perutz. Kendrew’s structure of myoglobin and Perutz’s structure of hemo- 
globin, reported in the late 1950s, were each the result of more than 20 years of work. 
Ever since these first protein structures were elucidated, biochemists have sought to 
understand the principles by which proteins adopt their remarkable structures. Vigor- 
ous work in many laboratories has slowly brought important principles to light: 


e Secondary structures—helices and sheets—form whenever possible as a conse- 
quence of the formation of large numbers of hydrogen bonds. 

e a-Helices and £-sheets often associate and pack close together in the protein. No 
protein is stable as a single-layer structure, for reasons that become apparent 
later. There are a few common methods for such packing to occur. 

e Because the peptide segments between secondary structures in the protein tend 
to be short and direct, the peptide does not execute complicated twists and knots 
as it moves from one region of secondary structure to another. 

e Proteins generally fold so as to form the most stable structures possible. The sta- 
bility of most proteins arises from (1) the formation of large numbers of in- 
tramolecular hydrogen bonds and (2) the reduction in the surface area accessi- 
ble to solvent that occurs upon folding. 


Two factors lie at the heart of these four “principles.” First, proteins are typically 
a mixture of hydrophilic and hydrophobic amino acids. Why is this important? 
Imagine a protein that is composed only of polar and charged amino acids. In such 
a protein, every side chain could hydrogen bond to water. This would leave no rea- 
son for the protein to form a compact, folded structure. 

Now consider a protein composed of a mixture of hydrophilic and hydrophobic 
residues. In this case, the hydrophobic side chains cannot form H bonds with water, 
and their presence will disrupt the hydrogen-bonding structure of water itself. To 
minimize this, the hydrophobic groups will tend to cluster together. This hydrophobic effect in- 
duces formation of a compact structure—the folded protein. 

A potential problem with this rather simple folding model is that polar backbone 
N—H and C=O groups on the hydrophobic residues accompany the hydrophobic 
side chains into the folded protein interior. This would be energetically costly to the 
protein, but the actual result is that the polar backbone groups form H bonds with 
one another, so that the polar backbone N—H and C=O moieties are stabilized in 
a-helices and -sheets in the protein interior. 


Fibrous Proteins Usually Play a Structural Role 


In Chapter 5, we saw that proteins can be grouped into three large classes based on 
their structure and solubility: fibrous proteins, globular proteins, and membrane proteins. 
Fibrous proteins contain polypeptide chains organized approximately parallel 
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along a single axis, producing long fibers or large sheets. Such proteins tend to be 
mechanically strong and resistant to solubilization in water and dilute salt solutions. 
Fibrous proteins often play a structural role in nature (see Chapter 5). 


a-Keratin The a-keratins are the predominant constituents of claws, fingernails, 
hair, and horns in mammals. As their name suggests, the structure of the a-keratins 
is dominated by a-helical segments of polypeptide. The amino acid sequence of 
a-keratin subunits is composed of central a-helix-rich rod domains about 311 to 
314 residues in length, flanked by nonhelical N- and C-terminal domains of varying 
size and composition (Figure 6.13a). The structure of the central rod domain of a 
typical a-keratin is shown in Figure 6.13b. Pairs of right-handed a-helices wrap 
around each other to form a left-twisted coiled coil. X-ray diffraction data (includ- 
ing the original studies by William Astbury) show that these structures resemble 
a-helices, but with a pitch of 0.51 nm rather than the expected 0.54 nm. This is con- 
sistent with a tilt of the helix relative to the long axis of the coiled coil (and the ker- 
atin fiber) in Figure 6.13. 

The amino acid sequence of the central rod segments of a-keratin consists of 
quasi-repeating 7-residue segments of the form (a-b-c-d-e-fg),. These units are not 
true repeats, but residues a and d are usually nonpolar amino acids. In a-helices, 
with 3.6 residues per turn, these nonpolar residues are arranged in an inclined row 
or stripe that twists around the helix axis (Figure 6.13c). These nonpolar residues 
would make the helix highly unstable if they were exposed to solvent, but the asso- 
ciation of hydrophobic stripes on two a-helices to form the two-stranded coiled coil 
effectively buries the hydrophobic residues and forms a highly stable structure 
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FIGURE 6.13 (a) Both type | and type II œ-keratin molecules have sequences consisting of long, central rod do- 
mains with terminal cap domains. The numbers of amino acid residues in each domain are indicated. Asterisks 
denote domains of variable length. (b) The rod domains form coiled coils consisting of left-twisted right- 
handed a-helices. These coiled coils form protofilaments that then wind around each other in a right-handed 
twist. Keratin filaments consist of twisted protofibrils (each a bundle of four coiled coils). (c) Periodicity of hy- 
drophobic residues. (Adapted from Steinert, P, and Parry, D., 1985. Intermediate filaments: Conformity and diversity of expres- 
sion and structure. Annual Review of Cell Biology 1:41-65; and Cohlberg, J., 1993. Textbook error: The structure of alpha-keratin. 
Trends in Biochemical Sciences 18:360-362.) 
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A DEEPER LOOK 


The Coiled-Coil Motif in Proteins 


The coiled-coil motif was first identified in 1953 by Linus Pauling, 
Robert Corey, and Francis Crick as the main structural element of 
fibrous proteins such as keratin and myosin. Since then, many pro- 
teins have been found to contain one or more coiled-coil seg- 
ments or domains. A coiled coil is a bundle of a-helices that are 
wound into a superhelix. Two, three, or four helical segments may 
be found in the bundle, and they may be arranged parallel or an- 
tiparallel to one another. Coiled coils are characterized by a dis- 
tinctive and regular packing of side chains in the core of the bun- 
dle. This regular meshing of side chains requires that they occupy 


(a) Coiled coil 


equivalent positions turn after turn. This is not possible for undis- 
torted a-helices, which have 3.6 residues per turn. The positions of 
side chains on their surface shift continuously along the helix sur- 
face (see Figure 6.13c). However, giving the right-handed a-helix 
a left-handed twist reduces the number of residues per turn to 3.5, 
and because 3.5 X 2 = 7.0, the positions of the side chains repeat 
after two turns (7 residues). Thus, a heptad repeat pattern in the 
peptide sequence is diagnostic of a coiled-coil structure. The fig- 
ure shows a sampling of coiled-coil structures (highlighted in 
color) in various proteins. 
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(Figure 6.13). The helices clearly sacrifice some stability in assuming this twisted 
conformation, but they gain stabilization energy from the packing of side chains be- 
tween the helices. In other forms of keratin, covalent disulfide bonds form between 
cysteine residues of adjacent molecules, making the overall structure even more 
rigid, inextensible, and insoluble—important properties for structures such as 
claws, fingernails, hair, and horns. How and where these disulfides form determines 
the amount of curling in hair and wool fibers. When a hairstylist creates a perma- 
nent wave (simply called a “permanent”) in a hair salon, disulfides in the hair are 
first reduced and cleaved, then reorganized and reoxidized to change the degree of 
curl or wave. In contrast, a “set” that is created by wetting the hair, setting it with 
curlers, and then drying it represents merely a rearrangement of the hydrogen 
bonds between helices and between fibers. (On humid or rainy days, the hydrogen 
bonds in curled hair may rearrange, and the hair becomes “frizzy.”) 


Fibroin and 6-Keratin: B-Sheet Proteins The fibroin proteins found in silk fibers 
in the cocoons of the silkworm, Bombyx mori, and also in spiderwebs represent an- 
other type of fibrous protein. These are composed of stacked antiparallel 6-sheets, 
as shown in Figure 6.14. In the polypeptide sequence of silk proteins, there are 
large stretches in which every other residue is a glycine. As previously mentioned, 
the residues of a -sheet extend alternately above and below the plane of the sheet. 
As a result, the glycines all end up on one side of the sheet and the other residues 
(mainly alanines and serines) compose the opposite surface of the sheet. Pairs of 
B-sheets can then pack snugly together (glycine surface to glycine surface or 
alanine-serine surface to alanine—serine surface). The B-keratins found in bird 
feathers are also made up of stacked f-sheets. 





FIGURE 6.14 Silk fibroin consists of a unique stacked array of B-sheets. The primary structure of fibroin molecules 
consists of long stretches of alternating glycine and alanine or serine residues. When the sheets stack, the more 
bulky alanine and serine residues on one side of a sheet interdigitate with similar residues on an adjoining sheet. 
Glycine hydrogens on the alternating faces interdigitate in a similar manner, but with a smaller intersheet spacing. 
(Illustration: Irving Geis. Rights owned by Howard Hughes Medical Institute. Not to be reproduced without permission.) 
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Collagen: A Triple Helix Collagen is a rigid, inextensible fibrous protein that is a 
principal constituent of connective tissue in animals, including tendons, cartilage, 
bones, teeth, skin, and blood vessels. The high tensile strength of collagen fibers in 
these structures makes possible the various animal activities such as running and 
jumping that put severe stresses on joints and skeleton. Broken bones and tendon 
and cartilage injuries to knees, elbows, and other joints involve tears or hyperex- 
tensions of the collagen matrix in these tissues. 

The basic structural unit of collagen is tropocollagen, which has a molecular 
weight of 285,000 and consists of three intertwined polypeptide chains, each about 
1000 amino acids in length. Tropocollagen molecules are about 300 nm long and 
only about 1.4 nm in diameter. Several kinds of collagen have been identified. Type 
I collagen, which is the most common, consists of two identical peptide chains des- 
ignated a1(I) and one different chain designated a2(I). Type I collagen predomi- 
nates in bones, tendons, and skin. Type I collagen, found in cartilage, and type III col- 
lagen, found in blood vessels, consist of three identical polypeptide chains. 

Collagen has an amino acid composition that is unique and is crucial to its three- 
dimensional structure and its characteristic physical properties. Nearly one residue 
out of three is a glycine, and the proline content is also unusually high. Three un- 
usual modified amino acids are also found in collagen: 4-hydroxyproline (Hyp), 
3-hydroxyproline, and 5-hydroxylysine (Hyl) (Figure 4.4). Proline and Hyp together 
compose up to 30% of the residues of collagen. Interestingly, these three amino 
acids are formed from normal proline and lysine after the collagen polypeptides are 
synthesized. The modifications are effected by two enzymes: prolyl hydroxylase and 
lysyl hydroxylase. The prolyl hydroxylase reaction (Figure 6.15) requires molecular 
oxygen, a-ketoglutarate, and ascorbic acid (vitamin C) and is activated by Fe?*. The 
hydroxylation of lysine is similar. 

Because of their high content of glycine, proline, and hydroxyproline, collagen 
fibers are incapable of forming traditional structures such as a-helices and B-sheets. 
Instead, collagen polypeptides intertwine to form a unique right-handed triple helix, 
with each of the three strands arranged in a left-handed helical fashion (Figure 
6.16). Compared to the a-helix, the collagen helix is much more extended, with a 
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FIGURE 6.15 Hydroxylation of proline residues is catalyzed by prolyl hydroxylase. The reaction requires 
a-ketoglutarate and ascorbic acid (vitamin C). 
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rise per residue along the triple helix axis of 2.9 A (versus 1.5 A for the a-helix). 
There are about 3.3 residues per turn of each of these helices. The triple helix is a struc- 
ture that forms to accommodate the unique composition and sequence of collagen. Long 
stretches of the polypeptide sequence are repeats of a Gly-x-y motif, where x is fre- 
quently Pro and y is frequently Pro or Hyp. In the triple helix, every third residue 
faces or contacts the crowded center of the structure. This area is so crowded that 
only Gly can fit, and thus every third residue must be a Gly (as observed). Moreover, 
the triple helix is a staggered structure, such that Gly residues from the three strands 
stack along the center of the triple helix and the Gly from one strand lies adjacent 
to an x residue from the second strand and to a y from the third. This allows the 
N—H of each Gly residue to hydrogen bond with the C=O of the adjacent x residue. 
The triple helix structure is further stabilized and strengthened by the formation of 
interchain H bonds involving hydroxyproline. 

Collagen types I, II, and III form strong, organized fibrils, which consist of stag- 
gered arrays of tropocollagen molecules (Figure 6.17). The periodic arrangement 
of triple helices in a head-to-tail fashion results in banded patterns in electron mi- 
crographs. The banding pattern typically has a periodicity (repeat distance) of 
68 nm. Because collagen triple helices are 300 nm long, 40-nm gaps occur between 
adjacent collagen molecules in a row along the long axis of the fibrils and the pat- 
tern repeats every five rows (5 X 68 nm = 340 nm). The 40-nm gaps are referred 
to as hole regions, and they are important in at least two ways. First, sugars are found 
covalently attached to 5-hydroxylysine residues in the hole regions of collagen 
(Figure 6.18). The occurrence of carbohydrate in the hole region has led to the 
proposal that it plays a role in organizing fibril assembly. Second, the hole regions 
may play a role in bone formation. Bone consists of microcrystals of hydroxyap- 
atite, Ca;(PO4);0H, embedded in a matrix of collagen fibrils. When new bone tis- 
sue forms, the formation of new hydroxyapatite crystals occurs at intervals of 
68 nm. The hole regions of collagen fibrils may be the sites of nucleation for the 
mineralization of bone. 


Packing of collagen molecules 
Qs S S 

Hole zone 2 > g 2 2 
064 S$ 33% 


Dace OE 


NON 


ANON 


Overlap zone 
0.4d 


Z 


. 
RH 
RAH 





J. Gross, Biozentrum/Science Photo Library 


FIGURE 6.17 In the electron microscope, collagen fibers exhibit alternating light and dark bands. The dark bands 
correspond to the 40-nm gaps or“holes” between pairs of aligned collagen triple helices. The repeat distance, d, 
for the light- and dark-banded pattern is 68 nm. The collagen molecule is 300 nm long, which corresponds to 
441d.The molecular repeat pattern of five staggered collagen molecules corresponds to 5d. 
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(Adapted from Miller, M. H., and Scheraga, H. A., 1976. Calculation 
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FIGURE 6.18 A disaccharide of galactose and glucose is 
covalently linked to the 5-hydroxyl group of hydroxy- 
lysines in collagen by the combined action of the en- 
zymes galactosyltransferase and glucosyltransferase. 
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CENGAGENOW” ACTIVE FIGURE 6.19 The three- 
dimensional structure of bovine ribonuclease A (pdb id 
= 1FS3).(a) Ribbon diagram; (b) space-filling model. 
Uane Richardson.) Test yourself on the concepts in this 
figure at www.cengage.com/login 


Globular Proteins Mediate Cellular Function 


Fibrous proteins, although interesting for their structural properties, represent only 
a small percentage of the proteins found in nature. Globular proteins, so named for 
their approximately spherical shape, are far more numerous. The diversity of pro- 
tein structures found in nature reflects the remarkable variety of functions per- 
formed by proteins—binding, catalysis, regulation, transport, immunity, cellular sig- 
naling, and more. The functional diversity and versatility derive in turn from (1) the 
large number of folded structures that polypeptide chains can adopt and (2) the 
varied chemistry of the side chains of the 20 common amino acids. Remarkably, this 
diversity of structure and function derives from a relatively small number of princi- 
ples and themes of protein folding and design. The balance of Chapter 6 explores 
and elaborates these principles and themes. 


Helices and Sheets Make up the Core of Most Globular Proteins 


Globular proteins exist in an enormous variety of three-dimensional structures, but 
nearly all contain substantial amounts of a-helices and 6-sheets folded into a com- 
pact structure that is stabilized by both polar and nonpolar groups. A typical exam- 
ple is bovine ribonuclease A, a small protein (12.6 kD, 124 residues) that contains a 
few short a-helices, a broad section of antiparallel 6-sheet, a few B-turns, and several 
peptide segments without defined secondary structure (Figure 6.19). The space be- 
tween the helices and sheets in the protein interior is filled efficiently and tightly 
with mostly hydrophobic amino acid side chains. Most polar side chains in ribonu- 
clease face the outside of the protein structure and interact with solvent water. With 
its hydrophobic core and a hydrophilic surface, ribonuclease illustrates the typical 
properties of many folded globular proteins. 

The helices and sheets that make up the core of most globular proteins probably 
represent the starting point for protein folding, as shown later in this chapter. Thus, 
the folding of a globular protein, in its simplest conception, could be viewed rea- 
sonably as the condensation of multiple elements of secondary structure. On the 
other hand, most peptide segments that form helices, sheets, or beta turns in pro- 
teins are mostly disordered in small model peptides that contain those amino acid 
sequences. Thus hydrophobic interactions and other noncovalent interactions with 
the rest of the protein must stabilize these relatively unstable helices, sheets, and 
turns in the whole folded protein. 

Why should the cores of most globular and membrane proteins consist almost 
entirely of a-helices and B-sheets? The reason is that the highly polar N—H and 
C=O moieties of the peptide backbone must be neutralized in the hydrophobic 





(a) (b) 
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core of the protein. The extensively H-bonded nature of a-helices and -sheets is 
ideal for this purpose, and these structures effectively stabilize the polar groups of 
the peptide backbone in the protein core. 

The framework of sheets and helices in the interior of a globular protein is typi- 
cally constant and conserved in both sequence and structure. The surface of a glob- 
ular protein is different in several ways. Typically, much of the protein surface is com- 
posed of the loops and tight turns that connect the helices and sheets of the protein 
core, although helices and sheets may also be found on the surface. The result is that 
the surface of a globular protein is often a complex landscape of different structural 
elements. These complex surface structures can interact in certain cases with small 
molecules or even large proteins that have complementary structure or charge (Fig- 
ure 6.20). These regions of complementary, recognizable structure are formed typi- 
cally from the peptide segments that connect elements of secondary structure. They 
are the basis for enzyme-substrate interactions, protein-protein associations in cell 
signaling pathways, and antigen—antibody interactions, and more. 

The segments of the protein that are neither helix, sheet, nor turn have tradi- 
tionally been referred to as coil or random coil. Both of these terms are misleading. 
Most of these “loop” segments are neither coiled nor random, in any sense of the 
words. These structures are every bit as organized and stable as the defined sec- 
ondary structures. They just don’t conform to any frequently recurring pattern. 
These so-called coil structures are strongly influenced by side-chain interactions 
with the rest of the protein. 


Waters on the Protein Surface Stabilize the Structure 


A globular protein’s surface structure also includes water molecules. Many of the 
polar backbone and side chain groups on the surface of a globular protein make 
H bonds with solvent water molecules. There are often several such water molecules 
per amino acid residue, and some are in fixed positions (Figure 6.21). Relatively few 
water molecules are found inside the protein. 

In some globular proteins (Figure 6.22), it is common for one face of an 
a-helix to be exposed to the water solvent, with the other face toward the hydro- 
phobic interior of the protein. The outward face of such an amphiphilic helix con- 
sists mainly of polar and charged residues, whereas the inward face contains mostly 
nonpolar, hydrophobic residues. A good example of such a surface helix is that of 
residues 153 to 166 of flavodoxin from Anabaena (Figure 6.22a). Note that the 
helical wheel presentation of this helix readily shows that one face contains four 
hydrophobic residues and that the other is almost entirely polar and charged. 

Less commonly, an a-helix can be completely buried in the protein interior 
or completely exposed to solvent. Citrate synthase is a dimeric protein in which 
a-helical segments form part of the subunit-subunit interface. As shown in Figure 
6.22b, one of these helices (residues 260 to 270) is highly hydrophobic and contains 
only two polar residues, as would befit a helix in the protein core. On the other hand, 
Figure 6.22c shows the solvent-exposed helix (residues 74 to 87) of calmodulin, which 
consists of 10 charged residues, 2 polar residues, and only 2 nonpolar residues. 


Packing Considerations 


The secondary and tertiary structures of ribonuclease A (Figure 6.19) and other glob- 
ular proteins illustrate the importance of packing in tertiary structures. Secondary 
structures pack closely to one another and also intercalate with (insert between) ex- 
tended polypeptide chains. If the sum of the van der Waals volumes of a protein’s con- 
stituent amino acids is divided by the volume occupied by the protein, packing densi- 
ties of 0.72 to 0.77 are typically obtained. These packing densities are similar to those 
of a collection of solid spheres. This means that even with close packing, approximately 
25% of the total volume of a protein is not occupied by protein atoms. Nearly all of this 
space is in the form of very small cavities. Cavities the size of water molecules or larger 
do occasionally occur, but they make up only a small fraction of the total protein vol- 
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FIGURE 6.20 The surfaces of proteins are complemen- 
tary to the molecules they bind. PEP carboxykinase 
(shown here, pdb id = 1K3D) is an enzyme from the 
metabolic pathway that synthesizes glucose (gluconeo- 
genesis; see Chapter 22). In the so-called “active site” (yel- 
low) of this enzyme, catalysis depends on complemen- 
tary binding of substrates. Shown in this image are ADP 
(brown), a Mg?* ion (blue), and AIF;~ (a phosphate ana- 
log, in green, above the Mg?*). 





FIGURE 6.21 The surfaces of proteins are ideally suited 
to form multiple H bonds with water molecules. Shown 
here are waters (blue and white) associated with actini- 
din,an enzyme from kiwi fruit that cleaves polypeptide 
chains at arginine residues (pdb id = 2ACT).The polar 
backbone atoms and side chain groups on the surface 
of actinidin are extensively H-bonded with water. 


CENGAGENOW” Go to CengageNOW at www 
.cengage.com/login and click BiochemistrylInteractive 
to examine the secondary and tertiary structure of 
ribonuclease. 
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CENGAGENOW” ACTIVE FIGURE 6.22 The so-called helical wheel presentation can reveal the polar or nonpo- 
lar character of a-helices. If the helix is viewed end on, and the residues are numbered with residue 1 closest 
to the viewer, it is easy to see how polar and nonpolar residues are distributed to form a wheel. (a) The a-helix 
consisting of residues 153-166 (red) in flavodoxin from Anabaena is a surface helix and is amphipathic (pdb id 
= 1RCF).(b) The two helices (orange and red) in the interior of the citrate synthase dimer (residues 260-270 in 
each monomer) are mostly hydrophobic (pdb id = 5CSC). (c) The exposed helix (residues 74-87—red) of 
calmodulin is entirely accessible to solvent and consists mainly of polar and charged residues (pdb id = 1CLL). 
Test yourself on the concepts in this figure at www.cengage.com/login 
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Collagen-Related Diseases 


Collagen provides an ideal case study of the molecular basis of 
physiology and disease. For example, the nature and extent of col- 
lagen crosslinking depends on the age and function of the tissue. 
Collagen from young animals is predominantly un-crosslinked and 
can be extracted in soluble form, whereas collagen from older an- 
imals is highly crosslinked and thus insoluble. The loss of flexibility 
of joints with aging is probably due in part to increased crosslink- 
ing of collagen. 

Several serious and debilitating diseases involving collagen ab- 
normalities are known. Lathyrism occurs in animals due to the 
regular consumption of seeds of Lathyrus odoratus, the sweet pea, 
and involves weakening and abnormalities in blood vessels, joints, 
and bones. These conditions are caused by B-aminopropionitrile 
(see figure), which covalently inactivates lysyl oxidase, preventing 
intramolecular crosslinking of collagen and causing abnormalities 
in joints, bones, and blood vessels. 





iS 
N = C— CH; — CH, — NH; 





B-Aminopropionitrile 


Scurvy results from a dietary vitamin C deficiency and in- 
volves the inability to form collagen fibrils properly. This is the 
result of reduced activity of prolyl hydroxylase, which is vitamin 
C-dependent, as previously noted. Scurvy leads to lesions in the 
skin and blood vessels, and in its advanced stages, it can lead to 
grotesque disfiguration and eventual death. Although rare in the 
modern world, it was a disease well known to sea-faring explorers 
in earlier times who did not appreciate the importance of fresh 
fruits and vegetables in the diet. 

A number of rare genetic diseases involve collagen abnormali- 
ties, including Marfan’s syndrome and the Ehlers—Danlos syndromes, 
which result in hyperextensible joints and skin. The formation of 
atherosclerotic plaques, which cause arterial blockages in advanced 
stages, is due in part to the abnormal formation of collagenous 
structures in blood vessels. 





ume. It is likely that such cavities provide flexibility for proteins and facilitate confor- 
mation changes and a wide range of protein dynamics (discussed later). 


Protein Domains Are Nature’s Modular Strategy for Protein Design 


Proteins range in molecular weight from a thousand to more than a million. It is 
tempting to think that the size of unique globular, folded structures would increase 
with molecular weight, but this is not what has been observed. Proteins composed 
of about 250 amino acids or less often have a simple, compact globular shape. How- 
ever, larger globular proteins are usually made up of two or more recognizable and 
distinct structures, termed domains or modules—compact, folded protein struc- 
tures that are usually stable by themselves in aqueous solution. Figure 6.23 shows a 
two-domain DNA-binding protein, TonEBP, in which the two distinct domains are 
joined by a short segment of the peptide chain. Most domains consist of a single 
continuous portion of the protein sequence, but in some proteins the domain se- 
quence is interrupted by a sequence belonging to some other part of the protein 


FIGURE 6.23 Ton-EBP is a DNA-binding protein consist- 
ing of two distinct domains. The N-terminal domain is 
shown here on the right, with DNA (orange) in the 
middle, and the C-terminal domain on the left (pdb id 
= 1IMH). 
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FIGURE 6.24 Malony! CoA:ACP transacylase (pdb id = 
1NM2) is a metabolic enzyme consisting of two subdo- 
mains. The large subdomain (blue) includes residues 
1-132 and 198-316 and consists of a B-sheet sur- 
rounded by 12 a-helices. The small subdomain (gold = 
residues 133-197) consists of a four-stranded antiparal- 
lel B-sheet and two a-helices. 


that may even form a separate domain (Figure 6.24). In either case, typical domain 
structures consist of hydrophobic cores with hydrophilic surfaces (as was the case 
for ribonuclease, Figure 6.19). Importantly, individual domains often possess 
unique functional behaviors (for example, the ability to bind a particular ligand 
with high affinity and specificity), and an individual domain from a larger protein 
often expresses its unique function within the larger protein in which it is found. 
Multidomain proteins typically possess the sum total of functional properties and 
behaviors of their constituent domains. 

It is likely that proteins consisting of multiple domains (and thus multiple func- 
tions) evolved by the fusion of genes that once coded for separate proteins. This 
would require gene duplication to be common in nature, and analysis of completed 
genomes has confirmed that approximately 90% of domains in eukaryotes have 
been duplicated. Thus, the protein domain is a fundamental unit in evolution. 
Many proteins have been “assembled” by duplicating domains and then combining 
them in different ways. Many proteins are assemblies constructed from several in- 
dividual domains, and some proteins contain multiple copies of the same domain. 
Figure 6.25 shows the tertiary structures of nine domains that are frequently dupli- 
cated, and Figure 6.26 presents several proteins that contain multiple copies of one 
or more of these domains. 
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FIGURE 6.25 Ribbon structures of several protein modules used in the construction of complex multimodule 
proteins. (a) The complement control protein module (pdb id = 1HCC). (b) The immunoglobulin module 
(pdb id = 1T89). (c) The fibronectin type | module (pdb id = 1Q06).(d) The growth factor module (pdb id = 
1FSB). (e) The kringle module (pdb id = 1HPK). (f) The GYF module (pdb id = 1GYF). (g) The y-carboxygluta- 
mate module (pdb id = 1CFI). (h) The FF module (pdb id = 1UZC). (i) The DED domain (pdb id = 1A1W). 
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FIGURE 6.26 A sampling of proteins that consist of mo- 
saics of individual protein modules. The modules shown 
include yCG,a module containing y-carboxyglutamate 
residues; G, an epidermal growth factor-like module; 

K, the “kringle” domain, named for a Danish pastry; 

C, which is found in complement proteins; F1, F2, and 
F3, first found in fibronectin; |, the immunoglobulin 
superfamily domain; N, found in some growth factor 
receptors; E,a module homologous to the calcium- 







































































































































= binding E-F hand domain; and LB, a lectin module 
Q found in some cell surface proteins. (Adapted from Baron, 
z Eaa aa = M, Norman, D., and Campbell, |, 1991. Protein modules. Trends in 
as Biochemical Sciences 16:13-17.) 
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Classification Schemes for the Protein Universe Are Based on Domains 


The astounding diversity of properties and behaviors in living things can now be 
explored through the analysis of vast amounts of genomic information. Assessment 
of sequence and structural data from several million proteins in both protein and 
genome databases has shown that there is a relatively limited number of structurally 
distinct domains in proteins. Several comprehensive projects have organized the 
available information in defined hierarchies or levels of protein structure. 
The Structural Classification of Proteins database (SCOP, hitp://scop.mrc-lmb 
.cam.ac.uk/scop) recognizes five overarching classes, which encompass most pro- 
teins. SCOP is based on hierarchical levels that embody the evolutionary and struc- 
tural relationships among known proteins, and protein classification in SCOP is es- 
sentially a manual process using visual inspection and comparison of structures. 
CATH is another hierarchical classification system (http://www.cathdb.info) that 
groups protein domain structures into evolutionary families and structural group- 
ings, depending on sequence and structure similarities. CATH differs from SCOP 
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FIGURE 6.27 SCOP and CATH are hierarchical classifica- 
tion systems for the known proteins. Proteins are classi- 
fied in SCOP by a manual process, whereas CATH com- 
bines manual and automated procedures. Numbers 
indicate the population of each category. 


The CATH Hierarchy A 





The SCOP Hierarchy > 





in that it combines manual analysis with automation based on quantitative algo- 
rithms to classify protein structures. Figure 6.27 compares the hierarchical struc- 
tures of SCOP and CATH and defines the different levels of structure. 

Although the hierarchical names in SCOP and CATH differ somewhat, there are 
common threads shared in these schemes. Class is determined from the overall com- 
position of secondary structure elements in a domain. A fold describes the number, 
arrangement, and connections of these secondary structure elements. A superfamily 
includes domains of similar folds and usually similar functions, thus suggesting a 
common evolutionary ancestry. A family usually includes domains with closely re- 
lated amino acid sequences (in addition to folding similarities). Although the num- 
bers of unique folds, superfamilies, and families increase as more genomes are 
known and analyzed, it has become apparent that the number of protein domains in na- 
ture is large but limited. How many proteins can we expect to identify and understand 
someday? There are approximately 10° to 10° genes per organism and approximately 
13.6 million species of living organisms on earth (and this latter number is likely an 
underestimate). Thus, there may be approximately (10° X 1.36 X 107) or 10! to 10” 
different proteins in all organisms on earth. Still, this vast number of proteins may 
well consist of only about 10° sequence domain families (Figure 6.27) and approxi- 
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mately 10° protein folds of known structure—a remarkably small number compared 
to the total number of protein-coding genes (see Table 1.6). It is anticipated that 
most newly identified proteins will resemble other known proteins and that most 
structures can be broken into two or more domains, which resemble tertiary struc- 
tures observed in other proteins. 

Because structure depends on sequence, and because function depends on struc- 
ture, it is tempting to imagine that all proteins of a similar structure should share a 
common function, but this is not always true. For example, the TIM barrel is a com- 
mon protein fold consisting of eight a-helices and eight 6-strands that alternate 
along the peptide backbone to form a doughnutlike tertiary structure. The TIM 
barrel is named for triose phosphate isomerase, an enzyme that interconverts ke- 
tone and aldehyde substrates in the breakdown of sugars (see Chapter 18). How- 
ever, other TIM barrel proteins carry out very different functions (Figure 6.284), 
including the reduction of aldose sugars and hydrolysis of phosphate esters. More- 
over, not all proteins of similar function possess similar domains. Both proteins 
shown in Figure 6.28b catalyze the same reaction, but they bear little structural sim- 
ilarity to each other. 


Denaturation Leads to Loss of Protein Structure and Function 


Whereas the primary structure of proteins arises from covalent bonds, the sec- 
ondary, tertiary, and quaternary levels of protein structure are maintained by weak, 
noncovalent forces. The environment of a living cell is exquisitely suited to main- 
tain these weak forces and to preserve the structures of its many proteins. However, 
a variety of external stresses—for example, heat or chemical treatment—can disrupt 


(a) Same domain type, different functions: 





Triose phosphate isomerase Aldose reductase Phosphotriesterase 


(b) Same function, different structures: 





D-amino acid aminotransferase 





Aspartate aminotransferase 





FIGURE 6.28 (a) Some proteins share similar structural 
features but carry out quite different functions (triose 
phosphate isomerase, pdb id = 8TIM; aldose reductase, 
pdb id = 1ADS; phosphotriesterase, pdb id = 1DPM). 
(b) Proteins with quite different structures can carry 
out similar functions (yeast aspartate aminotransferase, 
pdb id = 1YAA);b-amino acid aminotransferase, pdb id 
= 3DAA). 
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FIGURE 6.30 Proteins can be denatured by heat, with 
commensurate loss of function. Ribonuclease A (blue) 
and ribonuclease B (red) lose activity above about 55°C. 
These two enzymes share identical protein structures, 
but ribonuclease B possesses a carbohydrate chain 
attached to Asn*4. (Adapted from Arnold, U. and Ulbrich- 
Hofmann, R, 1997. Kinetic and thermodynamic thermal stabili- 
ties of ribonuclease A and ribonuclease B. Biochemistry 36: 
2166-2172.) 








Ovalbumin monomer 


FIGURE 6.29 The proteins of egg white are denatured (as evidenced by their precipitation and aggregation) 
during cooking. More than half of the protein in egg whites is ovalbumin. Ovalbumin pdb id = 1OVA. 


these weak forces in a process termed denaturation—the loss of protein structure 
and function. 

An everyday example is the denaturation of the protein ovalbumin during the 
cooking of an egg (Figure 6.29). About 10% of the mass of an egg white is protein, 
and 54% of that is ovalbumin. When a chicken egg is cracked open, the “egg white” 
is a nearly transparent, viscous fluid. Cooking turns this fluid to a solid, white mass. 
The egg white proteins have unfolded and have precipitated out of solution, and 
the unfolded proteins have aggregated into a solid mass. 

As a typical protein solution is heated slowly, the protein remains in its native 
state until it approaches a characteristic melting temperature, Tm. As the solution is 
heated further, the protein denatures over a narrow range of temperatures around 
Tm (Figure 6.30). Denaturation over a very small temperature range such as this is 
evidence of a two-state transition between the native and the unfolded states of the 
protein, and this implies that unfolding is an all-or-none process: When weak forces 
are disrupted in one part of the protein, the entire structure breaks down. 

Most proteins can also be denatured below the transition temperature by a vari- 
ety of chemical agents, including acid or base, organic solvents, detergents, and par- 
ticular denaturing solutes. Guanidine hydrochloride and urea are examples of the 
latter (Figure 6.31). Denaturation in all these cases involves disruption of the weak 
forces that stabilize proteins. Covalent bonds are not affected. Acids and bases cause 
protonation and deprotonation of dissociable groups on the protein, altering ionic 
interactions and hydrogen bonds. Organic solvents and detergents disrupt hydro- 
phobic interactions that bury nonpolar groups in the protein interior. The effects 
of guanidine hydrochloride and urea are more complex. Recent research indicates 
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FIGURE 6.31 Proteins can be denatured (unfolded) by high concentrations of guanidine-HCl or urea. The 
denaturation of chymotrypsin is plotted here. (Adapted from Fersht, A, 1999. Structure and Mechanism in Protein Science. 
New York, W. H. Freeman.) 
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that these agents denature proteins by both direct effects (binding to hydrophilic 
groups on the protein) and indirect effects (altering the structure and dynamics of 
the water solvent). Also, both guanidine hydrochloride and urea are good H-bond 
donors and acceptors. 


Anfinsen’s Classic Experiment Proved That Sequence 
Determines Structure 


As noted earlier (Section 6.2), all the information needed to fold a polypeptide into 
its native structure is contained in the amino acid sequence. This simple but pro- 
found truth of protein structure was confirmed in the 1950s by the elegant studies 
of denaturation and renaturation of proteins by Christian Anfinsen and his co- 
workers at the National Institutes of Health. For their pivotal studies, they chose the 
small enzyme ribonuclease A from bovine pancreas, a protein with 124 residues 
and four disulfide bonds (Figures 6.19 and 6.32). (Ribonuclease cleaves chains of 


FIGURE 6.32 Ribonuclease can be unfolded by treatment with urea, and B-mercaptoethanol 
(MCE) cleaves disulfide bonds. If B-mercaptoethanol is then removed (but not urea) under 
oxidizing conditions, disulfide bonds reform in the still-unfolded protein (one possible hypo- 
thetical inactive form is shown). If urea is removed in the presence of a small amount of 
B-mercaptoethanol with gentle warming, ribonuclease returns to its native structure (with the 
correct set of disulfide bonds), and full enzymatic activity is restored. This experiment demon- 
strated that the information required for folding of globular proteins is contained in the pri- 
mary structure. 












Active Inactive 
Small amount of MCE —MCE 
w/gentle warming + Oxygen 





Hypothetical Inactive Form 
(Note random formation of disulfides) 
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ribonucleic acid. Only ribonuclease in its native structure posseses enzyme activity, 
so loss of activity in a denaturation experiment was proof of loss of structure.) They 
treated solutions of ribonuclease with a combination of urea, which unfolded the 
protein, and mercaptoethanol, which reduced the disulfide bridges. This treatment 
destroyed all enzymatic activity of ribonuclease. 

Anfinsen discovered that removing the mercaptoethanol but not the urea re- 
stored only 1% of the enzyme activity. This was attributed to the formation of ran- 
dom disulfide bridges by the still-denatured protein. With eight Cys residues, there 
are 105 possible ways to make four disulfide bridges; thus, a residual activity of 1% 
made sense to Anfinsen. (The first Cys to form a disulfide has seven possible part- 
ners, the next Cys has five possible partners, the next has three, and the last Cys has 
only one choice. 7 X 5 X 3 X 1 = 105). However, if Anfinsen removed mercap- 
toethanol and urea at the same time, the polypeptide was able to fold into its native 
structure, the correct set of four disulfides reformed, and full enzyme activity was 
recovered (Figure 6.32). This experiment demonstrated that the information 
needed for protein folding resided entirely within the amino acid sequence of the 
protein itself. Many subsequent experiments with a variety of proteins have con- 
firmed this fundamental postulate. For his studies of the relationship of sequence 
and structure, Anfinsen shared the 1972 Nobel Prize in Chemistry (with William H. 
Stein and Stanford Moore). 


Is There a Single Mechanism for Protein Folding? 


Christian Anfinsen’s experiments demonstrated that proteins can fold reversibly. A 
corollary result of Anfinsen’s work is that the native structures of at least some glob- 
ular proteins are thermodynamically stable states. But the matter of how a given pro- 
tein achieves such a stable state is a complex one. Cyrus Levinthal pointed out in 
1968 that so many conformations are possible for a typical protein that the protein 
does not have sufficient time to reach its most stable conformational state by sam- 
pling all the possible conformations. This argument, termed Levinthal’s paradox, goes 
as follows: Consider a protein of 100 amino acids. Assume that there are only two 
conformational possibilities per amino acid, or 21% = 1.27 X 10% possibilities. Allow 
107! sec for the protein to test each conformational possibility in search of the over- 
all energy minimum: 


(107}8 sec) (1.27 X 10°°) = 1.27 X 10!” sec = 4 X 10° years 


Four billion years is the approximate age of the earth. 

Levinthal’s paradox led protein chemists to hypothesize that proteins must fold 
by specific “folding pathways,” and many research efforts have been devoted to the 
search for these pathways. Several consistent themes have emerged from these stud- 
ies. Each of them may well play a role in the folding process: 


e Secondary structures—helices, sheets, and turns—probably form first. 

e Nonpolar residues may aggregate or coalesce in a process termed a hydrophobic 
collapse. 

e Subsequent steps probably involve formation of long-range interactions between 
secondary structures or involving other hydrophobic interactions. 

e The folding process may involve one or more intermediate states, including tran- 
sition states and what have become known as molten globules. 


The folding of most globular proteins may well involve several of these themes. 
For example, even in the denatured state, many proteins appear to possess small 
amounts of residual structure due to hydrophobic interactions, with strong inter- 
residue contacts between side chains that are relatively distant in the native protein 
structure. Such interactions, together with small amounts of secondary structure, 
may act as sites of nucleation for the folding process. A bit further in the folding 
process, the molten globule is postulated to be a flexible but compact form charac- 
terized by significant amounts of secondary structure, virtually no precise tertiary 
structure, and a loosely packed hydrophobic core. Moreover, it is likely that any one 
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D (94 ns) D (70 ns) I (30 ns) TS (0.26 ns) N (0 ns) 
Barnase 
D (4ns) D (2 ns) I (0.7 ns) TS (0.15 ns) N (0 ns) 


FIGURE 6.33 Computer simulations of folding and unfolding of proteins can reveal possible folding pathways. 
Molecular dynamics simulations of the unfolding of small proteins such as chymotrypsin inhibitor 2 (CI2) and 
barnase are presented here on a reversed time scale, to show how folding may occur. D = denatured, | = 
intermediate, TS = transition state, N = native. (Adapted from Daggett, V, and Fersht, A. R., 2003. Is there a unifying 
mechanism for protein folding? Trends in Biochemical Sciences 28:18-25. Figures provided by Alan Fersht and Valerie Daggett.) 


of these themes is more important for some proteins than for others. The process 
of folding is clearly complex, but sophisticated simulations have already provided 
reasonable models of folding (and unfolding) pathways for many proteins (Figure 
6.33). One school of thought suggests that for any given protein there may be mul- 
tiple folding pathways. For these cases, Ken Dill has suggested that the folding 
process can be pictured as a funnel of free energies—an energy landscape 
(Figure 6.34). The rim at the top of the funnel represents the many possible un- 
folded states for a polypeptide chain, each characterized by high free energy and 
significant conformational entropy. Polypeptides fall down the wall of the funnel as 
contacts made between residues establish different folding possibilities. The nar- 
rowing of the funnel reflects the smaller number of available states as the protein 
approaches its final state, and bumps or pockets on the funnel walls represent par- 
tially stable intermediates in the folding pathway. The most stable (native) folded 
state of the protein lies at the bottom of the funnel. 


What Is the Thermodynamic Driving Force for Folding 
of Globular Proteins? 


The free energy change for the folding of a globular protein must be negative if the 
folded state is more stable than the unfolded state. The free energy change for fold- 
ing depends, in turn, on changes in enthalpy and entropy for the process: 


AG=AH— TAS 


When AH, — TAS, and A Gare measured separately for the polar side chains and for 
the nonpolar side chains of the protein, an important insight is apparent. The en- 
thalpy and entropy changes for polar residues largely cancel each other out, and the 
AG of folding for the polar residues is approximately zero. 

To understand the behavior of the nonpolar residues, it is helpful to distinguish 
the AH and — TAS contributions for the polypeptide chain and for the water sol- 
vent. Both AH and — TAS for the nonpolar residues of the peptide chain are posi- 
tive and thus make unfavorable contributions to the folding free energy. However, 
large numbers of water molecules restricted and immobilized around nonpolar 
residues in the unfolded protein are liberated in the folding process. The burying 
of nonpolar residues in the folded protein’s core produces a dramatic entropy 
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FIGURE 6.34 A model for the steps involved in the fold- 
ing of globular proteins. The funnel represents a free 
energy surface or energy landscape for the folding 
process. The protein folding process is highly coopera- 
tive. Rapid and reversible formation of local secondary 
structures is followed by a slower phase in which estab- 
lishment of partially folded intermediates leads to the fi- 
nal tertiary structure. Substantial exclusion of water oc- 
curs very early in the folding process. 





increase for these liberated water molecules. This is just enough to make the over- 
all AG for folding negative (and thus favorable). The crucial results: 


e The largest contribution to the stability of a folded protein is the entropy change 
for the water molecules associated with the nonpolar residues. 

e The overall free energy change for the folding process is not large—typically 
—20 to —40 kJ/mol. 


Marginal Stability of the Tertiary Structure Makes Proteins Flexible 


A typical folded protein is only marginally stable. The hundreds of van der Waals in- 
teractions and hydrogen bonds in a folded structure are compensated and balanced 
by a dramatic loss of entropy suffered by the polypeptide as it assumes a compact 
folded structure. Because stability seems important to protein and cellular function, 
it is tempting to ask what the advantage of marginal stability might be. The answer 
appears to lie in flexibility and motion. All chemical bonds undergo a variety of mo- 
tions, including vibrations and (for single bonds) rotations. This propensity to 
move, together with the marginal stability of protein structures, means that the 
many noncovalent interactions within a protein can be interrupted, broken, and re- 
arranged rapidly. 
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Motion in Globular Proteins 


Proteins ave best viewed as dynamic structures. Most globular proteins oscillate and fluc- 
tuate continuously about their average or equilibrium structures (Figure 6.35). This 
flexibility is essential for a variety of protein functions, including ligand binding, en- 
zyme catalysis, and enzyme regulation, as shown throughout the remainder of this 
text. 

The motions of proteins may be motions of individual atoms, groups of atoms, 
or even whole sections of the protein. Furthermore, they may arise either from ther- 
mal energy or from specific, triggered conformational changes in the protein. 
Atomic fluctuations such as vibrations typically are random, are very fast, and usu- 
ally occur over small distances, as shown in Table 6.2. These motions arise from the 
kinetic energy within the protein and are a function of temperature. In the tightly 
packed interior of the typical protein, atomic movements of an angstrom or less are 
typical. The closer to the surface of the protein, the more movement can occur, and 
on the surface atomic movements of several angstroms are possible. 

A class of slower motions, which may extend over larger distances, is collective 
motions. These are movements of a group of atoms covalently linked in such a way 
that the group moves as a unit. Such a group can range from a few atoms to hun- 
dreds of atoms. These motions are of two types: (1) those that occur quickly but in- 
frequently, such as tyrosine ring flips, and (2) those that occur slowly, such as the 
hinge-bending movement between protein domains. For example, the two antigen- 
binding domains of immunoglobulins move as relatively rigid units to selectively 
bind separate antigen molecules. These collective motions also arise from thermal 
energies in the protein and operate on a timescale of 107 to 107° sec. It is often 
important to distinguish the time scale of the motion itself versus the frequency of 
its occurrence. A tyrosine ring flip takes only a picosecond (1 X 107!” sec), but such 
flips occur only about once every millisecond (1 X 107° sec). 

Conformational changes involve motions of groups of atoms (individual side 
chains, for example) or even whole sections of proteins. These motions occur on a 
time scale of 107° to 10° sec, and the distances covered can be as large as 1 nm. 
These motions may occur in response to specific stimuli or arise from specific in- 
teractions within the protein (hydrogen bonding, electrostatic interactions, or lig- 
and binding—see Chapters 14 and 15). 

The cis-trans isomerization of proline residues in proteins (Figure 6.36) occurs 
over an even longer time scale—typically 10! to 104 sec. Conversion of even a single 
proline from its cs to its trans configuration can alter a protein structure dramatically. 








TABLE 6.2 
Spatial 
Displacement Characteristic 
Type of Motion (A) Time (sec) Source of Energy 
Atomic vibrations 0.01-1 10715-107! Kinetic energy 
Collective motions 0.01-5 107—107" Kinetic energy 
or more 
1. Fast: Tyr ring flips; 
methyl group rotations 
2. Slow: hinge bending 
between domains 
Triggered conformation 0.5-10 10°*-10° Interactions with 
changes or more triggering agent 
Proline cis-trans 3-10 10!-104 Kinetic energy or 
isomerization enzyme driven 











Adapted from Petsko, G. A., and Ringe, D., 1984. Fluctuations in protein structure from X-ray diffraction. Annual Review 
of Biophysics and Bioengineering 13:331-371. 





FIGURE 6.35 Proteins are dynamic structures. The mar- 
ginal stability of a tertiary structure leads to flexibility 
and motion in the protein. Determination of structures 
of proteins (such as the SH3 domain of the a-chain of 
spectrin, shown here) by nuclear magnetic resonance 
produces a variety of stable tertiary structures that fit 
the data. Such structural ensembles provide a glimpse 
into the range of structures that may be accessible to a 
flexible, dynamic protein (pdb id = 1M8M). 


166 Chapter6 Proteins: Secondary, Tertiary, and Quaternary Structure 


(a) 


H 
ye HoC 
O ~ O 2m 
\ / * CH \ 7 CH 
" = l x poo i 
Cy, nc- OH = & (CES cH, 
AX R /X 
H R H R H 
trans cls 


FIGURE 6.36 The cis and trans configurations of proline residues in peptide chain are almost equally stable. 
Proline cis-trans isomerizations, often occurring over relatively long time scales, can alter protein structure 
significantly. 


Proline cis-trans isomerizations sometimes act as switches to activate a protein or open 
a channel across a membrane (see Chapter 9). 


The Folding Tendencies and Patterns of Globular Proteins 


Globular proteins adopt the most stable tertiary structure possible. To do this, the 
peptide chain must both (1) satisfy the constraints inherent in its own structure and 
(2) fold so as to “bury” the hydrophobic side chains, minimizing their contact with 
solvent. The polypeptide itself does not usually form simple straight chains. Even in 
chain segments where helices and sheets are not formed, an extended peptide 
chain, being composed of L-amino acids, has a tendency to twist slightly in a right- 
handed direction. As shown in Figure 6.37, this tendency is apparently the basis for 
the formation of a variety of tertiary structures having a right-handed sense. Princi- 
pal among these are the right-handed twists in -sheets and right-handed cross- 
overs in parallel B-sheets. Right-handed twisted 6-sheets are found at the center of 
a number of proteins (Figure 6.38) and provide an extended, highly stable struc- 
tural core. 

Connections between f-strands are of two types—hairpins and cross-overs. 
Hairpins, as shown in Figure 6.37, connect adjacent antiparallel 6-strands. Cross- 
overs are necessary to connect adjacent (or nearly adjacent) parallel 6-strands. 


(b) 








Natural right-handed twist by polypeptide chain 





Antiparallel hairpin 





Parallel, right-handed 





Cross-overs Parallel, left-handed 














FIGURE 6.37 (a) The natural right-handed twist exhibited by polypeptide chains, and (b) the types of connec- 
tions between B-strands. 
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Hydrophobic residues are 
buried between layers 





(a) Cytochrome ¢ (b) Phosphoglycerate kinase 
(domain 2) 


CENGAGENOW” ACTIVE FIGURE 6.38 Examples of protein domains with different numbers of layers of back- 
bone structure. (a) Cytochrome c’ with two layers of a-helix. (b) Domain 2 of phosphoglycerate kinase, com- 
posed of a B-sheet layer between two layers of helix, three layers overall. (c) An unusual five-layer structure, 
domain 2 of glycogen phosphorylase, a B-sheet layer sandwiched between four layers of a-helix. (d) The con- 
centric “layers” of B-sheet (inside) and a-helix (outside) in triose phosphate isomerase. Hydrophobic residues are 
buried between these concentric layers in the same manner as in the planar layers of the other proteins. 

The hydrophobic layers are shaded yellow. (Original art courtesy of Jane Richardson.) Test yourself on the con- 
cepts in this figure at www.cengage.com/login 


Nearly all cross-over structures are right-handed. In many cross-over structures, the 
cross-over connection itself contains an a-helical segment. This creates a BaB-loop. 
As shown in Figure 6.37, the strong tendency in nature to form right-handed cross- 
overs, the wide occurrence of a-helices in the cross-over connection, and the right- 
handed twists of -sheets can all be understood as arising from the tendency of an 
extended polypeptide chain of L-amino acids to adopt a right-handed twist struc- 
ture. This is a chiral effect. Proteins composed of D-amino acids would tend to adopt 
left-handed twist structures. 

The second driving force that affects the folding of polypeptide chains is the need 
to bury the hydrophobic residues of the chain, protecting them from solvent water. 
From a topological viewpoint, then, all globular proteins must have an “inside” 
where the hydrophobic core can be arranged and an “outside” toward which the hy- 
drophilic groups must be directed. The sequestration of hydrophobic residues away 
from water is the dominant force in the arrangement of secondary structures and 
nonrepetitive peptide segments to form a given tertiary structure. Globular proteins 
can be classified mainly on the basis of the particular kind of core or backbone struc- 
ture they use to accomplish this goal. The term hydrophobic core, as used here, refers 
to aregion in which hydrophobic side chains cluster together, away from the solvent. 
Backbone refers to the polypeptide backbone itself, excluding the particular side 
chains. Globular proteins can be pictured as consisting of “layers” of backbone, with 
hydrophobic core regions between them. More than half the known globular pro- 
tein structures have two layers of backbone (separated by one hydrophobic core). 
Roughly one-third of the known structures are composed of three backbone layers 
and two hydrophobic cores. There are also a few known four-layer structures and at 
least one five-layer structure. A few structures are not easily classified in this way, but 
it is remarkable that most proteins fit into one of these classes. Examples of each are 
presented in Figure 6.38. 


(c) Phosphorylase 
(domain 2) 





(d) Triosephosphate isomerase 
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Most Globular Proteins Belong to One of Four Structural Classes 


In addition to classification based on layer structure, proteins can be grouped ac- 
cording to the type and arrangement of secondary structure (Figure 6.39). There 
are four such broad groups: all œ proteins and all B proteins (in which the struc- 
tures are dominated by a-helices and $-sheets, respectively), œ/ß proteins (in which 
helices and sheets are intermingled), and a+f proteins (in which a-helical and 
-sheet domains are separated for the most part). 

It is important to note that the similarities of tertiary structure within these 
groups do not necessarily reflect similar or even related functions. Instead, func- 
tional homology usually depends on structural similarities on a smaller and more in- 
timate scale. 


Molecular Chaperones Are Proteins That Help Other Proteins to Fold 


To a first approximation, all the information necessary to direct the folding of a 
polypeptide is contained in its primary structure. On the other hand, the high 
protein concentration inside cells may adversely affect the folding process be- 
cause hydrophobic interactions may lead to aggregation of some unfolded or par- 
tially folded proteins. Also, it may be necessary to suppress or reverse incorrect or 
premature folding. A family of proteins, known as molecular chaperones, are es- 
sential for the correct folding of certain polypeptide chains in vivo; for their as- 
sembly into oligomers; and for preventing inappropriate liaisons with other pro- 
teins during their synthesis, folding, and transport. Many of these proteins were 
first identified as heat shock proteins, which are induced in cells by elevated tem- 
perature or other stress. The most thoroughly studied proteins are Hsp70, a 
70-kD heat shock protein, and the so-called chaperonins, also known as Cpn60s or 
Hsp60s, a class of 60-kD heat shock proteins. A well-characterized Hsp60 chaper- 
onin is GroEL, an E. coli protein that has been shown to affect the folding of sev- 
eral proteins. The mechanism of action of chaperones is discussed in Chapter 31. 


Some Proteins Are Intrinsically Unstructured 


Remarkably, it is now becoming clear that many proteins exist and function nor- 
mally in a partially unfolded state. Such proteins, termed intrinsically unstructured 
proteins (IUPs) or natively unfolded proteins, do not possess uniform structural 
properties but are nonetheless essential for basic cellular functions. These proteins 
are characterized by an almost complete lack of folded structure and an extended 
conformation with high intramolecular flexibility. 

Intrinsically unstructured proteins contact their targets over a large surface area 
(Figure 6.40). The p27 protein complexed with cyclin-dependent protein kinase 2 
(Cdk2) and cyclin A shows that p27 is in contact with its binding partners across its 
entire length. It binds in a groove consisting of conserved residues on cyclin A. On 
Cdk2, it binds to the N-terminal domain and also to the catalytic cleft. One of the 
most appropriate roles for such long-range interactions is assembly of complexes 
involved in the transcription of DNA into RNA, where large numbers of proteins 
must be recruited in macromolecular complexes. Thus, the transactivator domain 
catenin-binding domain (CBD) of tcf3 is bound to several functional domains of 
B-catenin (Figure 6.40). 

Can amino acid sequence information predict the existence of intrinsically un- 
structured regions on proteins? Intrinsically unstructured proteins are character- 
ized by a unique combination of high net charge and low overall hydrophobicity. 
Compared with ordered proteins, IUPs have higher levels of E, K, R, G, Q, S, and P, 
and low amounts of I, L, V, W, F, Y, C, and N. These features provide a rationale for 
prediction of regions of disorder from amino acid sequence information, and ex- 
perimental evidence shows that such predictions are better than 80% accurate. 
Genomic analysis of disordered proteins indicates that the proportion of the 
genome encoding IUPs and proteins with substantial regions of disorder tends to 
increase with the complexity of organisms. Thus, predictive analysis of whole 
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Human growth hormone Leucine-rich repeat Peridinin-chlorophyll protein Endoglucanase A (an a-helical Cat allergen 
(pdb id = 1HGU) variant (pdb id = 1LRV) (a “solenoid”—pdb id = 1PPR) barrel—pdb id = 1CEM) (pdb id = 1PUO) 
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Mannose-specific Rieske iron protein Hemopexin C-terminal Lectin from R. solanacearum Pleckstrin domain of 
aggluttinin (a prism— (a 3-layer B-sandwich— domain (a 4-bladed (a 6-bladed propellor— protein kinase B/AKT 
(pdb id = 1JPC) (pdb id = 1RIE) propellor—pdb id = 1HXN) pdb id = 1BT9) (pdb id = 1UNQ) 
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—pdb id = 2HVM) ø 
Equine leucocyte RuvA protein 
elastase inhibitor (pdb id = 1CUK) 
(pdb id = 1HLE) 
Human bactericidal Hepatocyte growth factor Prokaryotic ribosomal 
permeability-increasing (N-terminal domain protein L9 
protein (pdb id = 1BP1) —pdb id = 2HGF) (pdb id = 1DIV) 





Ribonuclease H L-Arginine: glycine 
(pdb id = 1RNH) amidinotransferase (a metabolic 
enzyme—pdb id = 4JDW) 
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MurA (an &-ß prism Porcine ribonuclease inhibitor 


—pdb id = 1EYN) (a “horseshoe”—pdb id = 2BNH) 
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FIGURE 6.39 Four major classes of protein structure (as defined in the SCOP database). (a) All œ proteins, 
where a-helices dominate the structure; (b) All B proteins, in which B-sheets are the primary feature; (c) a/B 
proteins, where a-helices and B-sheets are mixed within a domain; (d) a+8 proteins, in which a-helical and 
B-sheet domains are separated to at least some extent. 
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(a) 





FIGURE 6.40 Intrinsically unstructured proteins (IUPs) 
contact their target proteins over a large surface area. 
(a) p27*'! (yellow) complexed with cyclin-dependent 
kinase 2 (Cdk2, blue) and cyclin A (CycA, green). (b) The 
ransactivator domain CBD of Tcf3 (yellow) bound to 
B-catenin (blue). Note: Part of the B-catenin has been 
removed for a clear view of the CBD. (c) Bob 1 transcrip- 
ional coactivator (yellow) in contact with its four part- 
ners: TAF;,105 (green oval), the Oct 1 domains POU SD 
and POU HD (green), and the lgx promoter (blue). (From 
Tompa, P, 2002. Intrinsically unstructured proteins. Trends in Bio- 
chemical Sciences 27:527-533.) (d-g) Some intrinsically un- 
structured proteins (in red and yellow) bind to their 
argets by wrapping around them. Shown here are 

(d) SNAP-25 bound to BoNT/A, (e) SARA SBD bound to 
Smad 2 MH2, (f) HIF-1q@ interaction domain bound to 
he TAZ1 domain of CBP, and (g) HIF-1@ interaction 
domain bound to asparagine hydroxylase FIH. (From 
Trends in Biochemical Sciences, Vol. 27, No. 10, page 530. October 
2002.) 








(b) (c) 





B-catenin 








genomes indicates that 2% of archaeal, 4.2% of bacterial, and 33% of eukaryotic 
proteins probably contain long regions of disorder. 

Some proteins are disordered throughout their length, whereas others may con- 
tain stretches of 30 to 40 residues or more that are disordered and imbedded in 
an otherwise folded protein. The prevalence of disordered segments in proteins 
may reflect two different cellular needs. (1) Disordered proteins are more mal- 
leable and thus can adapt their structures to bind to multiple ligands, including 
other proteins. Each such interaction could provide a different function in the 
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a,-Antitrypsin—A Tale of Molecular Mousetraps and a Folding Disease 


In the human lung, oxygen and CO, are exchanged across the walls 
of alveoli—air sacs surrounded by capillaries that connect the pul- 
monary veins and arteries. The walls of alveoli consist of the elastic 
protein elastin. Inhalation expands the alveoli, and exhalation com- 
presses them. A pair of human lungs contains 300 million alveoli, 
and the total area of the alveolar walls in contact with capillaries is 
about 70 m?—an area about the size of a tennis court! In the lungs, 
neutrophils (a type of white blood cell) naturally secrete elastase, a 
protein-cleaving enzyme essential to tissue repair. However, elastase 
also can attack and break down the elastin of the alveolar walls 
if it spreads from the site of inflammation repair. To prevent this, 
the liver secretes into the blood a-antitrypsin—a 52-kD protein 
belonging to the serpin (serine protease inhibitor) family—which 
blocks elastase action, preventing alveolar damage. 

a,-Antitrypsin is a molecular mousetrap, with a flexible peptide 
loop (blue in the figure) that contains a Met residue as “bait” for the 
elastase and that can swing like the arm of a mousetrap. When elas- 
tase binds to the loop at the Met residue, it cuts the peptide loop. 
Now free to move, the loop slides into the middle of a large beta 
sheet (green), at the same time dragging elastase to the opposite side 
of the a -antitrypsin structure. At this new binding site, the elastase 
structure is distorted, and it cannot complete its reaction and free it- 
self from the q)-antitrypsin. Cellular scavenger enzymes then attack 


(a) 


Elastase 


a 


Met-containing 





(b) 








the elastase—antitypsin complex and destroy it. By sacrificing itself in 
this way, the aj-antitrypsin has prevented damage to the alveolar 
elastin. 

Defects in a -antitrypsin can cause serious lung and liver damage. 
The gene for q)-antitrypsin is polymorphic (that is, it occurs as many 
different sequence variants) and many variants of @-antitrypsin are 
either poorly secreted by the liver or function poorly in the lungs. 
Even worse, tobacco smoke oxidizes the critical Met residue in the 
flexible loop of aj-antitrypsin, and smokers, especially those who 
carry mutants of this protein, often develop emphysema—the de- 
struction of the elastin connective tissue in the lungs. 

The flexible loop of a)-antitrypsin—its mousetrap spring—is 
also its Achilles’ heel. Mutations in this loop make the protein vul- 
nerable to aberrant conformational changes. The Z-mutation of 
q)-antitrypsin is an interesting case, with a Lys in place of Glu at 
residue 342 (indicated by the arrow in M) at the base of the flexi- 
ble loop. This causes partial loop insertion in the large B-sheet 
(M*). This induces the modified B-sheet to accept the flexible 
loop of another qj-antitrypsin, forming a dimer. Repetition of 
these events forms polymers, which are trapped in the liver (often 
leading to cirrhosis and death). Z variants that manage to make it 
to the lungs associate so slowly with elastase that they are ineffec- 
tive in preventing lung damage. 


<“ (a) Elastase (dark gray) is inactivated by binding to a -antitrypsin. 
When elastase binds, cleaving the flexible loop at a Met residue, 
the rest of the loop (the red B-strand) rotates more than 180° and 
inserts into the green f-sheet, swinging the elastase to the other 
end of the molecule. At this new location, the elastase is distorted 
and inactivated. (b) In the Z-mutant of ay-antitrypsin, the flexible 
loop is only partially inserted in the large B-sheet, promoting poly- 
mer formation and trapping ay-antitrypsin at its site of synthesis in 
the liver. The consequences of this are cirrhosis of the liver, as well 
as lung damage, since the small amount of a;-antitrypsin that 
reaches the lungs is ineffective in preventing lung damage. Individ- 
ual monomers in the a -antitrypsin polymer are colored red, blue, 
and gold (far right). (From Lomas, D. A., et al., 2005. Molecular mousetraps 
and serpinopathies. Biochem Soc. Transactions 33:321-330. Figure provided by 
David Lomas.) 
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HUMAN BIOCHEMISTRY 


Diseases of Protein Folding 


A number of human diseases are linked to abnormalities of pro- 
tein folding. Protein misfolding may cause disease by a variety of 
mechanisms. For example, misfolding may result in loss of func- 


tion and the onset of disease. The following table summarizes sev- 
eral other mechanisms and provides an example of each. 





Disease Affected Protein 


Mechanism 





Alzheimer’s disease B-Amyloid peptide (derived from 


amyloid precursor protein) 


Familial amyloidotic Transthyretin 
polyneuropathy 

Cancer p53 
Creutzfeldt-Jakob disease Prion 
(human equivalent of 

mad cow disease) 

Hereditary emphysema a,-Antitrypsin 


Cystic fibrosis 
conductance regulator) 





CFTR (cystic fibrosis transmembrane 


Misfolded B-amyloid peptide accumulates in human neural 
tissue, forming deposits known as neuritic plaques. 
Aggregation of unfolded proteins. Nerves and other organs 
are damaged by deposits of insoluble protein products. 

p53 prevents cells with damaged DNA from dividing. One 
class of p53 mutations leads to misfolding; the misfolded 
protein is unstable and is destroyed. 

Prion protein with an altered conformation (PrPS°) may seed 
conformational transitions in normal PrP (PrP°) molecules. 


Mutated forms of this protein fold slowly, allowing its target, 
elastase, to destroy lung tissue. 

Folding intermediates of mutant CFTR forms don’t dissociate 
freely from chaperones, preventing the CFTR from reaching 
its destination in the membrane. 











cell. (2) Compared with compact, folded proteins, disordered segments in pro- 
teins appear to be able to form larger intermolecular interfaces to which ligands, 
such as other proteins, could bind (Figure 6.40). Folded proteins might have to be 
two to three times larger to produce the binding surface possible with a disordered 
protein. Larger proteins would increase cellular crowding or could increase cell 
size by 15% to 30%. The flexibility of disordered proteins may thus reduce protein, 
genome, and cell sizes. 
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Structural Genomics 


The prodigious advances in genome sequencing in recent years, 
together with advances in techniques for protein structure deter- 
mination, have not only provided much new information for bio- 
chemists but have also spawned a new field of investigation— 
structural genomics, the large-scale analysis of protein structures 
and functions based on gene sequences. The scale of this new en- 
deavor is daunting: hundreds of thousands of gene sequences are 
rapidly being determined, and current estimates suggest that 
there are probably less than 10,000 distinct and stable polypep- 
tide folding patterns in nature. The feasibility of large-scale, high- 
throughput structure determination programs is being explored 
in a variety of pilot studies in Europe, Asia, and North America. 
These efforts seek to add 20,000 or more new protein structures 
to our collected knowledge in the near future; from this wealth of 
new information, it should be possible to predict and determine 
new structures from sequence information alone. This effort will 
be vastly more complex and more expensive than the Human 
Genome Project. It presently costs about $100,000 to determine 


the structure of the typical globular protein, and one of the goals 
of structural genomics is to reduce this number to $20,000 or less. 
Advances in techniques for protein crystallization, X-ray diffrac- 
tion, and NMR spectroscopy, the three techniques essential to 
protein structure determination, will be needed to reach this goal 
in the near future. 

The payoffs anticipated from structural genomics are substantial. 
Access to large amounts of new three-dimensional structural infor- 
mation should accelerate the development of new families of drugs. 
The ability to scan databases of chemical entities for activities 
against drug targets will be enhanced if large numbers of new pro- 
tein structures are available, especially if complexes of drugs and tar- 
get proteins can be obtained or predicted. The impact of structural 
genomics will also extend, however, to functional genomics—the 
study of the functional relationships of genomic content—which 
will enable the comparison of the composite functions of whole 
genomes, leading eventually to a complete biochemical and mech- 
anistic understanding of all organisms, including humans. 
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FIGURE 6.41 The quaternary structure of liver alcohol dehydrogenase. Within each subunit is a six-stranded 
parallel sheet. Between the two subunits is a two-stranded antiparallel sheet (pdb id = 1CDO). Jane Richardson.) 


6:58 How Do Protein Subunits Interact at the Quaternary 
Level of Protein Structure? 


Many proteins exist in nature as oligomers, complexes composed of (often sym- 
metric) noncovalent assemblies of two or more monomer subunits. In fact, subunit 
association is a common feature of macromolecular organization in biology. Most 
intracellular enzymes are oligomeric and may be composed either of a single type 
of monomer subunit (homomultimers) or of several different kinds of subunits (het- 
eromultimers). The simplest case is a protein composed of identical subunits. Liver al- 
cohol dehydrogenase, shown in Figure 6.41, is such a protein. Alcohol consumed in 
a beer or mixed drink is oxidized in the liver by alcohol dehydrogenase. Hormonal 
signals modulate blood sugar levels by controlling the activity of glycogen phos- 
phorylase, an elegantly regulated homodimeric muscle enzyme. Oxygen is carried 
in the blood by hemoglobin, which contains two each of two different subunits (het- 
erotetramer). A counterpoint to these small clusters is made by the proteins that form 
large polymeric aggregates. Proteins are synthesized on large complexes of many pro- 
tein units and several RNA molecules called ribosomes. Muscle contraction depends 
on large polymer clusters of the protein myosin sliding along filamentous polymers 
of another protein, actin. The way in which separate folded monomeric protein 
subunits associate to form the oligomeric protein constitutes the quaternary struc- 
ture of that protein. Table 6.3 lists several proteins and their subunit compositions 
(see also Table 4.2). Proteins with two to four subunits predominate in nature, but 
many cases of higher numbers exist. 

The subunits of an oligomeric protein typically fold independently and then in- 
teract with other subunits. The surfaces at which subunits interact are similar in na- 
ture to the interiors of the individual subunits—closely packed with both polar and 
hydrophobic interactions. Interacting surfaces must therefore possess complemen- 
tary arrangements of polar and hydrophobic groups. 

Oligomeric associations of protein subunits can be divided into those between 
identical subunits and those between nonidentical subunits. Interactions among 
identical subunits can be further distinguished as either isologous or heterologous. 
In isologous interactions, the interacting surfaces are identical and the resulting 
structure is necessarily dimeric and closed, with a twofold axis of symmetry (Figure 
6.42). If any additional interactions occur to form a trimer or tetramer, these must 
use different interfaces on the protein’s surface. Many proteins, such as transthy- 
retin, form tetramers by means of two sets of isologous interactions (Figure 6.43). 
Such structures possess three different twofold axes of symmetry. In contrast, het- 
erologous associations among subunits involve nonidentical interfaces. These sur- 
faces must be complementary, but they are generally not symmetric. 














TABLE 6.3 / 

Number of 
Protein Subunits 
Alcohol dehydrogenase 2 
Malate dehydrogenase 2 
Superoxide dismutase 2 
Triose phosphate isomerase 2 
Glycogen phosphorylase 2 
Aldolase 3 
Bacteriochlorophyll protein 3 
Concanavalin A 4 
Glyceraldehyde-3-phosphate 4 
dehydrogenase 
Immunoglobulin 4 
Lactate dehydrogenase 4 
Prealbumin 4 
Pyruvate kinase 4 
Phosphoglycerate mutase 4 
Hemoglobin 2+2 
Insulin 6 
Aspartate transcarbamoylase 6+6 
Glutamine synthetase 12 
TMV protein disc 17 
Apoferritin 24 
Coat of tomato bushy stunt 180 


virus 
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FIGURE 6.43 Many proteins form tetramers by means of 
two sets of isologous interactions. The dimeric (a) and 
tetrameric (b) forms of transthyretin (also known as 
prealbumin) are shown here (pdb id = 1GKE).The 
monomers of this protein form a dimer in a manner that 
extends the large monomer B-sheet. The tetramer is 
formed by isologous interactions between the large 
B-sheets of two dimers. 


(c) Heterologous tetramer 
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FIGURE 6.42 Isologous and heterologous associations between protein subunits. (a) An isologous interaction be- 
tween two subunits with a twofold axis of symmetry perpendicular to the plane of the page. (b) A heterologous 
interaction that could lead to the formation of a long polymer. (c) A heterologous interaction leading to a closed 
structure—a tetramer. (d) A tetramer formed by two sets of isologous interactions. 





There Is Symmetry in Quaternary Structures 


Many multimeric proteins are symmetric arrangements of asymmetric objects (the 
monomer subunits). All of the polypeptide’s a-carbons are asymmetric, and the 
polypeptide nearly always folds to form a low-symmetry structure. (The long helical 
arrays formed by some synthetic polypeptides are an exception.) Thus, protein sub- 
units do not have mirror reflection planes, points, or axes of inversion. The only sym- 
metry operation possible for protein subunits is a rotation. The most common sym- 
metries observed for multisubunit proteins are cyclic symmetry and dihedral 
symmetry. In cyclic symmetry, the subunits are arranged around a single rotation 
axis, as shown in Figure 6.44. If there are two subunits, the axis is referred to as a 
twofold rotation axis. Rotating the quaternary structure 180° about this axis gives a 
structure identical to the original one. With three subunits arranged about a three- 
fold rotation axis, a rotation of 120° about that axis gives an identical structure. 
Dihedral symmetry occurs when a structure possesses at least one twofold rotation 
axis perpendicular to another n-fold rotation axis. This type of subunit arrangement 
(Figure 6.44) occurs in annexin XII (where n = 3). 


Quaternary Association Is Driven by Weak Forces 


Weak forces stabilize quaternary structures. Typical dissociation constants for simple 
two-subunit associations range from 10~* to 10~!° M. These values correspond to free 
energies of association of about 50 to 100 kJ/mol at 37°C. Dimerization of subunits is 
accompanied by both favorable and unfavorable energy changes. The favorable in- 
teractions include van der Waals interactions, hydrogen bonds, ionic bonds, and hy- 
drophobic interactions. However, considerable entropy loss occurs when subunits in- 
teract. When two subunits move as one, three translational degrees of freedom are 
lost for one subunit because it is constrained to move with the other one. In addition, 
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"a 
d 
id 
Dimer Trimer Tetramer Pentamer 
ARNT PAS-B E. blattae acid phosphatase B. anthracis dihydrodipicolinate Shiga-like toxin I B 
(pdb id = 2HV1) (pdb id = 2EOT) synthase (pdb id = 1XL9) (pdb id = 1CZG) 





Cyclic hexamer Trimer of dimers Bundled hexamer 
Circadian clock protein KaiC Annexin XII Uridylate kinase 
(pdb id = 1TF7) (pdb id = 1DM5) (pdb id = 2A1F) 





Heptamer Octamer Dodecamer 
M. tuberculosis chaperonin-10 Limulus polyphemus SAP-like pentraxin Lactococcus lactis MG1363 DpsB protein 
(pdb id = 1HX5) (pdb id = 1QTJ) (pdb id = 1ZS3) 


FIGURE 6.44 Multimeric proteins are symmetric arrangements of asymmetric objects. A variety of symmetries is 
displayed in these multimeric structures. 
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many peptide residues at the subunit interface, which were previously free to move 
on the protein surface, now have their movements restricted by the subunit associa- 
tion. This unfavorable energy of association is in the range of 80 to 120 kJ/mol for 
temperatures of 25° to 37°C. Thus, to achieve stability, the dimerization of two sub- 
units must involve approximately 130 to 220 kJ/mol of favorable interactions.! Van 
der Waals interactions at protein interfaces are numerous, often running to several 
hundred for a typical monomer—monomer association. This would account for about 
150 to 200 kJ/mol of favorable free energy of association. However, when solvent is 
removed from the protein surface to form the subunit-subunit contacts, nearly as 
many van der Waals associations are lost as are made. One subunit is simply trading 
water molecules for peptide residues in the other subunit. As a result, the energy of 
subunit association due to van der Waals interactions actually contributes little to the 
stability of the dimer. Hydrophobic interactions at the subunit-subunit interface, 
however, are generally very favorable. For many proteins, the subunit association 
process effectively buries as much as 20 nm? of surface area previously exposed to sol- 
vent, resulting in as much as 100 to 200 kJ/mol of favorable hydrophobic interactions. 
Together with whatever polar interactions occur at the protein-protein interface, this 
is sufficient to account for the observed stabilization that occurs when two protein 
subunits associate. 

An additional and important factor contributing to the stability of subunit asso- 
ciations for some proteins is the formation of disulfide bonds between different sub- 
units. All antibodies are aoB>-tetramers composed of two heavy chains (53 to 75 kD) 
and two light chains (23 kD). In addition to intrasubunit disulfide bonds (four per 
heavy chain, two per light chain), two intersubunit disulfide bridges hold the two 
heavy chains together and a disulfide bridge links each of the two light chains to 
a heavy chain (Figure 6.45). 








FIGURE 6.45 Schematic drawing of an immunoglobulin molecule, showing the intermolecular and intra- 
molecular disulfide bonds. Two identical L chains are joined with two identical H chains. Each L chain is held to 
an H chain via an interchain disulfide bond. The variable regions of the four polypeptides lie at the ends of the 
arms of the Y-shaped molecule. These regions are responsible for the antigen recognition function of antibody 
molecules. For purposes of illustration, some features are shown on only one or the other L chain or H chain, but 
all features are common to both chains. 


'For example, 130 kJ/mol of favorable interaction minus 80 kJ/mol of unfavorable interaction equals 
a net free energy of association of 50 kJ/mol. 
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A DEEPER LOOK 





Immunoglobulins—All the Features of Protein Structure Brought Together 


The immunoglobulin structure in Figure 6.45 represents the con- 
fluence of all the details of protein structure that have been thus 
far discussed. As for all proteins, the primary structure determines 
other aspects of structure. There are numerous elements of sec- 
ondary structure, including -sheets and tight turns. The tertiary 
structure consists of 12 distinct domains, and the protein adopts a 
heterotetrameric quaternary structure. To make matters more in- 
teresting, both intrasubunit and intersubunit disulfide linkages act 
to stabilize the discrete domains and to stabilize the tetramer itself. 


globulin chains are not constant! Instead, the primary structure of 
these chains is highly variable in the N-terminal regions (first 108 
residues). Heterogeneity of the amino acid sequence leads to vari- 
ations in the conformation of these variable regions. This variation 
accounts for antibody diversity and the ability of antibodies to rec- 
ognize and bind a virtually limitless range of antigens. This full po- 
tential of antibody:antigen recognition enables organisms to 
mount immunological responses to almost any antigen that might 
challenge the organism. 
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One more level of sophistication awaits. As discussed in Chap- 
ter 28, the amino acid sequences of both light and heavy immuno- 








Open Quaternary Structures Can Polymerize 


All of the quaternary structures we have considered to this point have been closed 
structures, with a limited capacity to associate. Many proteins in nature associate to 
form open heterologous structures, which can polymerize more or less indefinitely, 
creating structures that are both esthetically attractive and functionally important to 
the cells or tissue in which they exist. One such protein is tubulin, the a6-dimeric 
protein that polymerizes into long, tubular structures that are the structural basis 
of cilia, flagella, and the cytoskeletal matrix. The microtubule thus formed (Figure 
6.46) may be viewed as consisting of 13 parallel filaments arising from end-to-end 
aggregation of the tubulin dimers. Human immunodeficiency virus, HIV, the 
causative agent of AIDS (also discussed in Chapter 14), is enveloped by a spherical 
shell composed of hundreds of coat protein subunits, a large-scale, but closed, qua- 
ternary association. 


3.5- to 4.0-nm 
subunit 





FIGURE 6.46 The structure of a typical microtubule, 
showing the arrangement of the a- and B-monomers of 
the tubulin dimer. 


There Are Structural and Functional Advantages 
to Quaternary Association 


There are several important consequences when protein subunits associate in 
oligomeric structures. 


Stability One general benefit of subunit association is a favorable reduction of 
the protein’s surface-to-volume ratio. The surface-to-volume ratio becomes smaller 
as the radius of any particle or object becomes larger. (This is because surface area 
is a function of the radius squared and volume is a function of the radius cubed.) 
Because interactions within the protein usually tend to stabilize the protein ener- 
getically and because the interaction of the protein surface with solvent water is of- 
ten energetically unfavorable, decreased surface-to-volume ratios usually result in 
more stable proteins. Subunit association may also serve to shield hydrophobic 
residues from solvent water. Subunits that recognize either themselves or other sub- 
units avoid any errors arising in genetic translation by binding mutant forms of the 
subunits less tightly. 


Genetic Economy and Efficiency Oligomeric association of protein monomers is 
genetically economical for an organism. Less DNA is required to code for a 
monomer that assembles into a homomultimer than for a large polypeptide of the 
same molecular mass. Another way to look at this is to realize that virtually all of 
the information that determines oligomer assembly and subunit-subunit interac- 
tion is contained in the genetic material needed to code for the monomer. For ex- 
ample, HIV protease, an enzyme that is a dimer of identical subunits, performs a 
catalytic function similar to homologous cellular enzymes that are single polypep- 
tide chains of twice the molecular mass (see Chapter 14). 
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HUMAN BIOCHEMISTRY 


Faster-Acting Insulin: Genetic Engineering Solves a Quaternary Structure Problem 


Insulin is a peptide hormone secreted by the pancreas that regu- 
lates glucose metabolism in the body. Insufficient production of 
insulin or failure of insulin to stimulate target sites in liver, muscle, 
and adipose tissue leads to the serious metabolic disorder known 
as diabetes mellitus. Diabetes afflicts millions of people worldwide. 
Diabetic individuals typically exhibit high levels of glucose in the 
blood, but insulin injection therapy allows these individuals to 
maintain normal levels of blood glucose. 

Insulin is composed of two peptide chains covalently linked by 
disulfide bonds (see Figure 5.8). This “monomer” of insulin is 
the active form that binds to receptors in target cells. However, 
in solution, insulin spontaneously forms dimers, which them- 
selves aggregate to form hexamers. The surface of the insulin 
molecule that self-associates to form hexamers is also the surface 
that binds to insulin receptors in target cells. Thus, hexamers of 
insulin are inactive. 

Insulin released from the pancreas is monomeric and acts 
rapidly at target tissues. However, when insulin is administered (by 
injection) to a diabetic patient, the insulin hexamers dissociate 


In 1988, G. Dodson showed that insulin could be genetically 
engineered to prefer the monomeric (active) state. Dodson and 
his colleagues used recombinant DNA technology (discussed in 
Chapter 12) to produce insulin with an aspartate residue replac- 
ing a proline at the contact interface between adjacent subunits. 
The negative charge on the Asp side chain creates electrostatic 
repulsion between subunits and increases the dissociation con- 
stant for the hexamer == monomer equilibrium. Injection of 
this mutant insulin into test animals produced more rapid de- 
creases in blood glucose than did ordinary insulin. This mutant in- 
sulin, marketed by the Danish pharmaceutical company Novo as 
NovoLog in the United States and as NovoRapid in Europe, may 
eventually replace ordinary insulin in the treatment of diabetes. 
NovoLog has a faster rate of absorption, a faster onset of action, 
and a shorter duration of action than regular human insulin. It is 
particularly suited for mealtime dosing to control postprandial 
glycemia, the rise in blood sugar following consumption of food. 
Regular human insulin acts more slowly, so patients must usually 
administer it 30 minutes before eating. 


slowly and the patient’s blood glucose levels typically drop slowly 
(over several hours). 











Bringing Catalytic Sites Together Many enzymes (see Chapters 13 to 15) derive 
at least some of their catalytic power from oligomeric associations of monomer sub- 
units. This can happen in several ways. The monomer may not constitute a com- 
plete enzyme active site. Formation of the oligomer may bring all the necessary cat- 
alytic groups together to form an active enzyme. For example, the active sites of 
bacterial glutamine synthetase are formed from pairs of adjacent subunits. The dis- 
sociated monomers are inactive. 

Oligomeric enzymes may also carry out different but related reactions on differ- 
ent subunits. Thus, tryptophan synthase is a tetramer consisting of pairs of different 
subunits, a8». Purified a-subunits catalyze the following reaction: 


Indoleglycerol phosphate——= indole + glyceraldehyde-3-phosphate 


and the -subunits catalyze this reaction: 





Indole + L-serine 





L-tryptophan 


Indole, the product of the a@-reaction and the reactant for the 6-reaction, is passed 
directly from the a-subunit to the -subunit and cannot be detected as a free inter- 
mediate. 


Cooperativity There is another, more important consequence when monomer 
subunits associate into oligomeric complexes. Most oligomeric enzymes regulate 
catalytic activity by means of subunit interactions, which may give rise to coopera- 
tive phenomena. Multisubunit proteins typically possess multiple binding sites for a 
given ligand. If the binding of ligand at one site changes the affinity of the protein 
for ligand at the other binding sites, the binding is said to be cooperative. Infor- 
mation transfer in this manner across long distances in proteins is termed allostery, 
literally “action at another site.” Increases in affinity at subsequent sites represent 
positive cooperativity, whereas decreases in affinity correspond to negative cooper- 
ativity. The points of contact between protein subunits provide a mechanism for this 
signal transduction through the protein structure and for communication between 
the subunits. This in turn provides a way in which the binding of ligand to one sub- 
unit can influence the binding behavior at the other subunits. Such cooperative be- 
havior, discussed in greater depth in Chapter 15, is the underlying mechanism for 
regulation of many biological processes. 
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SUMMARY 


6.1 What Noncovalent Interactions Stabilize Protein Structure? Sev- 
eral different kinds of noncovalent interactions are of vital importance in 
protein structure. Hydrogen bonds, hydrophobic interactions, electrosta- 
tic bonds, and van der Waals forces are all noncovalent in nature yet are 
extremely important influences on protein conformations. The stabiliza- 
tion free energies afforded by each of these interactions are highly de- 
pendent on the local environment within the protein. 

Hydrogen bonds are generally made wherever possible within a 
given protein structure. Hydrophobic interactions form because non- 
polar side chains of amino acids and other nonpolar solutes prefer to 
cluster in a nonpolar environment rather than to intercalate in a polar 
solvent such as water. Electrostatic interactions include the attraction 
between opposite charges and the repulsion of like charges in the pro- 
tein. Van der Waals interactions involve instantaneous dipoles and in- 
duced dipoles that arise because of fluctuations in the electron charge 
distributions of adjacent nonbonded atoms. 


6.2 What Role Does the Amino Acid Sequence Play in Protein Struc- 
ture? All of the information necessary for folding the peptide chain 
into its “native” structure is contained in the amino acid sequence of the 
peptide. Just how proteins recognize and interpret the information that 
is stored in the polypeptide sequence is not yet well understood. It may 
be assumed that certain loci along the peptide chain act as nucleation 
points, which initiate folding processes that eventually lead to the cor- 
rect structures. Regardless of how this process operates, it must take the 
protein correctly to the final native structure, without getting trapped in 
a local energy-minimum state, which, although stable, may be different 
from the native state itself. 


6.3 What Are the Elements of Secondary Structure in Proteins, and 
How Are They Formed? Secondary structure in proteins forms so as to 
maximize hydrogen bonding and maintain the planar nature of the 


peptide bond. Secondary structures include a-helices, B-sheets, and 
tight turns. 


6.4 How Do Polypeptides Fold into Three-Dimensional Protein Struc- 
tures? First, secondary structures—helices and sheets—form when- 
ever possible as a consequence of the formation of large numbers of hy- 
drogen bonds. Second, a-helices and -sheets often associate and pack 
close together in the protein. There are a few common methods for 
such packing to occur. Third, because the peptide segments between 
secondary structures in the protein tend to be short and direct, the pep- 
tide does not execute complicated twists and knots as it moves from one 
region of a secondary structure to another. A consequence of these 
three principles is that protein chains are usually folded so that the sec- 
ondary structures are arranged in one of a few common patterns. For 
this reason, there are families of proteins that have similar tertiary struc- 
ture, with little apparent evolutionary or functional relationship among 
them. Finally, proteins generally fold so as to form the most stable struc- 
tures possible. The stability of most proteins arises from (1) the forma- 
tion of large numbers of intramolecular hydrogen bonds and (2) the re- 
duction in the surface area accessible to solvent that occurs upon 
folding. 


6.5 How Do Protein Subunits Interact at the Quaternary Level of Protein 
Structure? The subunits of an oligomeric protein typically fold into ap- 
parently independent globular conformations and then interact with 
other subunits. The particular surfaces at which protein subunits interact 
are similar in nature to the interiors of the individual subunits. These in- 
terfaces are closely packed and involve both polar and hydrophobic in- 
teractions. Interacting surfaces must therefore possess complementary 
arrangements of polar and hydrophobic groups. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. The central rod domain of a keratin protein is approximately 
312 residues in length. What is the length (in A) of the keratin 
rod domain? If this same peptide segment were a true a-helix, 
how long would it be? If the same segment were a £-sheet, what 
would its length be? 

2. A teenager can grow 4 inches in a year during a “growth spurt.” As- 
suming that the increase in height is due to vertical growth of col- 
lagen fibers (in bone), calculate the number of collagen helix turns 
synthesized per minute. 

3. Discuss the potential contributions to hydrophobic and van der 
Waals interactions and ionic and hydrogen bonds for the side 
chains of Asp, Leu, Tyr, and His in a protein. 

4. Pro is the amino acid least commonly found in a-helices but most 
commonly found in 6-turns. Discuss the reasons for this behavior. 

5. For flavodoxin (pdb id = 5NLL), identify the right-handed cross- 
overs and the left-handed cross-overs in the parallel 6-sheet. 

6. Choose any three regions in the Ramachandran plot and discuss 
the likelihood of observing that combination of ¢ and jf in a pep- 
tide or protein. Defend your answer using suitable molecular mod- 
els of a peptide. 

7. A new protein of unknown structure has been purified. Gel filtra- 
tion chromatography reveals that the native protein has a molecu- 
lar weight of 240,000. Chromatography in the presence of 6 M 
guanidine hydrochloride yields only a peak for a protein of M, 
60,000. Chromatography in the presence of 6 M guanidine hydro- 
chloride and 10 mM -mercaptoethanol yields peaks for proteins of 
M, 34,000 and 26,000. Explain what can be determined about the 
structure of this protein from these data. 


8. Two polypeptides, A and B, have similar tertiary structures, but A 
normally exists as a monomer, whereas B exists as a tetramer, B4. 
What differences might be expected in the amino acid composition 
of A versus B? 

9. The hemagglutinin protein in influenza virus contains a remarkably 
long a-helix, with 53 residues. 

a. How long is this @-helix (in nm)? 

b. How many turns does this helix have? 

c. Each residue in an a-helix is involved in two H bonds. How many 
H bonds are present in this helix? 

10. It is often observed that Gly residues are conserved in proteins to a 
greater degree than other amino acids. From what you have learned 
in this chapter, suggest a reason for this observation. 

11. Which amino acids would be capable of forming H bonds with a 
lysine residue in a protein? 

12. Poly-L-glutamate adopts an a-helical structure at low pH but 
becomes a random coil above pH 5. Explain this behavior. 

13. Imagine that the dimensions of the alpha helix were such that there 
were exactly 3.5 amino acids per turn, instead of 3.6. What would be 
the consequences for coiled-coil structures? 


Preparing for the MCAT Exam 

14. Consider the following peptide sequences: 
EANQIDEMLYNVQCSLTTLEDTVPW 
LGVHLDITVPLSWIWTLYVKL 
QQNWGGLVWVILTLVWFLM 
CNMKHGDSQCDERTYP 
YTREQSDGHIPKMNCDS 
AGPFGPDGPTIGPK 
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Which of the preceding sequences would be likely to be found in 
each of the following: 
a. A parallel B-sheet 
b. An antiparallel B-sheet 
c. A tropocollagen molecule 
d. The helical portions of a protein found in your hair 

15. To fully appreciate the elements of secondary structure in proteins, 
it is useful to have a practical sense of their structures. On a piece 
of paper, draw a simple but large zigzag pattern to represent a 


B-strand. Then fill in the structure, drawing the locations of the 
atoms of the chain on this zigzag pattern. Then draw a simple, 
large coil on a piece of paper to represent an a-helix. Then fill in 
the structure, drawing the backbone atoms in the correction loca- 
tions along the coil and indicating the locations of the R groups in 
your drawing. 

16. The dissociation constant for a particular protein dimer is 1 micro- 
molar. Calculate the free energy difference for the monomer to 
dimer transition. 
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Carbohydrates 
and the Glycoconjugates 
of Cell Surfaces 





ESSENTIAL QUESTION 


Carbohydrates are a versatile class of molecules of the formula (CH,0),,. They are a 
major form of stored energy in organisms, and they are the metabolic precursors of 
virtually all other biomolecules. Conjugates of carbohydrates with proteins and 
lipids perform a variety of functions, including recognition events that are important 
in cell growth, transformation, and other processes. 


What is the structure, chemistry, and biological function of carbohydrates? 


Carbohydrates are the single most abundant class of organic molecules found in na- 
ture. Energy from the sun captured by green plants, algae, and some bacteria dur- 
ing photosynthesis (see Chapter 21) converts more than 250 billion kilograms of 
carbon dioxide into carbohydrates every day on earth. In turn, carbohydrates are 
the metabolic precursors of virtually all other biomolecules. Breakdown of carbo- 
hydrates provides the energy that sustains animal life. In addition, carbohydrates 
are covalently linked with a variety of other molecules. These glycoconjugates are 
important components of cell walls and extracellular structures in plants, animals, 
and bacteria. In addition to the structural roles such molecules play, they also serve 
in a variety of processes involving recognition between cell types or recognition of cel- 
lular structures by other molecules. Recognition events are important in normal cell 
growth, fertilization, transformation of cells, and other processes. 

All of these functions are made possible by the characteristic chemical features 
of carbohydrates: 


the existence of at least one and often two or more asymmetric centers 

the ability to exist either in linear or ring structures 

the capacity to form polymeric structures via glycosidic bonds 

the potential to form multiple hydrogen bonds with water or other molecules 
in their environment. 


Tala How Are Carbohydrates Named? 


The name carbohydrate arises from the basic molecular formula (CH»:O),, where 
n = 3 or more. (CH0), can be rewritten (C-H».O),, to show that these substances 
are hydrates of carbon. 

Carbohydrates are generally classified into three groups: monosaccharides (and 
their derivatives), oligosaccharides, and polysaccharides. The monosaccharides 
are also called simple sugars and have the formula (CH2O),. Monosaccharides 
cannot be broken down into smaller sugars under mild conditions. Oligosaccha- 
rides derive their name from the Greek word oligo, meaning “few,” and consist of 
from two to ten simple sugar residues. Disaccharides are common in nature, and 
trisaccharides also occur frequently. Four- to six-sugar-unit oligosaccharides are 
usually bound covalently to other molecules, including glycoproteins. As their 
name suggests, polysaccharides are polymers of the simple sugars and their deriv- 
atives. They may be either linear or branched polymers and may contain hundreds 
or even thousands of monosaccharide units. Their molecular weights range up to 
1 million or more. 
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“The Discovery of Honey’—Piero di Cosimo (1492). 





Sugar in the gourd and honey in the horn, 
I never was so happy since the hour I was 
born. 


Turkey in the Straw, stanza 6 
(classic American folk tune) 





KEY QUESTIONS 


7.1 How Are Carbohydrates Named? 


7.2 What Is the Structure and Chemistry 
of Monosaccharides? 


7.3 What Is the Structure and Chemistry 
of Oligosaccharides? 


7.4 What Is the Structure and Chemistry 
of Polysaccharides? 





7.5 What Are Glycoproteins, and How Do They 
Function in Cells? 


7.6 How Do Proteoglycans Modulate Processes 
in Cells and Organisms? 


7.7 Do Carbohydrates Provide a Structural 
Code? 
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7:20 What Is the Structure and Chemistry of Monosaccharides? 


Monosaccharides Are Classified as Aldoses and Ketoses 


Monosaccharides consist typically of three to seven carbon atoms and are described 
either as aldoses or ketoses, depending on whether the molecule contains an alde- 
hyde function or a ketone group. The simplest aldose is glyceraldehyde, and the 
simplest ketose is dihydroxyacetone (Figure 7.1). These two simple sugars are 
termed trioses because they each contain three carbon atoms. The structures and 
names of a family of aldoses and ketoses with three, four, five, and six carbons are 
shown in Figures 7.2 and 7.3. Hexoses are the most abundant sugars in nature. Nev- 
ertheless, sugars from all these classes are important in metabolism. 
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FIGURE 7.2 The structure and stereochemical relationships of D-aldoses with three to six carbons. The configu- 
ration in each case is determined by the highest numbered asymmetric carbon (shown in pink). In each row, 


the “new” asymmetric carbon is shown in yellow. Blue highlights indicate the most common aldoses. 
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6 CH,OH CH,OH CH,OH eee asymmetric carbon (shown in pink). In each row, 
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Monosaccharides, either aldoses or ketoses, are often given more detailed 
generic names to describe both the important functional groups and the total 
number of carbon atoms. Thus, one can refer to aldotetroses and ketotetroses, aldo- 
pentoses and ketopentoses, aldohexoses and ketohexoses, and so on. Sometimes the 
ketone-containing monosaccharides are named simply by inserting the letters -ul- 
into the simple generic terms, such as fetruloses, pentuloses, hexuloses, heptuloses, and 
so on. The simplest monosaccharides are water soluble, and most taste sweet. 


Stereochemistry Is a Prominent Feature of Monosaccharides 


Aldoses with at least three carbons and ketoses with at least four carbons contain 
chiral centers (see Chapter 4). The nomenclature for such molecules must specify 
the configuration about each asymmetric center, and drawings of these molecules 
must be based on a system that clearly specifies these configurations. As noted in 
Chapter 4, the Fischer projection system is used almost universally for this purpose 
today. The structures shown in Figures 7.2 and 7.3 are Fischer projections. For 
monosaccharides with two or more asymmetric carbons, the prefix D or L refers to 
the configuration of the highest numbered asymmetric carbon (the asymmetric car- 
bon farthest from the carbonyl carbon). A monosaccharide is designated D if the 
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FIGURE 7.4 D-Fructose and L-fructose, an enantiomeric 
pair. Note that changing the configuration only at Cs 
would change D-fructose to L-sorbose. 
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hydroxyl group on the highest numbered asymmetric carbon is drawn to the right in 
a Fischer projection, as in D-glyceraldehyde (Figure 7.1). Note that the designation 
D or L merely relates the configuration of a given molecule to that of glyceraldehyde 
and does not specify the sign of rotation of plane-polarized light. If the sign of opti- 
cal rotation is to be specified in the name, the convention of D or L designations may 
be used along with a + (plus) or — (minus) sign. Thus, D-glucose (Figure 7.2) may 
also be called D(+)-glucose because it is dextrorotatory, whereas D-fructose 
(Figure 7.3), which is levorotatory, can also be named D(—)-fructose. 

All of the structures shown in Figures 7.2 and 7.3 are D-configurations, and the 
D-forms of monosaccharides predominate in nature, just as L-amino acids do. These 
preferences, established in apparently random choices early in evolution, persist uni- 
formly in nature because of the stereospecificity of the enzymes that synthesize and 
metabolize these small molecules. -Monosaccharides do exist in nature, serving a 
few relatively specialized roles. L-Galactose is a constituent of certain polysaccharides, 
and L-arabinose is a constituent of bacterial cell walls. 

According to convention, the D- and L-forms of a monosaccharide are mirror im- 
ages of each other, as shown in Figure 7.4 for fructose. Stereoisomers that are mir- 
ror images of each other are called enantiomers, or sometimes enantiomeric pairs. 
For molecules that possess two or more chiral centers, more than two stereoisomers 
can exist. Pairs of isomers that have opposite configurations at one or more of the 
chiral centers but that are not mirror images of each other are called diastereomers 
or diastereomeric pairs. Any two structures in a given row in Figures 7.2 and 7.3 are 
diastereomeric pairs. Two sugars that differ in configuration at only one chiral cen- 
ter are described as epimers. For example, D-mannose and D-talose are epimers and 
D-glucose and D-mannose are epimers, whereas D-glucose and D-talose are not 
epimers but merely diastereomers. 


Monosaccharides Exist in Cyclic and Anomeric Forms 


Although Fischer projections are useful for presenting the structures of particular 
monosaccharides and their stereoisomers, they discount one of the most interesting 
facets of sugar structure—the ability to form cyclic structures with formation of an addi- 
tional asymmetric center. Alcohols react readily with aldehydes to form hemiacetals 
(Figure 7.5). The British carbohydrate chemist Sir Norman Haworth showed that 
the linear form of glucose (and other aldohexoses) could undergo a similar in- 
tramolecular reaction to form a cyclic hemiacetal. The resulting six-membered, oxygen- 
containing ring is similar to pyran and is designated a pyranose. The reaction is cat- 
alyzed by acid (H*) or base (OH) and is readily reversible. 

In a similar manner, ketones can react with alcohols to form hemiketals. The 
analogous intramolecular reaction of a ketose sugar such as fructose yields a cyclic 
hemiketal (Figure 7.6). The five-membered ring thus formed is reminiscent of furan 
and is referred to as a furanose. The cyclic pyranose and furanose forms are the pre- 
ferred structures for monosaccharides in aqueous solution. At equilibrium, the lin- 
ear aldehyde or ketone structure is only a minor component of the mixture (gen- 
erally much less than 1%). 

When hemiacetals and hemiketals are formed, the carbon atom that carried the 
carbonyl function becomes an asymmetric carbon atom. Isomers of monosaccharides 
that differ only in their configuration about that carbon atom are called anomers, des- 
ignated as a or B, as shown in Figure 7.5, and the carbonyl carbon is thus called the 
anomeric carbon. When the hydroxyl group at the anomeric carbon is on the same side 
of a Fischer projection as the oxygen atom at the highest numbered asymmetric car- 
bon, the configuration at the anomeric carbon is a, as in a-D-glucose. When the 
anomeric hydroxyl is on the opposite side of the Fischer projection, the configuration is 
B, as in B-b-glucopyranose (Figure 7.5). 

The addition of this asymmetric center upon hemiacetal and hemiketal formation 
alters the optical rotation properties of monosaccharides, and the original assign- 
ment of the a and £ notations arose from studies of these properties. Early carbo- 
hydrate chemists frequently observed that the optical rotation of glucose (and other 
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CENGAGENOW” ANIMATED FIGURE 7.5 The linear form of D-glucose undergoes an intramolecular reaction to 
form a cyclic hemiacetal. See this figure animated at www.cengage.com/login. 


sugar) solutions could change with time, a process called mutarotation. This indi- 
cated that a structural change was occurring. It was eventually found that a-D-glucose 
has a specific optical rotation, [a] >”, of 112.2°, and that §-b-glucose has a specific op- 
tical rotation of 18.7°. Mutarotation involves interconversion of a- and B-forms of the 
monosaccharide with intermediate formation of the linear aldehyde or ketone, as 
shown in Figures 7.5 and 7.6. 


Haworth Projections Are a Convenient Device for Drawing Sugars 


Another of Haworth’s lasting contributions to the field of carbohydrate chemistry 
was his proposal to represent pyranose and furanose structures as hexagonal and 
pentagonal rings lying perpendicular to the plane of the paper, with thickened 
lines indicating the side of the ring closest to the reader. Such Haworth projec- 
tions, which are now widely used to represent saccharide structures (Figures 7.5 
and 7.6), show substituent groups extending either above or below the ring. Sub- 
stituents drawn to the left in a Fischer projection are drawn above the ring in the 
corresponding Haworth projection. Substituents drawn to the right in a Fischer 
projection are below the ring in a Haworth projection. Exceptions to these rules 
occur in the formation of furanose forms of pentoses and the formation of fura- 
nose or pyranose forms of hexoses. In these cases, the structure must be redrawn 
with a rotation about the carbon whose hydroxyl group is involved in the forma- 
tion of the cyclic form (Figure 7.7) in order to orient the appropriate hydroxyl 
group for ring formation. This is merely for illustrative purposes and involves no 
change in configuration of the saccharide molecule. 

The rules previously mentioned for assignment of a- and 6-configurations can be 
readily applied to Haworth projection formulas. For the D-sugars, the anomeric hy- 
droxyl group is below the ring in the a-anomer and above the ring in the B-anomer. 
For L-sugars, the opposite relationship holds. 
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CENGAGENOW” ANIMATED FIGURE 7.6 The linear form of D-fructose undergoes an intramolecular reaction to 
form a cyclic hemiketal. See this figure animated at www.cengage.com/login. 


As Figure 7.7 implies, in most monosaccharides there are two or more hydroxyl 
groups that can react with an aldehyde or ketone at the other end of the molecule 
to form a hemiacetal or hemiketal. Consider the possibilities for glucose, as shown 
in Figure 7.7. If the C-4 hydroxyl group reacts with the aldehyde of glucose, a five- 
membered ring is formed, whereas if the C-5 hydroxyl reacts, a six-membered ring 
is formed. The C-6 hydroxyl does not react effectively because a seven-membered 
ring is too strained to form a stable hemiacetal. The same is true for the C-2 and 
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CENGAGENOW” ANIMATED FIGURE 7.7 D-Glucose, D-ribose, and other simple sugars can cyclize in two ways, 
forming either furanose or pyranose structures. See this figure animated at www.cengage.com/login. 
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C-3 hydroxyls, and thus five- and six-membered rings are by far the most likely to be 
formed from six-membered monosaccharides. D-Ribose, with five carbons, readily 
forms either five-membered rings (a- or 6-D-ribofuranose) or six-membered rings 
(a- or B-D-ribopyranose) (Figure 7.7). In general, aldoses and ketoses with five or 
more carbons can form either furanose or pyranose rings, and the more stable form 
depends on structural factors. The nature of the substituent groups on the car- 
bonyl and hydroxyl groups and the configuration about the asymmetric carbon will 
determine whether a given monosaccharide prefers the pyranose or furanose 
structure. In general, the pyranose form is favored over the furanose ring for al- 
dohexose sugars, although, as we shall see, furanose structures are more stable for 
ketohexoses. 

Although Haworth projections are convenient for displaying monosaccharide 
structures, they do not accurately portray the conformations of pyranose and fura- 
nose rings. Given C—C—C tetrahedral bond angles of 109° and C—O—C angles 
of 111°, neither pyranose nor furanose rings can adopt true planar structures. In- 
stead, they take on puckered conformations, and in the case of pyranose rings, the 
two favored structures are the chair conformation and the boat conformation, 
shown in Figure 7.8. Note that the ring substituents in these structures can be equa- 
torial, which means approximately coplanar with the ring, or axial, that is, parallel 
to an axis drawn through the ring as shown. Two general rules dictate the confor- 
mation to be adopted by a given saccharide unit. First, bulky substituent groups on 
such rings are more stable when they occupy equatorial positions rather than axial 
positions, and second, chair conformations are slightly more stable than boat con- 
formations. For a typical pyranose, such as B-D-glucose, there are two possible chair 
conformations (Figure 7.8). Of all the D-aldohexoses, B-b-glucose is the only one 
that can adopt a conformation with all its bulky groups in an equatorial position. 
With this advantage of stability, it may come as no surprise that 6-D-glucose is the 
most widely occurring organic group in nature and the central hexose in carbohy- 
drate metabolism. 


Monosaccharides Can Be Converted to Several Derivative Forms 


A variety of chemical and enzymatic reactions produce derivatives of the simple sug- 
ars. These modifications produce a diverse array of saccharide derivatives. Some of 
the most common derivations are discussed here. 


Sugar Acids Sugars with free anomeric carbon atoms are reasonably good reduc- 
ing agents and will reduce hydrogen peroxide, ferricyanide, certain metals (Cu?* 
and Ag*), and other oxidizing agents. Such reactions convert the sugar to a sugar 


a = axial bond 
e = equatorial bond 





Chair Boat 


(b) 











FIGURE 7.8 (a) Chair and boat conformations of a pyra- 
nose sugar. (b) Two possible chair conformations of 
B-b-glucose. 
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acid. For example, addition of alkaline CuSO, (called Fehling’s solution) to an aldose 
sugar produces a red cuprous oxide (CuO) precipitate: 


1 i 
RC—H + 2Cu?* + 5 OH” —> RC—O™ + CwO} + 3 HO 
Aldehyde Carboxylate 


and converts the aldose to an aldonic acid, such as gluconic acid (Figure 7.9). For- 
mation of a precipitate of red CugO constitutes a positive test for an aldehyde. 
Carbohydrates that can reduce oxidizing agents in this way are referred to as 
reducing sugars. By quantifying the amount of oxidizing agent reduced by a sugar 
solution, one can accurately determine the concentration of the sugar. Diabetes 
mellitus is a condition that causes high levels of glucose in urine and blood, and 
frequent analysis of reducing sugars in diabetic patients is an important part of 
the diagnosis and treatment of this disease. Over-the-counter kits for the easy and 
rapid determination of reducing sugars have made this procedure a simple one 
for diabetic persons. 

Monosaccharides can be oxidized enzymatically at C-6, yielding uronic acids, 
such as D-glucuronic and L-iduronic acids (Figure 7.9). L-Iduronic acid is similar to 
p-glucuronic acid, except it has an opposite configuration at C-5. Oxidation at both 
C-1 and C-6 produces aldaric acids, such as p-glucaric acid. 
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Sugar Alcohols Sugar alcohols, another class of sugar derivative, can be prepared 
by the mild reduction (with NaBH, or similar agents) of the carbonyl groups of al- 
doses and ketoses. Sugar alcohols, or alditols, are designated by the addition of -itol 
to the name of the parent sugar (Figure 7.10). The alditols are linear molecules that 
cannot cyclize in the manner of aldoses. Nonetheless, alditols are characteristically 
sweet tasting, and sorbitol, mannitol, and xylitol are widely used to sweeten sugarless 
gum and mints (Figure 7.11). Sorbitol buildup in the eyes of diabetic persons is im- 
plicated in cataract formation. Glycerol and myo-inositol, a cyclic alcohol, are com- 
ponents of lipids (see Chapter 8). There are nine different stereoisomers of inositol; 
the one shown in Figure 7.10 was first isolated from heart muscle and thus has the 
prefix myo- for muscle. Ribitol is a constituent of flavin coenzymes (see Chapter 17). 


Deoxy Sugars The deoxy sugars are monosaccharides with one or more hydroxyl 
groups replaced by hydrogens. 2-Deoxy-D-ribose (Figure 7.12), whose systematic name 
is 2-deoxy-D-erythropentose, is a constituent of DNA in all living things (see Chapter 
10). Deoxy sugars also occur frequently in glycoproteins and polysaccharides. L-Fucose 
and L-+rhamnose, both 6-deoxy sugars, are components of some cell walls, and rham- 
nose is a component of ouabain, a highly toxic cardiac glycoside found in the bark and 
root of the ouabaio tree. Ouabain is used by the East African Somalis as an arrow poi- 
son. The sugar moiety is not the toxic part of the molecule (see Chapter 9). 


Sugar Esters Phosphate esters of glucose, fructose, and other monosaccharides 
are important metabolic intermediates, and the ribose moiety of nucleotides such as 
ATP and GTP is phosphorylated at the 5’-position (Figure 7.13). 
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FIGURE 7.12 Several deoxy sugars. Hydrogen and carbon atoms highlighted in red are “deoxy” positions. 
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FIGURE 7.10 Structures of some sugar alcohols. (Note 
that myo-inositol is a polyhydroxy cyclohexane, not a 
sugar alcohol.) 
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FIGURE 7.11 Sugar alcohols such as sorbitol, mannitol, 
and xylitol sweeten many “sugarless” gums and candies. 
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FIGURE 7.13 Several sugar esters important in metabolism. 
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A DEEPER LOOK 


Honey—An Ancestral Carbohydrate Treat 


Honey, the first sweet known to humankind, is the only sweeten- 
ing agent that can be stored and used exactly as produced in na- 
ture. Bees process the nectar of flowers so that their final product 
is able to survive long-term storage at ambient temperature. Used 
as a ceremonial material and medicinal agent in earliest times, 
honey was not regarded as a food until the Greeks and Romans. 
Only in modern times have cane and beet sugar surpassed honey 
as the most frequently used sweetener. What is the chemical na- 
ture of this magical, viscous substance? 

The bees’ processing of honey consists of (1) reducing the wa- 
ter content of the nectar (30% to 60%) to the self-preserving 
range of 15% to 19%, (2) hydrolyzing the significant amount of 
sucrose in nectar to glucose and fructose by the action of the en- 
zyme invertase, and (3) producing small amounts of gluconic 
acid from glucose by the action of the enzyme glucose oxidase. 
Most of the sugar in the final product is glucose and fructose, 
and the final product is supersaturated with respect to these 
monosaccharides. Honey actually consists of an emulsion of 
microscopic glucose hydrate and fructose hydrate crystals in a 
thick syrup. Sucrose accounts for only about 1% of the sugar in 
the final product, with fructose at about 38% and glucose at 31% 
by weight. 

The accompanying figure shows a °C nuclear magnetic reso- 
nance spectrum of honey from a mixture of wildflowers in south- 
eastern Pennsylvania. Interestingly, five major hexose species con- 
tribute to this spectrum. Although most textbooks show fructose 
exclusively in its furanose form, the predominant form of fructose 
(67% of total fructose) is B-p-fructopyranose, with the g- and 
a-fructofuranose forms accounting for 27% and 6% of the fruc- 





© Scott Camazine/Photo Researchers, Inc. 


tose, respectively. In polysaccharides, fructose invariably prefers 
the furanose form, but free fructose (and crystalline fructose) is 
predominantly 6-fructopyranose. 


Sources: White, J. W., 1978. Honey. Advances in Food Research 24:287-374; 
and Prince, R. C., Gunson, D. E., Leigh, J. S., and McDonald, G. G., 1982. 
The predominant form of fructose is a pyranose, not a furanose ring. 
Trends in Biochemical Sciences '7:239-240. 
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FIGURE 7.14 Structures of D-glucosamine and 
b-galactosamine. 


Amino Sugars Amino sugars, including D-glucosamine and D-galactosamine 
(Figure 7.14), contain an amino group (instead of a hydroxyl group) at the C-2 
position. They are found in many oligosaccharides and polysaccharides, includ- 
ing chitin, a polysaccharide in the exoskeletons of crustaceans and insects. 

Muramic acid and neuraminic acid, which are components of the polysaccha- 
rides of cell membranes of higher organisms and also bacterial cell walls, are 
glycosamines linked to three-carbon acids at the C-1 or C-3 positions. In muramic 
acid (thus named as an amine isolated from bacterial cell wall polysaccharides; 
murus is Latin for “wall”), the hydroxyl group of a lactic acid moiety makes an 
ether linkage to the C-3 of glucosamine. Neuraminic acid (an amine isolated from 
neural tissue) forms a C—C bond between the C-1 of N-acetylmannosamine and 
the C-3 of pyruvic acid (Figure 7.15). The N-acetyl and N-glycolyl derivatives of 
neuraminic acid are collectively known as sialic acids and are distributed widely 
in bacteria and animal systems. 
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FIGURE 7.15 Structures of (a) muramic acid and (b) several depictions of a sialic acid. 
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FIGURE 7.16 Acetals and ketals can be formed from hemiacetals and hemiketals, respectively. 


Acetals, Ketals, and Glycosides }Hemiacetals and hemiketals can react with alco- 
hols in the presence of acid to form acetals and ketals, as shown in Figure 7.16. This 
reaction is another example of a dehydration synthesis and is similar in this respect to 
the reactions undergone by amino acids to form peptides and nucleotides to form 
nucleic acids. The pyranose and furanose forms of monosaccharides react with al- 
cohols in this way to form glycosides with retention of the a- or -configuration at 
the C-1 carbon. The new bond between the anomeric carbon atom and the oxygen 
atom of the alcohol is called a glycosidic bond. Glycosides are named according to 
the parent monosaccharide. For example, methyl-B-b-glucoside (Figure 7.17) can be 
considered a derivative of B-D-glucose. 
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Given the relative complexity of oligosaccharides and polysaccharides in higher or- 
ganisms, it is perhaps surprising that these molecules are formed from relatively few 
different monosaccharide units. (In this respect, the oligosaccharides and polysac- 
charides are similar to proteins; both form complicated structures based on a small 
number of different building blocks.) Monosaccharide units include the hexoses 
glucose, fructose, mannose, and galactose and the pentoses ribose and xylose. 


Disaccharides Are the Simplest Oligosaccharides 


The simplest oligosaccharides are the disaccharides, which consist of two monosac- 
charide units linked by a glycosidic bond. As in proteins and nucleic acids, each in- 
dividual unit in an oligosaccharide is termed a residue. The disaccharides shown in 
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FIGURE 7.17 The anomeric forms of methyl-b-glucoside. 
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Figure 7.18 are all commonly found in nature, with sucrose, maltose, and lactose 
being the most common. Each is a mixed acetal, with one hydroxyl group provided 
intramolecularly and one hydroxyl from the other monosaccharide. Except for su- 
crose, each of these structures possesses one free unsubstituted anomeric carbon 
atom, and thus each of these disaccharides is a reducing sugar. The end of the mol- 
ecule containing the free anomeric carbon is called the reducing end, and the other 
end is called the nonreducing end. In the case of sucrose, both of the anomeric car- 
bon atoms are substituted, that is, neither has a free —OH group. The substituted 
anomeric carbons cannot be converted to the aldehyde configuration and thus can- 
not participate in the oxidation—-reduction reactions characteristic of reducing sug- 
ars. Thus, sucrose is not a reducing sugar. 

Maltose, isomaltose, and cellobiose are all homodisaccharides because they each 
contain only one kind of monosaccharide, namely, glucose. Maltose is produced 
from starch (a polymer of a-p-glucose produced by plants) by the action of amylase 
enzymes and is a component of malt, a substance obtained by allowing grain (par- 
ticularly barley) to soften in water and germinate. The enzyme diastase, produced 
during the germination process, catalyzes the hydrolysis of starch to maltose. Mal- 
tose is used in beverages (malted milk, for example), and because it is fermented 
readily by yeast, it is important in the brewing of beer. In both maltose and 
cellobiose, the glucose units are 1—4 linked, meaning that the C-1 of one glucose 
is linked by a glycosidic bond to the C-4 oxygen of the other glucose. The only dif- 
ference between them is in the configuration at the glycosidic bond. Maltose exists 
in the a-configuration, whereas cellobiose is a 6-configuration. Isomaltose is ob- 
tained in the hydrolysis of some polysaccharides (such as dextran), and cellobiose 
is obtained from the acid hydrolysis of cellulose. Isomaltose also consists of two glu- 
cose units in a glycosidic bond, but in this case, C-1 of one glucose is linked to C-6 
of the other, and the configuration is a. 

The complete structures of these disaccharides can be specified in shorthand no- 
tation by using abbreviations for each monosaccharide, a@ or 8, to denote configura- 
tion, and appropriate numbers to indicate the nature of the linkage. Thus, cellobiose 
is Glc61-4Glc, whereas isomaltose is Glca1—6Glc. Often the glycosidic linkage is writ- 
ten with an arrow so that cellobiose and isomaltose would be GlcB1—4Glc and 
Glcal—6Glc, respectively. Because the linkage carbon on the first sugar is always 
C-1, a newer trend is to drop the 1- or 1— and describe these simply as Glc84Glc and 
Glca6Glc, respectively. More complete names can also be used, however; for example, 
maltose would be O-a-b-glucopyranosyl-(1—4)-b-glucopyranose. Cellobiose, because 
of its B-glycosidic linkage, is formally O-6-D-glucopyranosyl-(1—4) -D-glucopyranose. 


A DEEPER LOOK 
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Trehalose—A Natural Protectant for Bugs 


Insects use an open circulatory system to circulate hemolymph (in- 
sect blood). The “blood sugar” is not glucose but rather trehalose, 
an unusual, nonreducing disaccharide (see figure). Trehalose is 
found typically in organisms that are naturally subject to tempera- 
ture variations and other environmental stresses—bacterial spores, 
fungi, yeast, and many insects. (Interestingly, honeybees do not 
have trehalose in their hemolymph, perhaps because they practice 
a colonial, rather than solitary, lifestyle. Bee colonies maintain a 
rather constant temperature of 18°C, protecting the residents from 
large temperature changes.) 

What might explain this correlation between trehalose utiliza- 


and thawing of biological tissues frequently causes irreversible 
structural changes, destroying biological activity. High concentra- 
tions of polyhydroxy compounds, such as sucrose and glycerol, can 
protect biological materials from such damage. Trehalose is par- 
ticularly well suited for this purpose and has been shown to be su- 
perior to other polyhydroxy compounds, especially at low concen- 
trations. Support for this novel idea comes from studies by Paul 
Attfield,t which show that trehalose levels in the yeast Saccharomyces 
cerevisiae increase significantly during exposure to high salt and 
high growth temperatures—the same conditions that elicit the 
production of heat shock proteins! 
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tion and environmentally stressful lifestyles? Konrad Bloch* sug- 


gests that trehalose may act as a natural cryoprotectant. Freezing H 
I 





CHOH H 
*Bloch, K., 1994. Blondes in Venetian Paintings, the Nine-Banded Armadillo, HO 
and Other Essays in Biochemistry. New Haven: Yale University Press. 

tAttfeld, P. V., 1987. Trehalose accumulates in Saccharomyces cerevisiae dur- 
ing exposure to agents that induce heat shock responses. FEBS Letters 1 
225:259. H 











b-D-Lactose (O-6-D-galactopyranosyl-(1—4)-D-glucopyranose) (Figure 7.18) is the 
principal carbohydrate in milk and is of critical nutritional importance to mammals 
in the early stages of their lives. It is formed from D-galactose and D-glucose via a 
B(1—4) link, and because it has a free anomeric carbon, it is capable of mutarotation 
and is a reducing sugar. It is an interesting quirk of nature that lactose cannot be ab- 
sorbed directly into the bloodstream. It must first be broken down into galactose and 
glucose by lactase, an intestinal enzyme that exists in young, nursing mammals but is 
not produced in significant quantities in the mature mammal. Most adult humans, 
with the exception of certain groups in Africa and northern Europe, produce only 
low levels of lactase. For most individuals, this is not a problem, but some cannot tol- 
erate lactose and experience intestinal pain and diarrhea upon consumption of milk. 

Sucrose, in contrast, is a disaccharide of almost universal appeal and tolerance. 
Produced by many higher plants and commonly known as table sugar, it is one of the 
products of photosynthesis and is composed of fructose and glucose. Sucrose has a 
specific optical rotation, [a],2°, of +66.5°, but an equimolar mixture of its compo- 
nent monosaccharides has a net negative rotation ([a] p?? of glucose is +52.5° and 
of fructose is —92°). Sucrose is hydrolyzed by the enzyme invertase, so named for 
the inversion of optical rotation accompanying this reaction. Sucrose is also easily 
hydrolyzed by dilute acid, apparently because the fructose in sucrose is in the rela- 
tively unstable furanose form. Although sucrose and maltose are important to the 
human diet, they are not taken up directly in the body. In a manner similar to lac- 
tose, they are first hydrolyzed by sucrase and maltase, respectively, in the human 
intestine. 


A Variety of Higher Oligosaccharides Occur in Nature 


In addition to the simple disaccharides, many other oligosaccharides are found in 
both prokaryotic and eukaryotic organisms, either as naturally occurring sub- 
stances or as hydrolysis products of natural materials. 

Oligosaccharides also occur widely as components (via glycosidic bonds) of an- 
tibiotics derived from various sources. Figure 7.19 shows the structures of two repre- 
sentative carbohydrate-containing antibiotics. 
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FIGURE 7.19 Some antibiotics are oligosaccharides or contain oligosaccharide groups. 
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Nomenclature for Polysaccharides Is Based on Their Composition 
and Structure 


By far the majority of carbohydrate material in nature occurs in the form of polysac- 
charides. By our definition, polysaccharides include not only those substances com- 
posed only of glycosidically linked sugar residues but also molecules that contain 
polymeric saccharide structures linked via covalent bonds to amino acids, peptides, 
proteins, lipids, and other structures. 

Polysaccharides, also called glycans, consist of monosaccharides and their de- 
rivatives. If a polysaccharide contains only one kind of monosaccharide molecule, 
it is a homopolysaccharide, or homoglycan, whereas those containing more than 
one kind of monosaccharide are heteropolysaccharides. The most common con- 
stituent of polysaccharides is D-glucose, but D-fructose, D-galactose, L-galactose, 
D-mannose, L-arabinose, and D-xylose are also common. Common monosaccha- 
ride derivatives in polysaccharides include the amino sugars (D-glucosamine and 
p-galactosamine), their derivatives (N-acetylneuraminic acid and N-acetylmuramic 
acid), and simple sugar acids (glucuronic and iduronic acids). Polysaccharides dif- 
fer not only in the nature of their component monosaccharides but also in the 
length of their chains and in the amount of chain branching that occurs. Although 
a given sugar residue has only one anomeric carbon and thus can form only one 
glycosidic linkage with hydroxyl groups on other molecules, each sugar residue 
carries several hydroxyls, one or more of which may be an acceptor of glycosyl sub- 
stituents (Figure 7.20). This ability to form branched structures distinguishes poly- 
saccharides from proteins and nucleic acids, which occur only as linear polymers. 


Polysaccharides Serve Energy Storage, Structure, 
and Protection Functions 


Polysaccharides function as storage materials, structural components, or protective 
substances. Thus, starch, glycogen, and other storage polysaccharides, as readily me- 
tabolizable food, provide energy reserves for cells. Chitin and cellulose provide 
strong support for the skeletons of arthropods and green plants, respectively. 
Mucopolysaccharides, such as the hyaluronic acids, form protective coats on animal 
cells. In each of these cases, the relevant polysaccharide is either a homopolymer 
or a polymer of small repeating units. Recent research indicates, however, that 
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CENGAGENOW” ANIMATED FIGURE 7.20 Amylose and amylopectin are the two forms of starch. Note that the 
linear linkages are a(1—>4) but the branches in amylopectin are æ(1—6). Branches in polysaccharides can 
involve any of the hydroxyl groups on the monosaccharide components. Amylopectin is a highly branched 
structure, with branches occurring every 12 to 30 residues. See this figure animated at www.cengage.com/ 
login. 


oligosaccharides and polysaccharides with varied structures may also be involved in 
much more sophisticated tasks in cells, including a variety of cellular recognition 
and intercellular communication events, as discussed later. 


Polysaccharides Provide Stores of Energy 


Organisms store carbohydrates in the form of polysaccharides rather than as 
monosaccharides to lower the osmotic pressure of the sugar reserves. Because os- 
motic pressures depend only on numbers of molecules, the osmotic pressure is 
greatly reduced by formation of a few polysaccharide molecules out of thousands 
(or even millions) of monosaccharide units. 


Starch By far the most common storage polysaccharide in plants is starch, which ex- 
ists in two forms: a@-amylose and amylopectin (Figure 7.20). Most forms of starch in 
nature are 10% to 30% a-amylose and 70% to 90% amylopectin. a-Amylose is com- 
posed of linear chains of D-glucose in a(1—4) linkages. The chains are of varying 
length, having molecular weights from several thousand to half a million. As can be 
seen from the structure in Figure 7.20, the chain has a reducing end and a nonre- 
ducing end. Although poorly soluble in water, a-amylose forms micelles in which the 
polysaccharide chain adopts a helical conformation (Figure 7.21). Iodine reacts with 
a-amylose to give a characteristic blue color, which arises from the insertion of iodine 
into the middle of the hydrophobic amylose helix. 

Amylopectin is a highly branched chain of glucose units (Figure 7.20). Branches 
occur in these chains every 12 to 30 residues. The average branch length is between 
24 and 30 residues, and molecular weights of amylopectin molecules can range up 
to 100 million. The linear linkages in amylopectin are a(1—4), whereas the branch 
linkages are a(1— 6). As is the case for a-amylose, amylopectin forms micellar sus- 
pensions in water; iodine reacts with such suspensions to produce a red-violet color. 

Starch is stored in plant cells in the form of granules in the stroma of plastids 
(plant cell organelles). When starch is to be mobilized and used by the plant that 
stored it, it is split into its monosaccharide elements by stepwise phosphorolytic 
cleavage of glucose units, a reaction catalyzed by starch phosphorylase (Figure 
7.22). The products are one molecule of glucose-1-phosphate and a starch molecule 
with one less glucose unit. In a-amylose, this process continues all along the chain 
until the end is reached. 


CH,OH CHOH o3 CH,OH CH,OH 





FIGURE 7.21 Suspensions of amylose in water adopt a 
helical conformation. lodine (l) can insert into the mid- 
dle of the amylose helix to give a blue color that is char- 
acteristic and diagnostic for starch. 
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CENGAGENOW” ANIMATED FIGURE 7.22 The starch phosphorylase reaction cleaves glucose residues from 
amylose, producing a-b-glucose-1-phosphate. See this figure animated at www.cengage.com/login. 


In animals, digestion and use of plant starches begin in the mouth with salivary 
a-amylase (a (1—4)-glucan 4-glucanohydrolase), the major enzyme secreted by the 
salivary glands. Although the capability of making and secreting salivary a-amylases 
is widespread in the animal world, some animals (such as cats, dogs, birds, and 
horses) do not secrete them. Salivary a-amylase is an endoamylase that splits 
a(1—4) glycosidic linkages only within the chain. Raw starch is not very susceptible 
to salivary endoamylase. However, when suspensions of starch granules are heated, 
the granules swell, taking up water and causing the polymers to become more ac- 
cessible to enzymes. Thus, cooked starch is more digestible. Most starch digestion 
occurs in the small intestine via glycohydrolases. 


Glycogen The major form of storage polysaccharide in animals is glycogen. Glyco- 
gen is found mainly in the liver (where it may amount to as much as 10% of liver 
mass) and skeletal muscle (where it accounts for 1% to 2% of muscle mass). Liver 
glycogen consists of granules containing highly branched molecules, with a (1—6) 
branches occurring every 8 to 12 glucose units. Like amylopectin, glycogen yields a 
red-violet color with iodine. Glycogen can be hydrolyzed by both a- and B-amylases, 
yielding glucose and maltose, respectively, as products and can also be hydrolyzed 
by glycogen phosphorylase, an enzyme present in liver and muscle tissue, to release 
glucose-1-phosphate. 


Dextran Another important family of storage polysaccharides is the dextrans, 
which are a@(1—6)-linked polysaccharides of D-glucose with branched chains found 
in yeast and bacteria. Because the main polymer chain is a(1—6) linked, the re- 
peating unit is isomaltose, Glcal—6Glc. The branch points may be 1—2, 1—3, or 
1—4 in various species. The degree of branching and the average chain length be- 
tween branches depend on the species and strain of the organism. Bacteria growing 
on the surfaces of teeth produce extracellular accumulations of dextrans, an impor- 
tant component of dental plaque. 


Polysaccharides Provide Physical Structure and Strength to Organisms 


Cellulose The structural polysaccharides have properties that are dramatically dif- 
ferent from those of the storage polysaccharides, even though the compositions of 
these two classes are similar. The structural polysaccharide cellulose is the most abun- 
dant natural polymer in the world. Found in the cell walls of nearly all plants, cellu- 
lose is one of the principal components providing physical structure and strength. 
The wood and bark of trees are insoluble, highly organized structures formed from 
cellulose and also from lignin (see Figure 25.35). It is awe-inspiring to look at a large 
tree and realize the amount of weight supported by polymeric structures derived from 
sugars and organic alcohols. Cellulose also has its delicate side, however. Cotton, 
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FIGURE 7.23 (a) Amylose, composed exclusively of the relatively bent a(1—4) linkages, prefers to adopt a 
helical conformation, whereas (b) cellulose, with B(1—4)-glycosidic linkages, can adopt a fully extended con- 
formation with alternating 180° flips of the glucose units. The hydrogen bonding inherent in such extended 
structures is responsible for the great strength of tree trunks and other cellulose-based materials. 


whose woven fibers make some of our most comfortable clothing fabrics, is almost 
pure cellulose. Derivatives of cellulose have found wide use in our society. Cellulose 
acetates are produced by the action of acetic anhydride on cellulose in the presence 
of sulfuric acid and can be spun into a variety of fabrics with particular properties. Re- 
ferred to simply as acetates, they have a silky appearance, a luxuriously soft feel, and a 
deep luster and are used in dresses, lingerie, linings, and blouses. 

Cellulose is a linear homopolymer of D-glucose units, just as in a-amylose. The 
structural difference, which completely alters the properties of the polymer, is that 
in cellulose the glucose units are linked by 6(1—4)-glycosidic bonds, whereas in 
a-amylose the linkage is a (1—4). The conformational difference between these two 
structures is shown in Figure 7.23. The a(1—4)-linkage sites of amylose are natu- 
rally bent, conferring a gradual turn to the polymer chain, which results in the he- 
lical conformation already described (Figure 7.21). The most stable conformation 
about the 6(1—4) linkage involves alternating 180° flips of the glucose units along 
the chain so that the chain adopts a fully extended conformation, referred to as an 
extended ribbon. Juxtaposition of several such chains permits efficient interchain 
hydrogen bonding, the basis of much of the strength of cellulose. 

The structure of one form of cellulose, determined by X-ray and electron dif- 
fraction data, is shown in Figure 7.24, The flattened sheets of the chains lie side 
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FIGURE 7.24 The structure of cellulose, showing the 
hydrogen bonds (blue) between the sheets, which 
strengthen the structure. Intrachain hydrogen bonds 
are in red, and interchain hydrogen bonds are in green. 
(Illustration: Irving Geis. Rights owned by Howard Hughes Med- 
ical Institute. Not to be reproduced without permission.) 
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FIGURE 7.25 Giraffes, cattle, deer, and camels are rumi- 
nant animals that are able to metabolize cellulose, 
thanks to bacterial cellulase in the rumen, a large first 
compartment in the stomach of a ruminant. 


by side and are joined by hydrogen bonds. These sheets are laid on top of one 
another in a way that staggers the chains, just as bricks are staggered to give 
strength and stability to a wall. Cellulose is extremely resistant to hydrolysis, 
whether by acid or by the digestive tract amylases described earlier. As a result, 
most animals (including humans) cannot digest cellulose to any significant 
degree. Ruminant animals, such as cattle, deer, giraffes, and camels, are an 
exception because bacteria that live in the rumen (Figure 7.25) secrete the 
enzyme cellulase, a -glucosidase effective in the hydrolysis of cellulose. The 
resulting glucose is then metabolized in a fermentation process to the benefit of 
the host animal. Termites and shipworms (Teredo navalis) similarly digest cellulose 
because their digestive tracts also contain bacteria that secrete cellulase. 


Chitin A polysaccharide that is similar to cellulose, both in its biological function 
and its primary, secondary, and tertiary structure, is chitin. Chitin is present in the 
cell walls of fungi and is the fundamental material in the exoskeletons of crustaceans, 
insects, and spiders. The structure of chitin, an extended ribbon, is identical to that 
of cellulose, except that the —OH group on each C-2 is replaced by —NHCOCH;, 
so the repeating units are N-acetyl-D-glucosamines in 6 (1—4) linkage. Like cellulose 
(Figure 7.24), the chains of chitin form extended ribbons (Figure 7.26) and pack 
side by side in a crystalline, strongly hydrogen-bonded form. One significant differ- 
ence between cellulose and chitin is whether the chains are arranged in parallel (all 
the reducing ends together at one end of a packed bundle and all the nonreducing 
ends together at the other end) or antiparallel (each sheet of chains having the 
chains arranged oppositely from the sheets above and below). Natural cellulose 
seems to occur only in parallel arrangements. Chitin, however, can occur in three 
forms, sometimes all in the same organism. a-Chitin is an all-parallel arrangement of 
the chains, whereas B-chitin is an antiparallel arrangement. In 6-chitin, the structure 
is thought to involve pairs of parallel sheets separated by single antiparallel sheets. 
Chitin is the earth’s second most abundant carbohydrate polymer (after cellu- 
lose), and its ready availability and abundance offer opportunities for industrial and 
commercial applications. Chitin-based coatings can extend the shelf life of fruits, 
and a chitin derivative that binds to iron atoms in meat has been found to slow the 
reactions that cause rancidity and flavor loss. Without such a coating, the iron in 
meats activates oxygen from the air, forming reactive free radicals that attack and 
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CENGAGENOW” ANIMATED FIGURE 7.26 Like 
cellulose, chitin and mannan form extended rib- 
bons and pack together efficiently, taking advan- 
tage of multiple hydrogen bonds. See this 
figure animated at www.cengage.com/login. 
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A Complex Polysaccharide in Red Wine—The Strange Story of Rhamnogalacturonan II 


For many years, cotton and grape growers and other farmers have 
known that boron is an essential trace element for their crops. Un- 
til recently, however, the role or roles of boron in sustaining plant 
growth were unknown. Recent reports show that at least one role 
for boron in plants is that of crosslinking an unusual polysaccha- 
ride called rhamnogalacturonan II (RGII). RGI is a low-molecular- 
weight (5 to 10 kDa) polysaccharide, but it is thought to be the 
most complex polysaccharide on earth, comprised as it is of 11 dif- 
ferent sugar monomers. It can be released from plant cell walls by 
treatment with a galacturonase, and it is also present in red wine. 
Part of the structure of RGII is shown in the accompanying figure. 
The nature of the borate ester crosslinks (also indicated in the fig- 
ure) was elucidated by Malcolm O’Neill and his colleagues, who 
used a combination of chemical methods and boron-11 NMR. 
Why is rhamnogalacturonan II essential for the structure and 
growth of plant walls? Plant walls are extremely sophisticated com- 
posite materials, composed of networks of protein, polysaccha- 
rides, and phenolic compounds. Cellulose microfibrils as strong as 
steel provide a load-bearing framework for the plant. These mi- 
crofibrils are tiny wires made of crystalline arrays of B-1,4-linked 


chains of glucose residues, which are extruded from hexameric 
spinnerets in the plasma membrane of the plant cell, surrounding 
the growing plant cell like hoops around a barrel. These micro- 
fibrils thus constrain the directions of cell expansion and deter- 
mine the shapes of the plant cells and the plant as well. The sepa- 
ration of the barrel hoops is controlled by hemicelluloses, such as 
xyloglucans, which form H-bonded crosslinks with the cellulose mi- 
crofibrils. The hemicellulose network is embedded in a hydrated 
gel inside the plant wall. This gel consists of complex galacturonic 
acid-rich polysaccharides, including RGIH—it provides a dynamic 
operating environment for cell wall processes. 

It is interesting to note that the tiny spinnerets of plant cells are 
nature’s version of the viscose process, developed in 1910, for the 
production of rayon fibers. In this process, viscose—literally 
a viscous solution of cellulose—is forced through a spinneret (a de- 
vice resembling a shower head with many tiny holes). Each hole 
produces a fine filament of viscose. The fibers precipitate in an 
acid bath and are stretched to form interchain H bonds that give 
the filaments the properties essential for use as textile fibers. 


RGII monomer 






HO 


HQ 
ae 
Ho HOCH, 
OH O 
HO 


a ©] 
O 
H,C, 

O 


O, 








@ Methyl groups 
@ Acetyl groups 














Source: Hofte, H., 2001. A baroque residue in red wine. Science 294:795-797. 
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FIGURE 7.27 The favored conformation of agarose in 
water is a double helix with a threefold screw axis. 


oxidize polyunsaturated lipids, causing most of the flavor loss associated with ran- 
cidity. Chitin-based coatings coordinate the iron atoms, preventing their interaction 
with oxygen. 


Agarose An important polysaccharide mixture isolated from marine red algae 
(Rhodophyceae) is agar, which consists of two components: agarose and agaropectin. 
Agarose (Figure 7.27) is a chain of alternating D-galactose and 3,6-anhydro-L- 
galactose, with side chains of 6-methyl-D-galactose. Agaropectin is similar, but in ad- 
dition, it contains sulfate ester side chains and D-glucuronic acid. The three- 
dimensional structure of agarose is a double helix with a threefold screw axis, as 
shown in Figure 7.27. The central cavity is large enough to accommodate water 
molecules. Agarose and agaropectin readily form gels containing large amounts 
(up to 99.5%) of water. 


Glycosaminoglycans A class of polysaccharides known as glycosaminoglycans is 
involved in a variety of extracellular (and sometimes intracellular) functions. Gly- 
cosaminoglycans consist of linear chains of repeating disaccharides in which one of 
the monosaccharide units is an amino sugar and one (or both) of the monosac- 
charide units contains at least one negatively charged sulfate or carboxylate group. 
The repeating disaccharide structures found commonly in glycosaminoglycans are 
shown in Figure 7.28. Heparin, with the highest net negative charge of the disac- 
charides shown, is a natural anticoagulant substance. It binds strongly to antithrom- 
bin IIT (a protein involved in terminating the clotting process) and inhibits blood 
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FIGURE 7.28 Glycosaminoglycans are formed from repeating disaccharide arrays. Glycosaminoglycans are 


components of the proteoglycans. 
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Billiard Balls, Exploding Teeth, and Dynamite—The Colorful History of Cellulose 


Although humans cannot digest it and most people’s acquain- 
tance with cellulose is limited to comfortable cotton clothing, cel- 
lulose has enjoyed a colorful and varied history of utilization. In 
1838, Théophile Pelouze in France found that paper or cotton 
could be made explosive if dipped in concentrated nitric acid. 
Christian Schénbein, a professor of chemistry at the University of 
Basel, prepared “nitrocotton” in 1845 by dipping cotton in a mix- 
ture of nitric and sulfuric acids and then washing the material to 
remove excess acid. In 1860, Major E. Schultze of the Prussian 
Army used the same material, now called guncotton, as a propel- 
lant replacement for gunpowder, and its preparation in brass 
cartridges quickly made it popular for this purpose. The only 
problem was that it was too explosive and could detonate unpre- 
dictably in factories where it was produced. The entire town of 
Faversham, England, was destroyed in such an accident. In 1868, 
Alfred Nobel mixed guncotton with ether and alcohol, thus 


In 1869, concerned over the precipitous decline (from hunt- 
ing) of the elephant population in Africa, the billiard ball man- 
ufacturers Phelan and Collander offered a prize of $10,000 for 
production of a substitute for ivory. Brothers Isaiah and John 
Hyatt in Albany, New York, produced a substitute for ivory by 
mixing guncotton with camphor, then heating and squeezing it 
to produce celluloid. This product found immediate uses well 
beyond billiard balls. It was easy to shape, strong, and resilient, 
and it exhibited a high tensile strength. Celluloid was eventually 
used to make dolls, combs, musical instruments, fountain pens, 
piano keys, and a variety of other products. The Hyatt brothers 
eventually formed the Albany Dental Company to make false 
teeth from celluloid. Because camphor was used in their pro- 
duction, the company advertised that their teeth smelled “clean,” 
but as reported in the New York Times in 1875, the teeth also oc- 
casionally exploded! 


preparing nitrocellulose, and in turn mixed this with nitroglyc- 
erin and sawdust to produce dynamite. Nobel’s income from 
dynamite and also from his profitable development of the Russ- 
ian oil fields in Baku eventually formed the endowment for the 
Nobel Prizes. 


Portions adapted from Burke, J., 1996. The Pinball Effect: How Renaissance 
Water Gardens Made the Carburetor Possible and Other Journeys Through Knowledge. 
New York: Little, Brown, & Company. 








clotting. Hyaluronate molecules may consist of as many as 25,000 disaccharide 
units, with molecular weights of up to 10’. Hyaluronates are important components 
of the vitreous humor in the eye and of synovial fluid, the lubricant fluid of joints in 
the body. The chondroitins and keratan sulfate are found in tendons, cartilage, and 
other connective tissue; dermatan sulfate, as its name implies, isa component of the 
extracellular matrix of skin. Glycosaminoglycans are fundamental constituents of 
proteoglycans (discussed later). 


Polysaccharides Provide Strength and Rigidity to Bacterial Cell Walls 


Some of nature’s most interesting polysaccharide structures are found in bacterial 
cell walls. Given the strength and rigidity provided by polysaccharide structures, it is 
not surprising that bacteria use such structures to provide protection for their cel- 
lular contents. Bacteria normally exhibit high internal osmotic pressures and fre- 
quently encounter variable, often hypotonic exterior conditions. The rigid cell walls 
synthesized by bacteria maintain cell shape and size and prevent swelling or shrink- 
age that would inevitably accompany variations in solution osmotic strength. 


Peptidoglycan Is the Polysaccharide of Bacterial Cell Walls 


Bacteria are conveniently classified as either Gram-positive or Gram-negative de- 
pending on their response to the so-called Gram stain. Despite substantial differ- 
ences in the various structures surrounding these two types of cells, nearly all bac- 
terial cell walls have a strong, protective peptide—polysaccharide layer called 
peptidoglycan. Gram-positive bacteria have a thick (approximately 25 nm) cell wall 
consisting of multiple layers of peptidoglycan. This thick cell wall surrounds the 
bacterial plasma membrane. Gram-negative bacteria, in contrast, have a much thin- 
ner (2 to 3 nm) cell wall consisting of a single layer of peptidoglycan sandwiched 
between the inner and outer lipid bilayer membranes. In either case, peptidogly- 
can, sometimes called murein (from the Latin murus, meaning “wall”), is a contin- 
uous crosslinked structure—in essence, a single molecule—built around the cell. 
The structure is shown in Figure 7.29. The backbone is a B(1—4)-linked polymer 
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FIGURE 7.29 (a) The structure of peptidoglycan. The tetrapeptides linking adjacent backbone chains contain an 
unusual y-carboxyl linkage. (b) The crosslink in Gram-positive cell walls is a pentaglycine bridge. (c) In Gram- 
negative cell walls, the linkage between the tetrapeptides of adjacent carbohydrate chains in peptidoglycan 
involves a direct amide bond between the lysine side chain of one tetrapeptide and b-alanine of the other. 


of N-acetylglucosamine and Nacetylmuramic acid units. This part of the structure 
is similar to that of chitin, but it is joined to a tetrapeptide, usually L-Ala > D-Glu - 
L-Lys : D-Ala, in which the L-lysine is linked to the y-COOH of b-glutamate. The pep- 
tide is linked to the Macetylmuramic acid units via its D-lactate moiety. The e-amino 
group of lysine in this peptide is linked to the —COOH of D-alanine of an adjacent 
tetrapeptide. In Gram-negative cell walls, the lysine e-amino group forms a direct 
amide bond with this D-alanine carboxyl (Figure 7.29). In Gram-positive cell walls, a 
pentaglycine chain bridges the lysine e-amino group and the D-Ala carboxyl group. 
Gram-negative cell walls are also covered with highly complex lipopolysaccharides 
(Figure 7.30). 
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FIGURE 7.30 The structures of the cell wall and membrane(s) in Gram-positive and Gram-negative bacteria. 
The Gram-positive cell wall is thicker than that in Gram-negative bacteria, compensating for the absence of a 
second (outer) bilayer membrane. 


Cell Walls of Gram-Negative Bacteria In Gram-negative bacteria, the peptidogly- 
can wall is the rigid framework around which is built an elaborate membrane struc- 
ture (Figure 7.30). The peptidoglycan layer encloses the periplasmic space and is at- 
tached to the outer membrane via a group of hydrophobic proteins. 

As shown in Figure 7.31, the outer membrane of Gram-negative bacteria is 
coated with a highly complex lipopolysaccharide, which consists of a lipid group 
(anchored in the outer membrane) joined to a polysaccharide made up of long 
chains with many different and characteristic repeating structures (Figure 7.31). 
These many different unique units determine the antigenicity of the bacteria; that 
is, animal immune systems recognize them as foreign substances and raise antibod- 
ies against them. As a group, these antigenic determinants are called the O antigens, 


> FIGURE7.31 Lipopolysaccharide (LPS) coats the outer membrane of Gram-negative bacteria. The lipid por- 
tion of the LPS is embedded in the outer membrane and is linked to a complex polysaccharide. 
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and there are thousands of different ones. The Salmonella bacteria alone have well 
over a thousand known O antigens that have been organized into 17 different 
groups. The great variation in these O antigen structures apparently plays a role in 
the recognition of one type of cell by another and in evasion of the host immune 
system. 


Cell Walls of Gram-Positive Bacteria In Gram-positive bacteria, the cell exterior 
is less complex than for Gram-negative cells. Having no outer membrane, Gram- 
positive cells compensate with a thicker wall. Covalently attached to the peptido- 
glycan layer are teichoic acids, which often account for 50% of the dry weight of 
the cell wall. The teichoic acids are polymers of ribitol phosphate or glycerol phosphate 
linked by phosphodiester bonds. 


Animals Display a Variety of Cell Surface Polysaccharides 


Compared to bacterial cells, which are identical within a given cell type (except for 
O antigen variations), animal cells display a wondrous diversity of structure, consti- 
tution, and function. Although each animal cell contains, in its genetic material, the 
instructions to replicate the entire organism, each differentiated animal cell care- 
fully controls its composition and behavior within the organism. A great part of 
each cell’s uniqueness begins at the cell surface. This surface uniqueness is critical 
to each animal cell because cells spend their entire life span in intimate contact with 
other cells and must therefore communicate with one another. That cells are able 
to pass information among themselves is evidenced by numerous experiments. For 
example, heart myocytes, when grown in culture (in glass dishes), establish synchrony 
when they make contact, so that they “beat” or contract in unison. If they are re- 
moved from the culture and separated, they lose their synchronous behavior, but if 
allowed to reestablish cell-to-cell contact, they spontaneously restore their synchro- 
nous contractions. 

As these and many other related phenomena show, it is clear that molecular 
structures on one cell are recognizing and responding to molecules on the 
adjacent cell or to molecules in the extracellular matrix, the complex “soup” of 
connective proteins and other molecules that exists outside of and among cells. 
Many of these interactions involve glycoproteins on the cell surface and proteoglycans 
in the extracellular matrix. The “information” held in these special carbohydrate- 
containing molecules is not encoded directly in the genes (as with proteins) but is 
determined instead by expression of the appropriate enzymes that assemble car- 
bohydrate units in a characteristic way on these molecules. Also, by virtue of the 
several hydroxyl linkages that can be formed with each carbohydrate monomer, 
these structures are arguably more information-rich than proteins and nucleic 
acids, which can form only linear polymers. A few of these glycoproteins and their 
unique properties are described in the following sections. 


7:55 What Are Glycoproteins, and How Do They 
Function in Cells? 


Many proteins found in nature are glycoproteins because they contain covalently 
linked oligosaccharide and polysaccharide groups. The list of known glycoproteins 
includes structural proteins, enzymes, membrane receptors, transport proteins, and 
immunoglobulins, among others. In most cases, the precise function of the bound 
carbohydrate moiety is not understood. 

Carbohydrate groups may be linked to polypeptide chains via the hydroxyl 
groups of serine, threonine, or hydroxylysine residues (in O-linked saccharides) 
(Figure 7.32a) or via the amide nitrogen of an asparagine residue (in N-linked 
saccharides) (Figure 7.32b). The carbohydrate residue linked to the protein in 
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FIGURE 7.32 The carbohydrate moieties of glycopro- 
teins may be linked to the protein via (a) serine or 
threonine residues (in the O-linked saccharides) or 

(b) asparagine residues (in the N-linked saccharides). 
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FIGURE 7.33 The O-linked saccharides of glycoproteins 
appear in many cases to adopt extended conformations 
that serve to extend the functional domains of these 
proteins above the membrane surface. (Adapted from 
Jentoft, N., 1990. Why are proteins O-glycosylated? Trends in Bio- 
chemical Sciences 15:291-294.) 


O-linked saccharides is usually an Macetylgalactosamine, but mannose, galactose, and 
xylose residues linked to protein hydroxyls are also found (Figure 7.32a). Oligosac- 
charides O-linked to glycophorin (see Figure 9.10) involve Macetylgalactosamine 
linkages and are rich in sialic acid residues. N-linked saccharides always have a 
unique core structure composed of two Macetylglucosamine residues linked to a 
branched mannose triad (Figure 7.32b, c). Many other sugar units may be linked to 
each of the mannose residues of this branched core. 

O-linked saccharides are often found in cell surface glycoproteins and in 
mucins, the large glycoproteins that coat and protect mucous membranes in the 
respiratory and gastrointestinal tracts in the body. Certain viral glycoproteins also 
contain O-linked sugars. O-linked saccharides in glycoproteins are often found 
clustered in richly glycosylated domains of the polypeptide chain. Physical studies 
on mucins show that they adopt rigid, extended structures. An individual mucin 
molecule (M, = 107) may extend over a distance of 150 to 200 nm in solution. In- 
herent steric interactions between the sugar residues and the protein residues in 
these cluster regions cause the peptide core to fold into an extended and relatively 
rigid conformation. This interesting effect may be related to the function of 
O-linked saccharides in glycoproteins. It allows aggregates of mucin molecules to 
form extensive, intertwined networks, even at low concentrations. These viscous 
networks protect the mucosal surface of the respiratory and gastrointestinal tracts 
from harmful environmental agents. 

There appear to be two structural motifs for membrane glycoproteins containing 
O-linked saccharides. Certain glycoproteins, such as leukosialin, are O-glycosylated 
throughout much or most of their extracellular domain (Figure 7.33). Leukosialin, 
like mucin, adopts a highly extended conformation, allowing it to project great dis- 
tances above the membrane surface, perhaps protecting the cell from unwanted in- 
teractions with macromolecules or other cells. The second structural motif is ex- 
emplified by the low-density lipoprotein (LDL) receptor and by decay-accelerating 
factor (DAF). These proteins contain a highly O-glycosylated stem region that sep- 
arates the transmembrane domain from the globular, functional extracellular 
domain. The O-glycosylated stem serves to raise the functional domain of the pro- 
tein far enough above the membrane surface to make it accessible to the extracel- 
lular macromolecules with which it interacts. 
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Drug Research Finds a Sweet Spot 
A variety of diseases are being successfully treated with sugar-based either on the market or at various stages of clinical trials. Some of 
therapies. As this table shows, several carbohydrate-based drugs are these drugs are enzymes, whereas others are glycoconjugates. 
Drug Description Manufacturer 
Cerezyme This enzyme degrades glycolipids, compensating for an enzyme deficiency that Genzyme 
(imiglucerase) causes Gaucher’s disease. Cambridge, MA 
Vancocin A very potent glycopeptide antibiotic that is typically used against antibiotic- Eli Lilly 
(vancomycin) resistant infections. It inhibits synthesis of peptidoglycan in the bacterial cell wall. Indianapolis, IN 
Vevesca A sugar analog that inhibits synthesis of the glycolipid that accumulates in Oxford GlycoSciences 
(OGT 918) Gaucher’s disease. Abingdon, UK 
GMK A vaccine containing ganglioside GM2; it triggers an immune response against Progenics Pharmaceuticals 
cancer cells carrying GM2. Tarrytown, NY 
Staphvax A vaccine that is a protein with a linked bacterial sugar; it is intended to treat NABI Pharmaceuticals 
Staphylococcus infection. Boca Raton, FL 
Bimosiamose A sugar analog that inhibits selectin-based inflammation in blood vessels. Texas Biotechnology 
(TBC1269) Houston, TX 
GCS-100 A sugar that blocks action of a sugar-binding protein on tumors. GlycoGenesys 
Boston 
GD0039 A sugar analog that inhibits synthesis of carbohydrates essential to tumor GlycoDesign 
(swainsonine) metastasis. Toronto, Canada 
PI-88 A sugar that inhibits growth factor-dependent angiogenesis and enzymes that Progen 
promote metastasis. Darra, Australia 
Adapted from Maeder, T., 2002. Sweet medicines. Scientific American 287:40-47. 
Additional References: Alper, J, 2001. Searching for medicine's sweet spot. Science 291:2338-2343. Borman, S., 2007. Sugar medicine. Chemical & Engineering News 85:19-30. 








Polar Fish Depend on Antifreeze Glycoproteins 


A unique family of O-linked glycoproteins permits fish to live in the icy seawater of 
the Arctic and Antarctic regions, where water temperature may reach as low as 
—1.9°C. Antifreeze glycoproteins (AFGPs) are found in the blood of nearly all 
Antarctic fish and at least five Arctic fish. These glycoproteins have the peptide 
structure 


{Ala-Ala-Thr],,-Ala-Ala 


where n can be 4, 5, 6, 12, 17, 28, 35, 45, or 50. Each of the threonine residues is gly- 
cosylated with the disaccharide £-galactosyl-(1—3)-a-Nacetylgalactosamine (Figure 
7.34). This glycoprotein adopts a flexible rod conformation with regions of threefold 
left-handed helix. The evidence suggests that antifreeze glycoproteins may inhibit the 
formation of ice in the fish by binding specifically to the growth sites of ice crystals, in- 
hibiting further growth of the crystals. 


N-Linked Oligosaccharides Can Affect the Physical Properties 
and Functions of a Protein 


N-linked oligosaccharides are found in many different proteins, including immuno- 
globulins G and M, ribonuclease B, ovalbumin, and peptide hormones. Many differ- 
ent functions are known or suspected for N-glycosylation of proteins. Glycosylation 
can affect the physical and chemical properties of proteins, altering solubility, mass, 
and electrical charge. Carbohydrate moieties have been shown to stabilize protein con- 
formations and protect proteins against proteolysis. Eukaryotic organisms use post- 
translational additions of N-linked oligosaccharides to direct selected proteins to 
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FIGURE 7.34 The structure of the repeating unit of 
antifreeze glycoproteins, a disaccharide consisting of 
B-galactosyl-(1—>3)-a-N-acetylgalactosamine in 
glycosidic linkage to a threonine residue. 


‘Sia — Gal — GlcNAc — Man 


Man — GlcNAc 


Ba Gal — GIcNAc — Man’ 


/ 
“Sia — Gal— GlcNAc 


(Does not bind) 





B-Galactosyl-1,3-a-N-acetylgalactosamine 


Repeating unit of antifreeze glycoproteins 


various membrane compartments. Recent evidence indicates that N-linked oligosac- 
charides promote the proper folding of newly synthesized polypeptides in the endo- 
plasmic reticulum (see A Deeper Look on page 209). 


Oligosaccharide Cleavage Can Serve as a Timing Device 
for Protein Degradation 


The slow cleavage of monosaccharide residues from N-linked glycoproteins circu- 
lating in the blood targets these proteins for degradation by the organism. The liver 
contains specific receptor proteins that recognize and bind glycoproteins that are 
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FIGURE 7.35 Progressive cleavage of sialic acid residues 
exposes galactose residues. Binding to the asialoglyco- 
protein receptor in the liver becomes progressively 
more likely as more Gal residues are exposed. 
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7.6 How Do Proteoglycans Modulate Processes in Cells and Organisms? 


A DEEPER LOOK 





N-Linked Oligosaccharides Help Proteins Fold 


One important effect of N-linked oligosaccharides in eukaryotic or- 
ganisms may be their contribution to the correct folding of certain 
globular proteins. This adaptation of saccharide function allows 
cells to produce and secrete larger and more complex proteins at 
high levels. Inhibition of glycosylation leads to production of mis- 
folded, aggregated proteins that lack function. Certain proteins are 
highly dependent on glycosylation, whereas others are much less 


Studies with model peptides show that oligosaccharides can al- 
ter the conformational preferences near the glycosylation sites. In 
addition, the presence of polar saccharides may serve to orient 
that portion of a peptide toward the surface of protein domains. 
However, it has also been found that saccharides usually are not 
essential for maintaining the overall folded structure after a gly- 
coprotein has reached its native, folded structure. 


209 


so, and certain glycosylation sites are more important for protein 
folding than are others. 


Source: Helenius, A., and Aebi, M., 2001. Intracellular functions of N-linked 
glycans. Science 291:2364-2369. 








ready to be degraded and recycled. Newly synthesized serum glycoproteins contain 
N-linked triantennary (three-chain) oligosaccharides having structures similar to 
those in Figure 7.35, in which sialic acid residues cap galactose residues. As these 
glycoproteins circulate, enzymes on the blood vessel walls cleave off the sialic acid 
groups, exposing the galactose residues. In the liver, the asialoglycoprotein recep- 
tor binds the exposed galactose residues of these glycoproteins with very high affin- 
ity (Kp = 107° to 10-8 M). The complex of receptor and glycoprotein is then taken 
into the cell by endocytosis, and the glycoprotein is degraded in cellular lysosomes. 
Highest affinity binding of glycoprotein to the asialoglycoprotein receptor requires 
three free galactose residues. Oligosaccharides with only one or two exposed galac- 
tose residues bind less tightly. This is an elegant way for the body to keep track of 
how long glycoproteins have been in circulation. Over a period of time—anywhere 
from a few hours to weeks—the sialic acid groups are cleaved one by one. The 
longer the glycoprotein circulates and the more sialic acid residues are removed, 
the more galactose residues become exposed so that the glycoprotein is eventually 
bound to the liver receptor. 


726: How Do Proteoglycans Modulate Processes 
in Cells and Organisms? 


Proteoglycans are a family of glycoproteins whose carbohydrate moieties are pre- 
dominantly glycosaminoglycans. The structures of only a few proteoglycans are 
known, and even these few display considerable diversity (Figure 7.36). Those 
known range in size from serglycin, having 104 amino acid residues (10.2 kD), to 
versican, having 2409 residues (265 kD). Each of these proteoglycans contains one 
or two types of covalently linked glycosaminoglycans. In the known proteoglycans, 
the glycosaminoglycan units are O-linked to serine residues of Ser-Gly dipeptide 
sequences. Serglycin is named for a unique central domain of 49 amino acids com- 
posed of alternating serine and glycine residues. The cartilage matrix proteoglycan 
contains 117 Ser-Gly pairs to which chondroitin sulfates attach. Decorin, a small 
proteoglycan secreted by fibroblasts and found in the extracellular matrix of con- 
nective tissues, contains only three Ser-Gly pairs, only one of which is normally gly- 
cosylated. In addition to glycosaminoglycan units, proteoglycans may also contain 
other N-linked and O-linked oligosaccharide groups. 


Functions of Proteoglycans Involve Binding to Other Proteins 


Proteoglycans may be soluble and located in the extracellular matrix, as is the case 
for serglycin, versican, and the cartilage matrix proteoglycan, or they may be inte- 
gral transmembrane proteins, such as syndecan. Both types of proteoglycan appear to 
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FIGURE 7.36 The known proteoglycans include a variety of structures. The carbohydrate groups of proteogly- 
cans are predominantly glycosaminoglycans O-linked to serine residues. Proteoglycans include both soluble 
proteins and integral transmembrane proteins. 


function by interacting with a variety of other molecules through their glycosami- 
noglycan components and through specific receptor domains in the polypeptide it- 
self. For example, syndecan (from the Greek syndein, meaning “to bind together”) 
is a transmembrane proteoglycan that associates intracellularly with the actin cyto- 
skeleton (see Chapter 16). Outside the cell, it interacts with fibronectin, an extra- 
cellular protein that binds to several cell surface proteins and to components of the 
extracellular matrix. The ability of syndecan to participate in multiple interactions 
with these target molecules allows them to act as a sort of “glue” in the extracellu- 
lar space, linking components of the extracellular matrix, facilitating the binding of 
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cells to the matrix, and mediating the binding of growth factors and other soluble 
molecules to the matrix and to cell surfaces (Figure 7.37). 

Many of the functions of proteoglycans involve the binding of specific proteins to 
the glycosaminoglycan groups of the proteoglycan. The glycosaminoglycan-binding 
sites on these specific proteins contain multiple basic amino acid residues. The 
amino acid sequences BBXB and BBBXXB (where B is a basic amino acid and X 
is any amino acid) recur repeatedly in these binding domains. Basic amino acids 
such as lysine and arginine provide charge neutralization for the negative 
charges of glycosaminoglycan residues, and in many cases, the binding of extra- 
cellular matrix proteins to glycosaminoglycans is primarily charge-dependent. For 
example, more highly sulfated glycosaminoglycans bind more tightly to fibronectin. 
However, certain protein—glycosaminoglycan interactions require a specific carbo- 
hydrate sequence. A particular pentasaccharide sequence in heparin, for example, 
binds tightly to antithrombin II (Figure 7.38), accounting for the anticoagulant 
properties of heparin. Other glycosaminoglycans interact much more weakly. 


Proteoglycans May Modulate Cell Growth Processes 


Several lines of evidence raise the possibility of modulation or regulation of cell 
growth processes by proteoglycans. First, heparin and heparan sulfate are known 
to inhibit cell proliferation in a process involving internalization of the gly- 
cosaminoglycan moiety and its migration to the cell nucleus. Second, fibroblast 
growth factor binds tightly to heparin and other glycosaminoglycans, and the 
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FIGURE 7.37 Proteoglycans serve a variety of functions 
on the cytoplasmic and extracellular surfaces of the 
plasma membrane. Many of these functions appear 

to involve the binding of specific proteins to the 
glycosaminoglycan groups. 


FIGURE 7.38 A portion of the structure of heparin, a car- 
bohydrate having anticoagulant properties. It is used by 
blood banks to prevent the clotting of blood during do- 
nation and storage and also by physicians to prevent the 
ormation of life-threatening blood clots in patients re- 
covering from serious injury or surgery. This sulfated pen- 
asaccharide sequence in heparin binds with high affin- 
ity to antithrombin IIl, accounting for this anticoagulant 
activity. The 3-O-sulfate marked by an asterisk is essential 
or high-affinity binding of heparin to antithrombin III. 
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FIGURE 7.39 Hyaluronate (see Figure 7.28) forms the 
backbone of proteoglycan structures, such as those 
found in cartilage. The proteoglycan subunits consist 
of a core protein containing numerous O-linked and 
N-linked glycosaminoglycans. In cartilage, these highly 
hydrated proteoglycan structures are enmeshed in a 
network of collagen fibers. Release (and subsequent re- 
absorption) of water by these structures during com- 
pression accounts for the shock-absorbing qualities of 
cartilaginous tissue. 


heparin—growth factor complex protects the growth factor from degradative en- 
zymes, thus enhancing its activity. There is evidence that binding of fibroblast 
growth factors by proteoglycans and glycosaminoglycans in the extracellular ma- 
trix creates a reservoir of growth factors for cells to use. Third, transforming 
growth factor B has been shown to stimulate the synthesis and secretion of pro- 
teoglycans in certain cells. Fourth, several proteoglycan core proteins, including 
versican and lymphocyte homing receptor, have domains similar in sequence to 
those of epidermal growth factor and complement regulatory factor. These 
growth factor domains may interact specifically with growth factor receptors in 
the cell membrane in processes that are not yet understood. 
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7.7 Do Carbohydrates Provide a Structural Code? 


Proteoglycans Make Cartilage Flexible and Resilient 


Cartilage matrix proteoglycan is responsible for the flexibility and resilience of car- 
tilage tissue in the body. In cartilage, long filaments of hyaluronic acid are studded 
or coated with proteoglycan molecules, as shown in Figure 7.39. The hyaluronate 
chains can be as long as 4 wm and can coordinate 100 or more proteoglycan units. 
Cartilage proteoglycan possesses a hyaluronic acid-binding domain on the NHs- 
terminal portion of the polypeptide, which binds to hyaluronate with the assistance 
of a link protein. The proteoglycan—hyaluronate aggregates can have molecular 
weights of 2 million or more. 

The proteoglycan-hyaluronate aggregates are highly hydrated by virtue of 
strong interactions between water molecules and the polyanionic complex. 
When cartilage is compressed (such as when joints absorb the impact of walking 
or running), water is briefly squeezed out of the cartilage tissue and then reab- 
sorbed when the stress is diminished. This reversible hydration gives cartilage its 
flexible, shock-absorbing qualities and cushions the joints during physical activi- 
ties that might otherwise injure the involved tissues. 


Y= Do Carbohydrates Provide a Structural Code? 


The surprisingly low number of genes in the genomes of complex multicellular or- 
ganisms has led biochemists to consider other explanations for biological complexity 
and diversity. Oligosaccharides and polysaccharides, endowed with an unsurpassed 
variability of structures, are information carriers, and glycoconjugates—complexes of 
proteins with oligosaccharides and polysaccharides—are the mediators of informa- 
tion transfer by these carbohydrate structures. Individual sugar units are the “letters” 
of the sugar code, and the “words” and “sentences” of this code are synthesized by gly- 
cosyltransferases, glycosidases, and other enzymes. The total number of permutations 
for a six-unit polymer formed from an alphabet of 20 hexose monosaccharides is a 
staggering 1.44 X 10%, whereas only 6.4 X 10’ hexamers can be formed from 
20 amino acids and only 4096 hexanucleotides can be formed from the four nu- 
cleotides of DNA. The vast array of possible glycan structures adds a glycomic dimen- 
sion to the genomic complexity achieved by protein expression in organisms. 

The processes of cell migration, cell-cell interaction, immune response, and 
blood clotting, along with many other biological processes, depend on information 
transfer modulated by glycoconjugates. Many of the proteins involved in glycocon- 
jugate formation belong to the lectins—a class of proteins that bind carbohydrates 
with high specificity and affinity. Lectins ave the translators of the sugar code. Table 7.1 
describes a few of the many known lectins, their carbohydrate affinities, and their 
functions. A few examples of lectin-carbohydrate complexes and their roles in bio- 
logical information transfer will illustrate the nature of these important and com- 
plex interactions. 











TABLE 7.1 an 
Carbohydrate 

Lectin Family Specificity Function 

Calnexins Glucose Ligand-selective molecular chaperones in ER 

C-type lectins Variable Cell-type specific endocytosis and other 
functions 

ERGIC-53 Mannose Intracellular routing of glycoproteins and 
vesicles 

Galectins Galactose/lactose Cellular growth regulation and cell—matrix 
interactions 

Pentraxins Variable Anti-inflammatory action 

Selectins Variable Cell migration and routing 
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Selectins, Rolling Leukocytes, and the Inflammatory Response 


Human bodies are constantly exposed to a plethora of bacteria, viruses, and other 
inflammatory substances. To combat these infectious and toxic agents, the body has 
developed a carefully regulated inflammatory response system. Part of that re- 
sponse is the orderly migration of leukocytes to sites of inflammation. Leukocytes 
literally roll along the vascular wall and into the tissue site of inflammation. This 
rolling movement is mediated by reversible adhesive interactions between the 
leukocytes and the vascular surface. 

These interactions involve adhesion proteins called selectins, which are found 
both on the rolling leukocytes and on the endothelial cells of the vascular walls. 
Selectins have a characteristic domain structure, consisting of an N-terminal extra- 
cellular lectin (LEC) domain, a single epidermal growth factor (E) domain, a series 
of two to nine short consensus repeat (SCR) domains, a single transmembrane 
segment, and a short cytoplasmic domain. The lectin domains bind carbohydrates 
with high affinity and specificity. Selectins of three types are known: E-selectins, 
L-selectins, and P-selectins. L-Selectin is found on the surfaces of leukocytes, in- 
cluding neutrophils and lymphocytes, and binds to carbohydrate ligands on endo- 
thelial cells (Figure 7.40). The presence of L-selectin is a necessary component of 
leukocyte rolling. P-Selectin and E-selectin are located on the vascular endothelium 
and bind with carbohydrate ligands on leukocytes. Typical neutrophil cells possess 
10,000 to 20,000 P-selectin—binding sites. Selectins are expressed on the surfaces of 
their respective cells by exposure to inflammatory signal molecules, such as hista- 
mine, hydrogen peroxide, and bacterial endotoxins. P-Selectins, for example, are 
stored in intracellular granules and are transported to the cell membrane within 
seconds to minutes of exposure to a triggering agent. 

Substantial evidence supports the hypothesis that selectin-carbohydrate ligand 
interactions modulate the rolling of leukocytes along the vascular wall. Studies with 
L-selectin—deficient and P-selectin—-deficient leukocytes show that L-selectins medi- 
ate weaker adherence of the leukocyte to the vascular wall and promote faster 
rolling along the wall. Conversely, P-selectins promote stronger adherence and 
slower rolling. Thus, leukocyte rolling velocity in the inflammatory response could 
be modulated by variable exposure of P-selectins and L-selectins at the surfaces of 
endothelial cells and leukocytes, respectively. 
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FIGURE 7.40 (a) The interactions of selectins with their receptors. (b) The selectin family of adhesion proteins. 
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(b) 





FIGURE 7.41 (a) Structure of the human galectin-1 
dimer. The lactose-binding sites are at opposite ends of 
the dimer. (b) The carbohydrate recognition site of 
human galectin-1, showing the amino acids involved in 
galactose binding and the network of water molecules 
(red circles) that orient these residues. 





Galectins—Mediators of Inflammation, Immunity, and Cancer 


The galectins are a very conserved family of proteins with carbohydrate recognition 
domains (CRDs) of about 135 amino acids that bind 6-galactosides specifically. 
Galectins occur in both vertebrates and invertebrates, and they participate in 
processes such as cell adhesion, growth regulation, inflammation, immunity, and 
cancer metastasis. In humans, one galectin is associated with increased risk of heart 
attacks and another is implicated in inflammatory bowel disease. The structure of 
human galectin-1 is a dimer of antiparallel beta-sandwich subunits (Figure 7.41a). 
Lactose binds at opposite ends of the dimer. Structural studies of the protein in the 
presence and absence of ligand reveal that the amino acid residues implicated in 
galactose binding are kept in their proper orientation in the absence of ligand by a 
hydrogen-bonded network of four water molecules (Figure 7.41b). 


C-Reactive Protein—A Lectin That Limits Inflammation Damage 


The pentraxins are lectins that adopt an unusual quaternary structure in which five 
identical subunits combine to form a planar ring with a central hole (Figure 7.42a). 
C-reactive protein is a pentraxin that functions to limit tissue damage, acute in- 
flammation, and autoimmune reactions. C-reactive protein acts by binding to phos- 
phocholine moieties on damaged membranes. Binding of the protein to phospho- 
choline is apparently mediated through a bound calcium ion and a hydrophobic 
pocket centered on Phe® (Figure 7.42b). 


FIGURE 7.42 (a) The C-reactive protein pentamer. 

(b) The phosphocholine-binding site of C-reactive pro- 
tein contains two bound Ca?* ions (blue) and a 
hydrophobic pocket. Phe® is shown in purple. 
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SUMMARY 


Carbohydrates are a versatile class of molecules of the formula (CH,O) ,. 
They are a major form of stored energy in organisms, and they are the 
metabolic precursors of virtually all other biomolecules. Carbohydrates 
linked to lipids (glycolipids) are components of biological membranes. 
Carbohydrates linked to proteins (glycoproteins) are important compo- 
nents of cell membranes and function in recognition between cell types 
and recognition of cells by other molecules. Recognition events are im- 
portant in cell growth, differentiation, fertilization, tissue formation, 
transformation of cells, and other processes. 


7.1 How Are Carbohydrates Named? Carbohydrates are classified 
into three groups: monosaccharides, oligosaccharides, and polysaccha- 
rides. Monosaccharides cannot be broken down into smaller sugars un- 
der mild conditions. Oligosaccharides consist of from two to ten simple 
sugar molecules. Polysaccharides are polymers of simple sugars and 
their derivatives and may be branched or linear. Their molecular 
weights range up to | million or more. 


7.2 What Is the Structure and Chemistry of Monosaccharides? Mono- 
saccharides consist typically of three to seven carbon atoms and are de- 
scribed as either aldoses or ketoses. Aldoses with at least three carbons 
and ketoses with at least four carbons contain chiral centers. The pre- 
fixes D- and L- are often used to indicate the configuration of the high- 
est numbered asymmetric carbon. The D- and L-forms of a monosac- 
charide are mirror images of each other, called enantiomers. Pairs of 
isomers that have opposite configurations at one or more chiral centers, 
but are not mirror images of each other, are called diastereomers. Sug- 
ars that differ in configuration at only one chiral center are epimers. An 
interesting feature of carbohydrates is their ability to form cyclic struc- 
tures with formation of an additional asymmetric center. Aldoses and 
ketoses with five or more carbons can form either furanose or pyranose 
rings, and the more stable form depends on structural factors. A variety 
of chemical and enzymatic reactions produce derivatives of simple sug- 
ars, such as sugar acids, sugar alcohols, deoxy sugars, sugar esters, 
amino sugars, acetals, ketals, and glycosides. 


7.3 What Is the Structure and Chemistry of Oligosaccharides? The 
complex array of oligosaccharides in higher organisms is formed from 
relatively few different monosaccharide units, particularly glucose, fruc- 
tose, mannose, galactose, ribose, and xylose. Disaccharides consist of 
two monosaccharide units linked by a glycosidic bond, and each indi- 
vidual unit is termed a residue. The most common disaccharides in na- 
ture are sucrose, maltose, and lactose. The anomeric carbons of oligo- 
saccharides may be substituted or unsubstituted. Disaccharides with a 
free, unsubstituted anomeric carbon can reduce oxidizing agents and 
thus are termed reducing sugars. 


7.4 What Is the Structure and Chemistry of Polysaccharides? Poly- 
saccharides are formed from monosaccharides and their derivatives. If 


a polysaccharide consists of only one kind of monosaccharide, it is a 
homopolysaccharide, whereas those with more than one kind of mono- 
saccharide are heteropolysaccharides. Polysaccharides may function as 
energy storage materials, structural components of organisms, or pro- 
tective substances. Starch and glycogen are readily metabolizable and 
provide energy reserves for cells. Chitin and cellulose provide strong 
support for the skeletons of arthropods and green plants, respectively. 
Mucopolysaccharides such as hyaluronic acid form protective coats on 
animal cells. Peptidoglycan, the strong protective macromolecule of 
bacterial cell walls, is composed of peptide-linked glycan chains. 


7.5 What Are Glycoproteins, and How Do They Function in Cells? 
Glycoproteins are proteins that contain covalently linked oligosaccha- 
rides and polysaccharides. Carbohydrate groups may be linked to pro- 
teins via the hydroxyl groups of serine, threonine, or hydroxylysine 
residues (in O-linked saccharides) or via the amide nitrogen of an as- 
paragine residue (in N-linked saccharides). O-Glycosylated stems of cer- 
tain proteins raise the functional domain of the protein above the mem- 
brane surface and the associated glycocalyx, making these domains 
accessible to interacting proteins. N-Glycosylation confers a variety of 
functions to proteins. N-linked oligosaccharides promote the proper 
folding of newly synthesized polypeptides in the endoplasmic reticulum 
of eukaryotic cells. 


7.6 How Do Proteoglycans Modulate Processes in Cells and Organ- 
isms? Proteoglycans are a family of glycoproteins whose carbohydrate 
moieties are predominantly glycosaminoglycans. Proteoglycans may be 
soluble and located in the extracellular matrix, as for serglycin, versican, 
and cartilage matrix proteoglycans, or they may be integral transmem- 
brane proteins, such as syndecan. Both types appear to function by in- 
teracting with a variety of other molecules through their glycosamino- 
glycan components and through specific receptor domains in the 
polypeptide itself. Proteoglycans modulate cell growth processes and 
are also responsible for the flexibility and resilience of cartilage tissue in 
the body. 


7.7 Do Carbohydrates Provide a Structural Code? Oligosaccharides 
and polysaccharides are information carriers, and glycoconjugates are 
the mediators of information transfer by these carbohydrate structures. 
The vast array of possible glycan structures adds a glycomic dimension 
to the genomic complexity achieved by protein expression in organ- 
isms. The processes of cell migration, cell-cell interaction, immune re- 
sponse, and blood clotting, along with many other biological processes, 
depend on information transfer modulated by glycoconjugates. Many of 
the proteins involved in glycoconjugate formation belong to the 
lectins—a class of proteins that bind carbohydrates with high specificity 
and affinity. 


PROBLEMS 


CENGAGENOW* Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. Draw Haworth structures for the two possible isomers of D-altrose 
(Figure 7.2) and D-psicose (Figure 7.3). 

2. (Integrates with Chapters 4 and 5.) Consider the peptide DGNILSR, 
where N has a covalently linked galactose and S has a covalently 
linked glucose. Draw the structure of this glycopeptide, and also 
draw titration curves for the glycopeptide and for the free peptide 
that would result from hydrolysis of the two sugar residues. 


3. (Integrates with Chapters 5 and 6.) Human hemoglobin can react 
with sugars in the blood (usually glucose) to form covalent adducts. 
The a-amino groups of N-terminal valine in the Hb -subunits 
react with the C-1 (aldehyde) carbons of monosaccharides to form 
aldimine adducts, which rearrange to form very stable ketoamine 
products. Quantitation of this “glycated hemoglobin” is important 
clinically, especially for diabetic individuals. Suggest at least three 
methods by which glycated Hb could be separated from normal Hb 
and quantitated. 


4. Trehalose, a disaccharide produced in fungi, has the following 


structure: 
CHOH 
O, 
H H 
OH H 
HO 
H OH 





a. What is the systematic name for this disaccharide? 
b. Is trehalose a reducing sugar? Explain. 


5. Draw a Fischer projection structure for L-sorbose (D-sorbose is 
shown in Figure 7.3). 

6. œ-D-Glucose has a specific rotation, [æ], ®, of +112.2°, whereas 
B-b-glucose has a specific rotation of +18.7°. What is the composi- 
tion of a mixture of a-D- and B-D-glucose, which has a specific rota- 
tion of 83.0°? 

7. Use the information in the Critical Developments in Biochemistry 
box titled “Rules for Description of Chiral Centers in the (R,S) Sys- 
tem” (Chapter 4) to name D-galactose using (R,S) nomenclature. 
Do the same for L-altrose. 

8. A 0.2-¢ sample of amylopectin was analyzed to determine the frac- 
tion of the total glucose residues that are branch points in the struc- 
ture. The sample was exhaustively methylated and then digested, 
yielding 50 umol of 2,3-dimethylglucose and 0.4 umol of 1,2,3,6- 
tetramethylglucose. 

a. What fraction of the total residues are branch points? 
b. How many reducing ends does this sample of amylopectin have? 

9. (Integrates with Chapters 5, 6, and 9.) Consider the sequence of gly- 
cophorin (see Figure 9.10), and imagine subjecting glycophorin, 
and also a sample of glycophorin treated to remove all sugars, to 
treatment with trypsin and chymotrypsin. Would the presence of 
sugars in the native glycophorin make any difference to the results? 


10. (Integrates with Chapters 4, 5, and 23.) The caloric content of 
protein and carbohydrate are quite similar, at approximately 16 to 
17 kJ/g, whereas that of fat is much higher, at 38 kJ/g. Discuss the 
chemical basis for the similarity of the values for carbohydrate and 
for protein. 

11. Write a reasonable chemical mechanism for the starch phosphory- 
lase reaction (Figure 7.22). 

12. Laetrile is a glycoside found in bitter almonds and peach pits. 
Laetrile treatment is offered in some countries as a cancer therapy. 
This procedure is dangerous, and there is no valid clinical evidence 
of its efficacy. Look up the structure of laetrile and suggest at least 
one reason that laetrile treatment could be dangerous for human 
patients. 

13. Treatment with chondroitin and glucosamine is offered as one pop- 
ular remedy for arthritis pain. Suggest an argument for the efficacy 
of this treatment, and then comment on its validity, based on what 
you know of polysaccharide chemistry. 

14. Certain foods, particularly beans and legumes, contain substances 
that are indigestible (at least in part) by the human stomach, but 
which are metabolized readily by intestinal microorganisms, produc- 
ing flatulence. One of the components of such foods is stachyose. 
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Stachyose 


CHOH 
OH 
Beano is a commercial product that can prevent flatulence. Describe 
the likely breakdown of stachyose in the human stomach and in- 
testines and how Beano could contribute to this process. What would 
be an appropriate name for the active ingredient in Beano? 
15. Give the systematic name for stachyose. 


16. Prolonged exposure of amylopectin to starch phosphorylase yields 
a substance called a limit dextrin. Describe the chemical composi- 
tion of limit dextrins, and draw a mechanism for the enzyme- 
catalyzed reaction that can begin the breakdown of a limit dextrin. 

17. Biochemist Joseph Owades revolutionized the production of beer 
in the United States by developing a simple treatment with an en- 
zyme that converted regular beer into “light beer,” which was mar- 
keted aggressively as a beverage that “tastes great,” even though it is 
“less filling.” What was the enzyme-catalyzed reaction that Owades 
used to modify the fermentation process so cleverly, and how is reg- 
ular beer different from light beer? 

18. Amateur brewers of beer, who do not have access to the enzyme de- 
scribed in problem 17, have nonetheless managed to brew light 
beers using a readily available commercial product. What is that 
product, and how does it work? 


19. Kudzu is a vine that grows prolifically in the southern and south- 
eastern United States. A native of Japan, China, and India, kudzu 
was brought to the United States in 1876 at the Centennial Exposi- 
tion in Philadelphia. During the Great Depression of the 1930s, the 
Soil Conservation Service promoted kudzu for erosion control, and 
farmers were paid to plant it. Today, however, kudzu is a universal 
nuisance, spreading rapidly, and covering and destroying trees in 
large numbers. Already covering 7 to 10 million acres in the U.S., 
kudzu grows at the rate of a foot per day. Assume that the kudzu 
vine consists almost entirely of cellulose fibers, and assume that the 
fibers lie parallel to the vine axis. Calculate the rate of the cellulose 
synthase reaction that adds glucose units to the growing cellulose 
molecules. Use the structures in your text to make a reasonable es- 
timate of the unit length of a cellulose molecule (from one glucose 
monomer to the next). 


Preparing for the MCAT Exam 


20. Heparin has a characteristic pattern of hydroxy and anionic func- 
tions. What amino acid side chains on antithrombin III might be 
the basis for the strong interactions between this protein and the 
anticoagulant heparin? 

21. What properties of hyaluronate, chondroitin sulfate, and keratan 
sulfate make them ideal components of cartilage? 
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Lipids 


ESSENTIAL QUESTION 


Lipids are a class of biological molecules defined by low solubility in water and high 
solubility in nonpolar solvents. As molecules that are largely hydrocarbon in nature, 
lipids represent highly reduced forms of carbon and, upon oxidation in metabolism, 
yield large amounts of energy. Lipids are thus the molecules of choice for metabolic 
energy storage. 


What are the structure, chemistry, and biological function of lipids? 


The lipids found in biological systems are either hydrophobic (containing only non- 
polar groups) or amphipathic (possessing both polar and nonpolar groups). The 
hydrophobic nature of lipid molecules allows membranes to act as effective barriers 
to more polar molecules. In this chapter, we discuss the chemical and physical prop- 
erties of the various classes of lipid molecules. The following chapter considers 
membranes, whose properties depend intimately on their lipid constituents. 


Sule What Are the Structures and Chemistry of Fatty Acids? 


A fatty acid is composed of a long hydrocarbon chain (“tail”) and a terminal car- 
boxyl group (or “head”). The carboxyl group is normally ionized under physio- 
logical conditions. Fatty acids occur in large amounts in biological systems but 
only rarely in the free, uncomplexed state. They typically are esterified to glyc- 
erol or other backbone structures. Most of the fatty acids found in nature have 
an even number of carbon atoms (usually 14 to 24). Certain marine organisms, 
however, contain substantial amounts of fatty acids with odd numbers of carbon 
atoms. Fatty acids are either saturated (all carbon-carbon bonds are single 
bonds) or unsaturated (with one or more double bonds in the hydrocarbon 
chain). If a fatty acid has a single double bond, it is said to be monounsaturated, 
and if it has more than one, polyunsaturated. Fatty acids can be named or de- 
scribed in at least three ways, as shown in Table 8.1. For example, a fatty acid 
composed of an 18-carbon chain with no double bonds can be called by its sys- 
tematic name (octadecanoic acid), its common name (stearic acid), or its short- 
hand notation, in which the number of carbons is followed by a colon and the 
number of double bonds in the molecule (18:0 for stearic acid). The structures 
of several common fatty acids are given in Figure 8.1. Stearic acid (18:0) and 
palmitic acid (16:0) are the most common saturated fatty acids in nature. 

Free rotation around each of the carbon-carbon bonds makes saturated fatty 
acids extremely flexible molecules. Owing to steric constraints, however, the fully 
extended conformation (Figure 8.1) is the most stable for saturated fatty acids. 
Nonetheless, the degree of stabilization is slight, and (as will be seen) saturated fatty 
acid chains adopt a variety of conformations. 

Unsaturated fatty acids are slightly more abundant in nature than saturated fatty 
acids, especially in higher plants. The most common unsaturated fatty acid is oleic 
acid, or 18:14°, with the number in parentheses indicating that the double bond is 
between carbons 9 and 10. The number of double bonds in an unsaturated fatty 
acid typically varies from one to four, but in the fatty acids found in most bacteria, 
this number rarely exceeds one. 

The double bonds found in fatty acids are nearly always in the cis configuration. 
As shown in Figure 8.1, this causes a bend or “kink” in the fatty acid chain. This bend 
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“The mighty whales which swim in a sea of water, and 
have a sea of oil swimming in them.” Herman Melville, 
“Extracts.” Moby Dick. New York: Penguin Books, 1972. 
(Humpback whale [Megaptera novaeangliae] breach- 
ing, Cape Cod, MA) 





A feast of fat things, a feast of wines on the lees. 


Isaiah 25:6 





KEY QUESTIONS 


8.1 


8.2 


8.3 


8.4 


8.5 
8.6 


8.7 


8.8 


8.9 


What Are the Structures and Chemistry 
of Fatty Acids? 


What Are the Structures and Chemistry 
of Triacylglycerols? 


What Are the Structures and Chemistry 
of Glycerophospholipids? 





What Are Sphingolipids, and How Are They 
mportant for Higher Animals? 

What Are Waxes, and How Are They Used? 
What Are Terpenes, and What Is Their 
Relevance to Biological Systems? 

What Are Steroids, and What Are Their 
Cellular Functions? 

How Do Lipids and Their Metabolites Act 
as Biological Signals? 


What Can Lipidomics Tell Us about Cell, 
Tissue, and Organ Physiology? 
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TABLE 8.1 
Number 

of Carbons Common Name Systematic Name Symbol Structure 

Saturated fatty acids 
12 Lauric acid Dodecanoic acid 12:0 CH; (CHo) ;pCOOH 
14 Myristic acid Tetradecanoic acid 14:0 CH; (CH) 12COOH 
16 Palmitic acid Hexadecanoic acid 16:0 CH; (CHo) 14COOH 
18 Stearic acid Octadecanoic acid 18:0 CH; (CHo) }gCOOH 
20 Arachidic acid Eicosanoic acid 20:0 CH; (CHo) ı8COOH 
22 Behenic acid Docosanoic acid 22:0 CH; (CH2) 2x9COOH 
24 Lignoceric acid Tetracosanoic acid 24:0 CH; (CHo) »2COOH 

Unsaturated fatty acids (all double bonds are cis) 
16 Palmitoleic acid 9-Hexadecenoic acid 16:1* CH3(CHe2) ;CH=CH (CHo); COOH 
18 Oleic acid 9-Octadecenoic acid 18:1 CH3(CH») 7 CH=CH (CHo); COOH 
18 Linoleic acid 9,12-Octadecadienoic acid 18:2 CH; (CHo) 4(CH=CHCHg») 9(CH») ;COOH 
18 a-Linolenic acid 9,12,15-Octadecatrienoic acid 18:3 CHCH (CH=CHCHo) 3(CHe) COOH 
18 y-Linolenic acid 6,9,12-Octadecatrienoic acid 18:3 CH; (CHo) 4(CH=CHCHs) (CH2), COOH 
20 Arachidonic acid 5,8,11,14-Eicosatetraenoic acid 20:4 CH; (CH) 4(CH=CHCHg») 4(CH») COOH 
24 Nervonic acid 15-Tetracosenoic acid 24:1 CH3(CHe2) ;CH=CH (CHo) ;;COOH 











*Palmitoleic acid can also be described as 16:14, in a convention used to indicate the position of the double bond. 


has very important consequences for the structure of biological membranes. Satu- 
rated fatty acid chains can pack closely together to form ordered, rigid arrays un- 
der certain conditions, but unsaturated fatty acids prevent such close packing and 
produce flexible, fluid aggregates. 

Some fatty acids are not synthesized by mammals and yet are necessary for nor- 
mal growth and life. These essential fatty acids include linoleic and y-linolenic 
acids. These must be obtained by mammals in their diet (specifically from plant 
sources). Arachidonic acid, which is not found in plants, can be synthesized by 
mammals only from linoleic acid. At least one function of the essential fatty acids is 
to serve as a precursor for the synthesis of eicosanoids, such as prostaglandins, a 
class of compounds that exert hormonelike effects in many physiological processes 
(discussed in Chapter 24). 

Fats in the modern human diet vary widely in their fatty acid composition (Table 
8.2). The incidence of cardiovascular disease is correlated with diets high in satu- 
rated fatty acids. By contrast, a diet relatively higher in unsaturated fatty acids (es- 
pecially polyunsaturated fatty acids) may reduce the risk of heart attacks and 
strokes. Although vegetable oils usually contain a higher proportion of unsatu- 
rated fatty acids than do animal oils and fats, several plant oils are actually high in 
saturated fats. Palm oil is low in polyunsaturated fatty acids and particularly high 
in (saturated) palmitic acid (hence the name palmitic). Coconut oil is particularly 
high in lauric and myristic acids (both saturated) and contains little unsaturated 
fatty acid. 

Canola oil has been promoted as a healthy dietary oil because it consists primar- 
ily of oleic acid (60%), linoleic acid (20%), and o-linolenic acid (9%) with very low 
saturated fat content (7%). Canola oil is actually rapeseed oil, from the seeds of the 
rape plant Brassica rapa (from the Latin rapa, meaning “turnip”), a close relative of 
mustard, kale, cabbage, and broccoli. Asians and Europeans used rapeseed oil in 
lamps for hundreds of years, but it was not usually considered edible because of its 
high erucic acid content, a 22:14!8 monounsaturated fatty acid (often 20% to 60%). 
In the first half of the 20th century, it was used as a steam engine lubricant (espe- 
cially in World War II). Conventional breeding techniques have reduced the erucic 
acid content to less than 1%, producing the “canola oil” (the name is derived from 
Canadian oil, low acid) now used so commonly for cooking and baking. 
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CENGAGENOW” ANIMATED FIGURE 8.1 The structures of some typical fatty acids. Note that most natural fatty 
acids contain an even number of carbon atoms and that the double bonds are nearly always cis and rarely 
conjugated. See this figure animated at www.cengage.com/login. 








TABLE 8.2 { 

Lauric and 
Source Myristic Palmitic Stearic Oleic Linoleic 
Beef 5 24-32 20-25 37-43 2-3 
Milk 25 12 33 3 
Coconut 74 10 2 7 
Corn 8-12 3-4 19-49 34-62 
Olive 9 2 84 4 
Palm 39 4 40 8 
Safflower 6 3 13 78 
Soybean 9 6 20 52 
Sunflower 6 1l 21 66 











Data from Merck Index, 10th ed. Rahway, NJ: Merck and Co.; and Wilson, E. D., et al., 1979, Principles of Nutrition, 4th ed. New 
York: Wiley. 
*Values are percentages of total fatty acids. 
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Elaidic acid 
trans double bond at 9,10 position 


H OH 


Vaccenic acid 
trans double bond at 11,12 position 


FIGURE 8.2 Structures of elaidic acid and vaccenic acid, 
two trans fatty acids. 


HC —CH— CH, 


HC — CH — CH, 


Although most unsaturated fatty acids in nature are cis fatty acids, trans fatty acids 
are formed by some bacteria via double-bond migration and isomerization. These 
bacterial reactions produce trans fats in ruminant animals (which carry essential 
bacteria in their rumen), and butter, milk, cheese and the meat of these animals 
contain modest quantities of trans fats (typically 2% to 8% by weight), such as those 
in Figure 8.2. Margarine and other “processed fats,” made by partial hydrogenation 
of polyunsaturated oils (for example, corn, safflower, and sunflower) contain sub- 
stantial levels of various trans fats, and clinical research has shown that chronic con- 
sumption of processed foods containing partially hydrogenated vegetable oils can 
contribute to cardiovascular disease. Diets high in trans fatty acids raise plasma low- 
density lipoprotein (LDL) cholesterol levels and triglyceride levels while lowering 
high-density lipoprotein (HDL) cholesterol levels. The effects of trans fatty acids on 
LDL, HDL, and cholesterol levels are similar to those of saturated fatty acids. Diets 
aimed at reducing the risk of coronary heart disease should be low in both trans and 
saturated fatty acids. 


8:25 What Are the Structures and Chemistry 
of Triacylglycerols? 


A significant number of the fatty acids in plants and animals exist in the form of tri- 
acylglycerols (also called triglycerides). Triacylglycerols are a major energy reserve 
and the principal neutral derivatives of glycerol found in animals. These molecules 
consist of a glycerol esterified with three fatty acids (Figure 8.3). If all three fatty acid 
groups are the same, the molecule is called a simple triacylglycerol. Examples in- 
clude tristearoylglycerol (common name tristearin) and trioleoylglycerol (triolein). 
Mixed triacylglycerols contain two or three different fatty acids. Triacylglycerols in 
animals are found primarily in the adipose tissue (body fat), which serves as a depot 
or storage site for lipids. Monoacylglycerols and diacylglycerols also exist, but they 
are far less common than the triacylglycerols. Most natural plant and animal fat is 
composed of mixtures of simple and mixed triacylglycerols. 

Acylglycerols can be hydrolyzed by heating with acid or base or by treatment with 
lipases. Hydrolysis with alkali is called saponification and yields salts of free fatty 
acids and glycerol. This is how our ancestors made soap (a metal salt of an acid de- 
rived from fat). One method used potassium hydroxide (potash) leached from 
wood ashes to hydrolyze animal fat (mostly triacylglycerols). (The tendency of such 
soaps to be precipitated by Mg** and Ca?* ions in hard water makes them less use- 
ful than modern detergents.) When the fatty acids esterified at the first and third 
carbons of glycerol are different, the second carbon is asymmetric. The various acyl- 
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A DEEPER LOOK 





Polar Bears Prefer Nonpolar Food 


The polar bear is magnificently adapted to thrive in its harsh Arc- 
tic environment. Research by Malcolm Ramsay (at the University 
of Saskatchewan in Canada) and others has shown that polar 
bears eat only during a few weeks out of the year and then fast 
for periods of 8 months or more, consuming no food or water 
during that time. Eating mainly in the winter, the adult polar 
bear feeds almost exclusively on seal blubber (largely composed 
of triacylglycerols), thus building up its own triacylglycerol re- 
serves. Through the Arctic summer, the polar bear maintains 
normal physical activity, roaming over long distances, but relies 
entirely on its body fat for sustenance, burning as much as 1 to 
1.5 kg of fat per day. It neither urinates nor defecates for ex- 
tended periods. All the water needed to sustain life is provided 
from the metabolism of triacylglycerols (because oxidation of 
fatty acids yields carbon dioxide and water). 

Ironically, the word Arctic comes from the ancient Greeks, who 
understood that the northernmost part of the earth lay under the 
stars of the constellation Ursa Major, the Great Bear. Although un- 
aware of the polar bear, they called this region Arktikós, which 
means “the country of the great bear.” 








© John Shaw/Photo Researchers, Inc. 
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glycerols are normally soluble in benzene, chloroform, ether, and hot ethanol. Al- 
though triacylglycerols are insoluble in water, monoacylglycerols and diacylglycerols 
readily form organized structures in water (see Chapter 9), owing to the polarity of 
their free hydroxyl groups. 

Triacylglycerols are rich in highly reduced carbons and thus yield large amounts 
of energy in the oxidative reactions of metabolism. Complete oxidation of 1 g of tri- 
acylglycerols yields about 38 kJ of energy, whereas proteins and carbohydrates yield 
only about 17 kJ/g. Also, their hydrophobic nature allows them to aggregate in 
highly anhydrous forms, whereas polysaccharides and proteins are highly hydrated. 
For these reasons, triacylglycerols are the molecules of choice for energy storage in 
animals. Body fat (mainly triacylglycerols) also provides good insulation. Whales 
and Arctic mammals rely on body fat for both insulation and energy reserves. 


8:35 What Are the Structures and Chemistry 
of Glycerophospholipids? 


A 1,2-diacylglycerol that has a phosphate group esterified at carbon atom 3 of the 
glycerol backbone is a glycerophospholipid, also known as a phosphoglyceride or a 
glycerol phosphatide (Figure 8.4). These lipids form one of the largest and most im- 
portant classes of natural lipids. They are essential components of cell membranes 
and are found in small concentrations in other parts of the cell. It should be noted 


FIGURE 8.4 Phosphatidic acid, the parent compound for glycerophospholipids. 
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CENGAGENOW” ACTIVE FIGURE 8.5 (a) The two identi- 
cal —CH,OH groups on the central carbon of glycerol 
may be distinguished by imagining a slight increase in pri- 
ority for one of them (by replacement of an H by a D) as 
shown. (b) The absolute configuration of sn-glycerol-3- 
phosphate is shown. The pro-R and pro-S positions of the 
parent glycerol are also indicated. Test yourself on the 
concepts in this figure at www.cengage.com/login 
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that all glycerophospholipids are members of the broader class of lipids known as 
phospholipids. 

The numbering and nomenclature of glycerophospholipids present a dilemma in 
that the number 2 carbon of the glycerol backbone of a phospholipid is asymmetric. 
It is possible to name these molecules either as D- or L-isomers. Thus, glycerol phos- 
phate itself can be referred to either as D-glycerol-l-phosphate or as L-glycerol-3- 
phosphate (Figure 8.5). Instead of naming the glycerol phosphatides in this way, 
biochemists have adopted the stereospecific numbering or sn- system. The stereospe- 
cific numbering system is based on the concept of prochirality. If a tetrahedral cen- 
ter in a molecule has two identical substituents, it is referred to as prochiral because 
if either of the like substituents is converted to a different group, the tetrahedral 
center then becomes chiral. Consider glycerol (Figure 8.5): The central carbon of 
glycerol is prochiral because replacing either of the —CH»OH groups would make 
the central carbon chiral. Nomenclature for prochiral centers is based on the (R,S) 
system (see Chapter 4). To name the otherwise identical substituents of a prochiral 
center, imagine increasing slightly the priority of one of them (by substituting a deu- 
terium for a hydrogen, for example) as shown in Figure 8.5. The resulting molecule 
has an (S) configuration about the (now chiral) central carbon atom. The group 
that contains the deuterium is thus referred to as the pro-S group. As a useful exer- 
cise, you should confirm that labeling the other CHOH group with a deuterium 
produces the (R) configuration at the central carbon so that this latter CHOH 
group is the pro-R substituent. 

Now consider the two presentations of glycerol phosphate in Figure 8.5. In the 
stereospecific numbering system, the pro-S position of a prochiral atom is denoted 
as the I-position, the prochiral atom as the 2-position, and so on. When this scheme 
is used, the prefix sn- precedes the molecule name (glycerol phosphate in this case) 
and distinguishes this nomenclature from other approaches. In this way, the glyc- 
erol phosphate in natural phosphoglycerides is named sn-glycerol-3-phosphate. 


Glycerophospholipids Are the Most Common Phospholipids 


Phosphatidic acid, the parent compound for the glycerol-based phospholipids 
(Figure 8.4), consists of sn-glycerol-3-phosphate, with fatty acids esterified at the 
l- and 2-positions. Phosphatidic acid is found in small amounts in most natural 
systems and is an important intermediate in the biosynthesis of the more common 
glycerophospholipids (Figure 8.6). In these compounds, a variety of polar groups 
are esterified to the phosphoric acid moiety of the molecule. The phosphate, 
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CENGAGENOW™ ANIMATED FIGURE 8.6 
Structures of several glycerophospholipids 
and space-filling models of phosphatidyl- 
choline, phosphatidylglycerol, and phos- 
phatidylinositol. See this figure animated 
at www.cengage.com/login. 





together with such esterified entities, is referred to as a “head” group. Phos- 
phatides with choline or ethanolamine are referred to as phosphatidylcholine 
(known commonly as lecithin) or phosphatidylethanolamine, respectively. These 
phosphatides are two of the most common constituents of biological membranes. 
Other common head groups found in phosphatides include glycerol, serine, 
and inositol (Figure 8.6). Another kind of glycerol phosphatide found in many tis- 
sues is diphosphatidylglycerol. First observed in heart tissue, it is also called 
cardiolipin. In cardiolipin, a phosphatidylglycerol is esterified through the C-1 hy- 
droxyl group of the glycerol moiety of the head group to the phosphoryl group of 
another phosphatidic acid molecule. 

Phosphatides exist in many different varieties, depending on the fatty acids ester- 
ified to the glycerol group. As we shall see, the nature of the fatty acids can greatly 
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affect the chemical and physical properties of the phosphatides and the membranes 
that contain them. In most cases, glycerol phosphatides have a saturated fatty acid at 
position 1 and an unsaturated fatty acid at position 2 of the glycerol. Thus, 1-stearoyl- 
2-oleoyl-phosphatidylcholine (Figure 8.7) is a common constituent in natural mem- 
branes, but 1-linoleoyl-2-palmitoylphosphatidylcholine is not. 

Both structural and functional strategies govern the natural design of the many 
different kinds of glycerophospholipid head groups and fatty acids. The structural 
roles of these different glycerophospholipid classes are described in Chapter 9. Cer- 
tain phospholipids, including phosphatidylinositol and phosphatidylcholine, par- 
ticipate in complex cellular signaling events. These roles are described in Section 
8.8 and Chapter 32. 





URES Wap-celing msde ori aat Ether Glycerophospholipids Include PAF and Plasmalogens 


phosphatidylcholine. Ether glycerophospholipids possess an ether linkage instead of an acyl group at the 
C-1 position of glycerol (Figure 8.8a). One of the most versatile biochemical signal 
molecules found in mammals is platelet-activating factor, or PAF, a unique ether 
glycerophospholipid (Figure 8.8b). The alkyl group at C-1 of PAF is typically a 
16-carbon chain, but the acyl group at C-2 is a 2-carbon acetate unit. By virtue of this 
acetate group, PAF is much more water soluble than other lipids, allowing PAF to 
function as a soluble messenger in signal transduction. 
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FIGURE 8.8 (a) A 1-alkyl 2-acyl-phosphatidylethanolamine (an ether glycerophospholipid). (b) The structure of 
1-alkyl 2-acetyl-phosphatidylcholine, also known as platelet-activating factor or PAF. 


8.4 What Are Sphingolipids, and How Are They Important for Higher Animals? 


HUMAN BIOCHEMISTRY 





Platelet-Activating Factor: A Potent Glyceroether Mediator 


Platelet-activating factor (PAF) was first identified by its ability (at 
low levels) to cause platelet aggregation and dilation of blood ves- 
sels, but it is now known to be a potent mediator in inflammation, 
allergic responses, and shock. PAF effects are observed at tissue 
concentrations as low as 107!°M. PAF causes a dramatic inflam- 
mation of air passages and induces asthmalike symptoms in labo- 
ratory animals. Toxic shock syndrome occurs when fragments of 
destroyed bacteria act as toxins and induce the synthesis of PAF. 


blood pumped by the heart, which leads to shock and, in severe 
cases, death. 

Beneficial effects have also been attributed to PAF. In repro- 
duction, PAF secreted by the fertilized egg is instrumental in the 
implantation of the egg in the uterine wall. PAF is produced in 
significant quantities in the lungs of the fetus late in pregnancy 
and may stimulate the production of fetal lung surfactant, a 
protein-lipid complex that prevents collapse of the lungs in a 
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PAF causes a drop in blood pressure and a reduced volume of newborn infant. 
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FIGURE 8.9 The structure and a space-filling model of a choline plasmalogen. 


Plasmalogens are ether glycerophospholipids in which the alkyl moiety is cis-a,B- 
unsaturated (Figure 8.9). Common plasmalogen head groups include choline, 
ethanolamine, and serine. These lipids are referred to as phosphatidal choline, 
phosphatidal ethanolamine, and phosphatidal serine. 


8:45 What Are Sphingolipids, and How Are They Important 
for Higher Animals? 


Sphingolipids represent another class of lipids frequently found in biological mem- 
branes. An 18-carbon amino alcohol, sphingosine (Figure 8.10a), forms the back- 
bone of these lipids rather than glycerol. Typically, a fatty acid is joined to a sphin- 
gosine via an amide linkage to form a ceramide (Figure 8.10b). Sphingomyelins 
represent a phosphorus-containing subclass of sphingolipids and are especially 
important in the nervous tissue of higher animals. A sphingomyelin is formed 
by the esterification of a phosphorylcholine or a phosphorylethanolamine to the 
1-hydroxy group of a ceramide (Figure 8.10c). 

There is another class of ceramide-based lipids that, like the sphingomyelins, 
are important components of muscle and nerve membranes in animals. These are 
the glycosphingolipids, and they consist of a ceramide with one or more sugar 
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FIGURE 8.10 Sphingolipids are based on the structure of sphingosine. A ceramide with a phosphocholine 
head group is a choline sphingomyelin. A ceramide with a single sugar is a cerebroside. Gangliosides are cera- 
mides with three or more sugars esterified, one of which is a sialic acid. 


residues in a -glycosidic linkage at the 1-hydroxyl moiety. The neutral glyco- 
sphingolipids contain only neutral (uncharged) sugar residues. When a single glu- 
cose or galactose is bound in this manner, the molecule is a cerebroside (Figure 
8.10d). Another class of lipids is formed when a sulfate is esterified at the 3-position 
of the galactose to make a sulfatide. Gangliosides (Figure 8.10e) are more complex 
glycosphingolipids that consist of a ceramide backbone with three or more sugars 
esterified, one of these being a sialic acid such as N-acetylneuraminic acid. These 


8.6 What Are Terpenes, and What Is Their Relevance to Biological Systems? 





A DEEPER LOOK 


Moby Dick and Spermaceti: A Valuable Wax from Whale Oil 


When oil from the head of the sperm whale is cooled, spermaceti, 
a translucent wax with a white, pearly luster, crystallizes from the 
mixture. Spermaceti, which makes up 11% of whale oil, is com- 
posed mainly of the wax cetyl palmitate: 


CH; (CH) gz ~COO— (CHo) 15CH3 


as well as smaller amounts of cetyl alcohol: 


In the literary classic Moby Dick, Herman Melville describes 
Ishmael’s impressions of spermaceti, when he muses that the 
waxes “discharged all their opulence, like fully ripe grapes their 
wine; as I snuffed that uncontaminated aroma—literally and truly, 
like the smell of spring violets.”* 


“Melville, H., 1984. Moby Dick. London: Octopus Books, p. 205. (Adapted 
from Waddell, T. G., and Sanderlin, R. R., 1986. Chemistry in Moby Dick. 
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HO (CH3) CH; Journal of Chemical Education 63:1019-1020.) 


Spermaceti and cetyl palmitate have been widely used in the mak- 
ing of cosmetics, fragrant soaps, and candles. 








latter compounds are referred to as acidic glycosphingolipids, and they have a net 
negative charge at neutral pH. 

The glycosphingolipids have a number of important cellular functions, despite 
the fact that they are present only in small amounts in most membranes. Glyco- 
sphingolipids at cell surfaces appear to determine, at least in part, certain elements 
of tissue and organ specificity. Cell—cell recognition and tissue immunity depend on 
specific glycosphingolipids. Gangliosides are present in nerve endings and are im- 
portant in nerve impulse transmission. A number of genetically transmitted diseases 
involve the accumulation of specific glycosphingolipids due to an absence of the en- 
zymes needed for their degradation. Such is the case for ganglioside Gy in the 
brains of Tay-Sachs disease victims, a rare but fatal childhood disease characterized 
by a red spot on the retina, gradual blindness, and self-mutilation. 


8:51 What Are Waxes, and How Are They Used? 


Waxes are esters of long-chain alcohols with long-chain fatty acids. The resulting 
molecule can be viewed (in analogy to the glycerolipids) as having a weakly polar 
head group (the ester moiety itself) and a long, nonpolar tail (the hydrocarbon 
chains) (Figure 8.11). Fatty acids found in waxes are usually saturated. The alcohols 
found in waxes may be saturated or unsaturated and may include sterols, such as 
cholesterol (see later section). Waxes are water insoluble due to their mostly hy- 
drocarbon composition. As a result, this class of molecules confers water-repellant 
character to animal skin, to the leaves of certain plants, and to bird feathers. The 
glossy surface of a polished apple results from a wax coating. Carnauba wax, ob- 
tained from the fronds of a species of palm tree in Brazil, is a particularly hard wax 
used for high-gloss finishes, such as in automobile wax, boat wax, floor wax, and 
shoe polish. Lanolin,! a wool wax, is used as a base for pharmaceutical and cosmetic 
products because it is rapidly assimilated by human skin. The brand name Oil of 
Olay® was coined by Graham Wulff, a South African chemist who developed it. The 
name refers to lanolin, a key ingredient. 


8:65 What Are Terpenes, and What Is Their Relevance 
to Biological Systems? 


The terpenes are a class of lipids formed from combinations of two or more mole- 
cules of 2-methyl-1,3-butadiene, better known as isoprene (a five-carbon unit that is 
abbreviated C5). A monoterpene (C10) consists of two isoprene units, a sesquiterpene 


Lanolin is a complex mixture of waxes with 33 different alcohols esterified to 36 different fatty acids. 
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Oleoyl stearate FIGURE 8.11 Waxes consist of long-chain alcohols esterified to long-chain fatty acids. Triacontanol palmitate is 


the principal component of beeswax. Waxes are components of the waxy coating on the leaves of plants, such 
as jade plants (shown here). Such species typically contain dozens of different waxy esters. 
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FIGURE 8.12 The structure of isoprene (2-methyl-1,3- | = 
butadiene) and the structure of head-to-tail and tail-to- J/N 
tail linkages. lsoprene itself can be formed by distillation HC CH 
of natural rubber, a linear head-to-tail polymer of iso- 
prene units. Isoprene Geraniol 
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CENGAGENOW” ACTIVE FIGURE 8.13 Many monoterpenes are readily recognized by their characteristic flavors 
or odors (limonene in lemons; citronellal in roses, geraniums, and some perfumes; and menthol from peppermint, 
used in cough drops and nasal inhalers). The diterpenes, which are Cz terpenes, include retinal (the essential 
light-absorbing pigment in rhodopsin, the photoreceptor protein of the eye), and phytol (a constituent of 
chlorophyll). The triterpene lanosterol is a constituent of wool fat. Lycopene is a carotenoid found in ripe fruit, 
especially tomatoes. Test yourself on the concepts in this figure at www.cengage.com/login 


© Steven Lunetta Photography, 2007 


8.6 What Are Terpenes, and What Is Their Relevance to Biological Systems? 


A DEEPER LOOK 


Why Do Plants Emit Ilsoprene? 


The Blue Ridge Mountains of Virginia are so named for the misty 
blue vapor or haze that hangs over them through much of the 
summer season. This haze is composed in part of isoprene that 
is produced and emitted by the plants and trees of the moun- 
tains. Global emission of isoprene from vegetation is estimated at 
3 X 10 g/yr. Plants frequently emit as much as 15% of the car- 
bon fixed in photosynthesis as isoprene, and Thomas Sharkey, a 
botanist at the University of Wisconsin, has shown that the kudzu 
plant can emit as much as 67% of its fixed carbon as isoprene as 
the result of water stress. Why should plants and trees emit large 
amounts of isoprene and other hydrocarbons? Sharkey has 
shown that an isoprene atmosphere or “blanket” can protect 
leaves from irreversible damage induced by high (summerlike) 
temperatures. He hypothesizes that isoprene in the air around 
plants dissolves into leaf-cell membranes, altering the lipid bi- 
layer and/or lipid-protein and protein-protein interactions 
within the membrane to increase thermal tolerance. 








A Blue Ridge Mountains 
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(Cis) consists of three isoprene units, a diterpene (Co) has four isoprene units, and 
so on. Isoprene units can be linked in terpenes to form straight-chain or cyclic mol- 
ecules, and the usual method of linking isoprene units is head to tail (Figure 8.12). 
Monoterpenes occur in all higher plants, whereas sesquiterpenes and diterpenes are 
less widely known. Several examples of these classes of terpenes are shown in Figure 
8.13. The triterpenes are Cg terpenes and include squalene and lanosterol, two of 
the precursors of cholesterol and other steroids (discussed later). Tetraterpenes 
(C10) are less common but include the carotenoids, a class of colorful photosynthetic 
pigments. 6-Carotene is the precursor of vitamin A, whereas lycopene, similar to 
b-carotene, is a pigment found in tomatoes. 

Long-chain polyisoprenoid molecules with a terminal alcohol moiety are called 
polyprenols. The dolichols, one class of polyprenols (Figure 8.14), consist of 16 to 
22 isoprene units and, in the form of dolichyl phosphates, function to carry carbo- 
hydrate units in the biosynthesis of glycoproteins in animals. Polyprenyl groups 
serve to anchor certain proteins to biological membranes (discussed in Chapter 9). 


The Membranes of Archaea Are Rich in Isoprene-Based Lipids Archaea are 
found primarily in harsh environments. Some thrive in the high temperatures of 
geysers and ocean steam vents, whereas others are found in extremely acidic, cold, 
or salty environments. Archaea also live in extremes of pH in the digestive tracts 
of cows, termites, and humans. Archaea are ideally adapted to their harsh 
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Undecaprenyl alcohol (bactoprenol) 


FIGURE 8.14 Dolicho! phosphate is an initiation point 
for the synthesis of carbohydrate polymers in animals. 
The analogous alcohol in bacterial systems, undeca- 
prenol, also known as bactoprenol, consists of 11 iso- 
prene units. Undecaprenyl phosphate delivers sugars 
from the cytoplasm for the synthesis of cell wall compo- 
nents such as peptidoglycans, lipopolysaccharides, and 
glycoproteins. 
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HUMAN BIOCHEMISTRY 


Coumadin or Warfarin—Agent of Life or Death 


The isoprene-derived molecule whose structure is shown here is oO (0) 
known alternately as Coumadin and warfarin. By the former O 
name, it is a widely prescribed anticoagulant. By the latter name, L I 


it is a component of rodent poisons. How can the same chemi- 
cal species be used for such disparate purposes? The key to both 
uses lies in its ability to act as an antagonist of vitamin K in the HO 
body. 

Vitamin K is necessary for the carboxylation of glutamate 


; : a 3 E fari di 
residues on certain proteins, including some proteins in the blood- Warfänin (Coumtadin) 


clotting cascade (including prothrombin, factor VII, factor IX, and y-carboxy-Glu 
factor X, which undergo a Ca°t-dependent conformational change © 
in the course of their biological activity, as well as protein C and RS eye 
protein S, two regulatory proteins in coagulation). Carboxylation | 

of these coagulation factors is catalyzed by a carboxylase that re- H | 

quires the reduced form of vitamin K (vitamin KH»), molecular O Wee O 





oxygen, and carbon dioxide. KH is oxidized to vitamin K epoxide, 
which is recycled to KH» by the enzymes vitamin K epoxide reduc- 
tase (1) and vitamin K reductase (2, 3). Coumadin/warfarin exerts 
its anticoagulant effect by inhibiting vitamin K epoxide reductase 
and possibly also vitamin K reductase. This inhibition depletes vit- 


amin KHy, and reduces the activity of the carboxylase. COs 
Coumadin/warfarin, given at a typical dosage of 4 to 5 mg/day, Warfarin 

prevents the deleterious formation in the bloodstream of small resistant 

blood clots and thus reduces the risk of heart attacks and strokes for K —, > KH, 


individuals whose arteries contain sclerotic plaques. Taken in much 


larger doses, as for example in rodent poisons, Coumadin/warfarin 
can cause massive hemorrhages and death. 
2 
K 
CI 


inhibits 











environments, and one such adaptation is found in their cell membranes, which 
contain isoprene-based lipids (Figure 8.15). These isoprene chains are linked at 
both ends by ether bonds to glycerols. Ether bonds are more stable to hydrolysis 
than the ester linkages of glycerophospholipids (Figure 8.6). With a length twice 
that of typical glycerophospholipids, these molecules can completely span a cell 
membrane, providing additional stability. Interestingly, the glycerols in archaeal 
lipids are in the (R) configuration, whereas glycerolipids of animals, plants, and eu- 
bacteria are almost always in the (S) configuration. 
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FIGURE 8.15 The structure of caldarchaeol, an isoprene-based lipid found in archaea. 
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81/5 What Are Steroids, and What Are Their 
Cellular Functions? 


Cholesterol 


A large and important class of terpene-based lipids is the steroids. This molecu- 
lar family, whose members affect an amazing array of cellular functions, is based 
on a common structural motif of three 6-membered rings and one 5-membered 
ring all fused together. Cholesterol (Figure 8.16) is the most common steroid in 
animals and the precursor for all other animal steroids. The numbering system 
for cholesterol applies to all such molecules. Many steroids contain methyl 
groups at positions 10 and 13 and an 8- to 10-carbon alkyl side chain at position 
17. The polyprenyl nature of this compound is particularly evident in the side 
chain. Many steroids contain an oxygen at C-3, either a hydroxyl group in sterols 
or a carbonyl group in other steroids. Significantly, the carbons at positions 10 
and 13 and the alkyl group at position 17 are nearly always oriented on the same 
side of the steroid nucleus, the B-orientation. Alkyl groups that extend from the 
other side of the steroid backbone are in an a-orientation. 

Cholesterol is a principal component of animal cell plasma membranes, and 
smaller amounts of cholesterol are found in the membranes of intracellular or- 
ganelles. The relatively rigid fused ring system of cholesterol and the weakly polar 
alcohol group at the C-3 position have important consequences for the properties 
of plasma membranes. Cholesterol is also a component of lipoprotein complexes in 
the blood, and it is one of the constituents of plaques that form on arterial walls in 
atherosclerosis. 


Steroid Hormones Are Derived from Cholesterol 


Steroids derived from cholesterol in animals include five families of hormones (the 
androgens, estrogens, progestins, glucocorticoids, and mineralocorticoids) and bile 
acids (Figure 8.17). Androgens such as testosterone and estrogens such as estradiol 
mediate the development of sexual characteristics and sexual function in animals. 
The progestins such as progesterone participate in control of the menstrual cycle 
and pregnancy. Glucocorticoids (cortisol, for example) participate in the control of 
carbohydrate, protein, and lipid metabolism, whereas the mineralocorticoids regu- 
late salt (Na*, K+, and Cl) balances in tissues. The bile acids (including cholic and 
deoxycholic acid) are detergent molecules secreted in bile from the gallbladder that 
assist in the absorption of dietary lipids in the intestine. 








Cholesterol 


FIGURE 8.16 The structure of cholesterol, shown with 
steroid ring designations and carbon numbering. 
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8:8» How Do Lipids and Their Metabolites Act 
as Biological Signals? 


Glycerophospholipids and sphingolipids play important structural roles as the prin- 
cipal components of biological membranes (see Chapter 9). However, their modi- 
fication and breakdown also produce an eclectic assortment of substances that act 
as powerful chemical signals (Figures 8.18 and 8.19). In contrast to the steroid hor- 
mones (Figure 8.17), which travel from tissue to tissue in the blood to exert their 
effects, these lipid metabolites act locally, either within the cell in which they are 
made or on nearby cells. Signal molecules typically initiate a cascade of reactions 
with multiple possible effects, and the lifetimes of these powerful signals in or near 
a cell are usually very short. Thus, the creation and breakdown of signal molecules 
is almost always carefully timed and regulated. 
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FIGURE 8.18 (a) Phospholipases A; and A; cleave fatty acids from a glycerophospholipid, producing lyso- 
phospholipids. Phospholipases C and D hydrolyze on either side of the phosphate in the polar head group. 
(b) Phospholipases are components of the venoms of many poisonous snakes. The pain and physiological 
consequences of a snake bite partly result from breakdown of cell membranes by phospholipases. 
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A DEEPER LOOK 


Glycerophospholipid Degradation: One of the Effects of Snake Venom 


The venoms of poisonous snakes contain (among other things) a catalyzes the hydrolysis of fatty acids at the C-2 position of glyc- 
class of enzymes known as phospholipases, enzymes that cause — erophospholipids (Figure 8.18). The phospholipid breakdown 
the breakdown of phospholipids. For example, the venoms of the product of this reaction, lysolecithin, acts as a detergent and dis- 
eastern diamondback rattlesnake (Crotalus adamanteus) and the solves the membranes of red blood cells, causing them to rupture. 
Indian cobra (Naja naja) both contain phospholipase A», which Indian cobras kill several thousand people each year. 








Enzymes known as phospholipases hydrolyze the ester bonds of glycerophos- 
pholipids as shown in Figure 8.18. Phospholipases Ay and Ay remove fatty acid 
chains from the 1- and 2-positions of glycerophospholipids, respectively. Phospholi- 
pases C and D attack the polar head group of a glycerophospholipid. Hydrolysis of 
inositol phospholipids by phospholipase C produces a diacylglycerol and inositol- 
1,4,5-trisphosphate (IP3) (Figure 8.19), two signal molecules whose combined ac- 
tions trigger signaling cascades that regulate many cell processes (see Chapter 32). 
Action of phospholipase Ay on a phosphatidic acid releases a fatty acid and a 
lysophosphatidic acid (LPA, Figure 8.19). If the fatty acid is arachidonic acid, fur- 
ther chemical modifications can produce a family of 20-carbon compounds—that 
is, eicosanoids. The eicosanoids are local hormones produced as a response to in- 
jury and inflammation. They exert their effects on cells near their sites of synthesis 
(see Chapter 24). LPA produced outside the cell is a signal that can bind to recep- 
tor proteins on nearby cells, thereby regulating a host of processes, including brain 
development, cell proliferation and survival, and olfaction (the “sense of smell”). 

Sphingolipids can also be modified or broken down to produce chemical signals. 
Sphingosine itself can be phosphorylated to produce sphingosine-1-phosphate 
(SIP) inside cells (Figure 8.20). SIP may either exert a variety of intracellular effects 
or may be excreted from the cell, where it can bind to membrane receptor proteins, 
either on adjacent cells or on the cell from which the S1P was released. Excreted 
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FIGURE 8.19 Modification and breakdown of glycerophospholipids produce a variety of signals and regulatory 
effects. Phospholipase A; cleaves a fatty acid from phosphatidic acid to produce lysophosphatidic acid 

(LPA), which can act as an extracellular signal. If the fatty acid released is arachidonic acid, it can be the 
substrate for synthesis of prostaglandins, thromboxanes, and leukotrienes. Phospholipase C action on 
phosphatidylinositol-4,5-bisphosphate produces diacylglycerol and inositol-1,4,5-trisphosphate, two signal 
molecules that work together to active protein kinases—enzymes that phosphorylate other proteins in 
signaling pathways. 
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HUMAN BIOCHEMISTRY 


Plant Sterols and Stanols—Natural Cholesterol Fighters 


Dietary guidelines for optimal health call for reducing the choles- 
terol intake. One strategy involves eating plant sterols and stanols 
in place of cholesterol-containing fats such as butter (figure). De- 
spite their structural similarity to cholesterol, minor isomeric dif- 
ferences and the presence of methyl and ethyl groups in the side 
chains of these substances result in their poor absorption by in- 
testinal mucosal cells. Interestingly, stanols are even less well ab- 
sorbed than their sterol counterparts. Both sterols and stanols 
bind to cholesterol receptors on intestinal cells and block the ab- 
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sorption of cholesterol itself. Stanols esterified with long-chain 
fatty acids form micelles (see page 244) that are more effectively 
distributed in the fat phase of the food digest and provide the 
most effective blockage of cholesterol uptake. (Stanols are fully re- 
duced sterols.) Raisio Group, a Finnish company, has developed 
Benecol, a stanol ester spread that can lower LDL cholesterol by 
up to 14% if consumed daily (see graph). McNeil Nutritionals has 
partnered with Raisio Group to market Benecol in the United 
States. 
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< Serum cholesterol levels before and after the con- 
sumption of margarine with and without sitostanol 
ester for 12 months. Green circles: 0 g/day. Red 
squares: 2.6 g/day. Blue triangles: 1.8 g/day. Note: The 
y-axis begins at 200 mg cholesterol/dL. (Adapted from 
Miettinen, T. A., et al., 1995. Reduction of serum cholesterol with 
sitostanol-ester margarine in a mildly hypercholesterolemic population. 
New England Journal of Medicine 333:1308-1312.) 
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Sphingosine-1-phosphate (S1P) 


NH, 


CH3 





Fumonisin B; 


S1P binds to many different receptor proteins and provokes many different cell and 
tissue effects, among them inflammation in allergic reactions, heart rate, and move- 
ment and migration of certain cells. Sphingolipid signal molecules are carefully bal- 
anced and regulated in organisms, and chemical agents that disturb this balance 
can be highly toxic. For example, fumonisin is a common fungal contaminant of 
corn and corn-based products that inhibits sphingolipid biosynthesis (Figure 8.20; 
see also Chapter 24). Fumonisin can trigger esophageal cancer in humans and leu- 
coencephalomalacia, a fatal neurological disease in horses. 


8:9» What Can Lipidomics Tell Us about Cell, Tissue, 
and Organ Physiology? 


The crucial role of lipids in cells is demonstrated by the large number of human 
diseases that involve the disruption of lipid metabolic enzymes and pathways. Ex- 
amples of such diseases include atherosclerosis, diabetes, cancer, infectious dis- 
eases, and neurodegenerative diseases. Emerging analytical techniques are making 
possible the global analysis of lipids and their interacting protein partners in 
organs, cells, and organelles—an approach termed lipidomics. A typical cell may 
contain more than a thousand different lipids, each with a polar head and a hydro- 
phobic tail or tails. Despite this general similarity, proteins recognize lipids with 
exquisite specificity. Local concentrations of lipids vary between organelles and be- 
tween specific areas of cellular membranes. 

Complete understanding of lipid function, as well as alteration of such function 
in disease states, will require the determination of which lipids are present and in 


FIGURE 8.20 Structures of sphingosine-1-phosphate 
(S1P) and fumonisin B4. 
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HUMAN BIOCHEMISTRY 


17B-Hydroxysteroid Dehydrogenase 3 Deficiency 


Testosterone, the principal male sex steroid hormone, is synthe- 
sized in five steps from cholesterol, as shown in the following fig- 
ure. In the last step, five isozymes catalyze the 17B-hydroxysteroid 
dehydrogenase reaction that interconverts 4-androstenedione and 
testosterone. Defects in the synthesis or action of testosterone can 
impair the development of the male phenotype during embryoge- 
nesis and cause the disorders of human sexuality termed male 
pseudohermaphroditism. Specifically, mutations in isozyme 3 of 


the formation of testosterone and give rise to genetic males with 
female external genitalia and blind-ending vaginas. Such individ- 
uals are typically raised as females but virilize at puberty, due to an 
increase in serum testosterone, and develop male hair growth pat- 
terns. Fourteen different mutations of 178-hydroxysteroid dehy- 
drogenase 3 have been identified in 17 affected families in the 
United States, the Middle East, Brazil, and western Europe. These 
families account for about 45% of the patients with this disorder 


the 176-hydroxysteroid dehydrogenase in the fetal testes impair reported in scientific literature. 
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what concentrations in every intracellular location. The same knowledge will be 
needed about each lipid’s interaction partners. Mass spectrometric analyses of rat 
heart muscle reveal that the onset of diabetes results in dramatic changes in triglyc- 
eride levels, an increase in phosphatidylinositol levels, and a decrease in phos- 
phatidylethanolamine. On the other hand, mass spectrometric analyses of brain 
white matter in the very early stages of Alzheimer’s disease show a dramatic decrease 
in one type of plasmalogen and a threefold increase in ceramide levels. 

Cellular lipidomics provides a framework for understanding the myriad roles of 
lipids, which include (but are not limited to) membrane transport (see Chapter 9), 
metabolic regulation (see Chapters 18-27), and cell signaling (see Chapter 32). For 
example, six different classes of lipids have been shown to modulate systems im- 
portant in the regulation of pain responses. Each of these classes of lipids exerts its 
action by interacting with one or more receptor proteins. True understanding of 
the molecular basis for diseases and metabolic and physiologic conditions may re- 
quire comprehensive and simultaneous analyses of many lipid species and their re- 
spective receptors. 
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SUMMARY 


Lipids are a class of biological molecules defined by low solubility 
in water and high solubility in nonpolar solvents. As molecules that 
are largely hydrocarbon in nature, lipids represent highly reduced 
forms of carbon and, upon oxidation in metabolism, yield large 
amounts of energy. Lipids are thus the molecules of choice for meta- 
bolic energy storage. The lipids found in biological systems are either 
hydrophobic (containing only nonpolar groups) or amphipathic 
(containing both polar and nonpolar groups). The hydrophobic na- 
ture of lipid molecules allows membranes to act as effective barriers 
to more polar molecules. 


8.1 What Are the Structures and Chemistry of Fatty Acids? A fatty 
acid is composed of a long hydrocarbon chain (“tail”) and a terminal 
carboxyl group (“head”). The carboxyl group is normally ionized un- 
der physiological conditions. Fatty acids occur in large amounts in bi- 
ological systems but only rarely in the free, uncomplexed state. They 
typically are esterified to glycerol or other backbone structures. 


8.2 What Are the Structures and Chemistry of Triacylglycerols? A sig- 
nificant number of the fatty acids in plants and animals exist in the form 
of triacylglycerols (also called triglycerides). Triacylglycerols are a major 
energy reserve and the principal neutral derivatives of glycerol found in 
animals. These molecules consist of a glycerol esterified with three fatty 
acids. Triacylglycerols in animals are found primarily in the adipose tis- 
sue (body fat), which serves as a depot or storage site for lipids. Monoa- 
cylglycerols and diacylglycerols also exist, but they are far less common 
than the triacylglycerols. 


8.3 What Are the Structures and Chemistry of Glycerophospholipids? 
A 1,2-diacylglycerol that has a phosphate group esterified at carbon 
atom 3 of the glycerol backbone is a glycerophospholipid, also known as 
a phosphoglyceride or a glycerol phosphatide. These lipids form one of 
the largest and most important classes of natural lipids. They are essen- 
tial components of cell membranes and are found in small concentra- 
tions in other parts of the cell. All glycerophospholipids are members of 
the broader class of lipids known as phospholipids. 


8.4 What Are Sphingolipids, and How Are They Important for Higher 
Animals? Sphingolipids represent another class of lipids in biological 
membranes. An 18-carbon amino alcohol, sphingosine, forms the back- 
bone of these lipids rather than glycerol. Typically, a fatty acid is joined 
to a sphingosine via an amide linkage to form a ceramide. Sphingo- 
myelins are a phosphorus-containing subclass of sphingolipids espe- 
cially important in the nervous tissue of higher animals. A sphingo- 
myelin is formed by the esterification of a phosphorylcholine or a 
phosphorylethanolamine to the 1-hydroxy group of a ceramide. Glyco- 
sphingolipids are another class of ceramide-based lipids that, like the 
sphingomyelins, are important components of muscle and nerve mem- 
branes in animals. Glycosphingolipids consist of a ceramide with one or 
more sugar residues in a B-glycosidic linkage at the l-hydroxyl moiety. 


8.5 What Are Waxes, and How Are They Used? Waxes are esters of 
long-chain alcohols with long-chain fatty acids. The resulting molecule 
can be viewed (in analogy to the glycerolipids) as having a weakly polar 
head group (the ester moiety itself) and a long, nonpolar tail (the 


hydrocarbon chains). Fatty acids found in waxes are usually saturated. 
The alcohols found in waxes may be saturated or unsaturated and may 
include sterols, such as cholesterol. Waxes are water insoluble due to 
their predominantly hydrocarbon nature. 


8.6 What Are Terpenes, and What Is Their Relevance to Biological 
Systems? The terpenes are a class of lipids formed from combinations 
of two or more molecules of 2-methyl-1,3-butadiene, better known as 
isoprene (a five-carbon unit abbreviated C;). A monoterpene (C10) con- 
sists of two isoprene units, a sesquiterpene (Cj5) consists of three iso- 
prene units, a diterpene (Cə) has four isoprene units, and so on. Iso- 
prene units can be linked in terpenes to form straight-chain or cyclic 
molecules, and the usual method of linking isoprene units is head to 
tail. Monoterpenes occur in all higher plants, whereas sesquiterpenes 
and diterpenes are less widely known. 


8.7 What Are Steroids, and What Are Their Cellular Functions? A 
large and important class of terpene-based lipids is the steroids. This 
molecular family, whose members affect an amazing array of cellular 
functions, is based on a common structural motif of three 6-membered 
rings and one 5-membered ring all fused together. Cholesterol is the 
most common steroid in animals and the precursor for all other animal 
steroids. The numbering system for cholesterol applies to all such mol- 
ecules. The polyprenyl nature of this compound is particularly evident 
in the side chain. Many steroids contain an oxygen at C-3, either a hy- 
droxyl group in sterols or a carbonyl group in other steroids. The 
methyl groups at positions 10 and 13 and the alkyl group at position 
17 are usually oriented on the same side of the steroid nucleus, the 
B-orientation. Alkyl groups that extend from the other side of the 
steroid backbone are in an a-orientation. Cholesterol is a principal com- 
ponent of animal cell plasma membranes. Steroids derived from cho- 
lesterol in animals include five families of hormones (the androgens, 
estrogens, progestins, glucocorticoids, and mineralocorticoids) and 
bile acids. 


8.8 How Do Lipids and Their Metabolites Act as Biological Signals? 
Modification and breakdown of cellular lipids produce an eclectic as- 
sortment of substances that act as powerful chemical signals. Signal mol- 
ecules typically initiate a cascade of reactions with multiple possible ef- 
fects. The creation and breakdown of signal molecules is almost always 
carefully timed and regulated. Phospholipases initiate the production of 
a variety of lipid signals, including arachidonic acid (the precursor 
to eicosanoids), lysophosphatidic acid, inositol-1,4,5-trisphosphate, and 
diacylglycerol. 


8.9 What Can Lipidomics Tell Us about Cell, Tissue, and Organ Physi- 
ology? The comprehensive analysis of lipids and their interacting 
protein partners in organs, cells, and organelles is termed lipidomics. 
A typical cell may contain more than a thousand different lipids. Com- 
plete understanding of lipid function, as well as alteration of such func- 
tion in disease states, will require the determination of which lipids are 
present and in what concentrations in every intracellular location. The 
same knowledge will be needed about each lipid’s interaction partners. 


PROBLEMS 


CENGAGENOW’ Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login 
1. Draw the structures of (a) all the possible triacylglycerols that can 
be formed from glycerol with stearic and arachidonic acid and 
(b) all the phosphatidylserine isomers that can be formed from 
palmitic and linolenic acids. 


2. Describe in your own words the structural features of 
a. a ceramide and how it differs from a cerebroside. 
b. a phosphatidylethanolamine and how it differs from a phos- 
phatidylcholine. 
c. an ether glycerophospholipid and how it differs from a plas- 
malogen. 
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10. 


11. 


12. 


13. 


14. 


15. 


d. a ganglioside and how it differs from a cerebroside. 
e. testosterone and how it differs from estradiol. 


. From your memory of the structures, name 


a. the glycerophospholipids that carry a net positive charge. 
b. the glycerophospholipids that carry a net negative charge. 
c. the glycerophospholipids that have zero net charge. 


. Compare and contrast two individuals, one whose diet consists 


largely of meats containing high levels of cholesterol and the other 
whose diet is rich in plant sterols. Are their risks of cardiovascular 
disease likely to be similar or different? Explain your reasoning. 


. James G. Watt, Secretary of the Interior (1981-1983) in Ronald Rea- 


gan’s first term, provoked substantial controversy by stating publicly 
that trees cause significant amounts of air pollution. Based on your 
reading of this chapter, evaluate Watt’s remarks. 


. In a departure from his usual and highly popular westerns, author 


Louis L’Amour wrote a novel in 1987, Last of the Breed (Bantam 
Press), in which a military pilot of Native American ancestry is shot 
down over the former Soviet Union and is forced to use the survival 
skills of his ancestral culture to escape his enemies. On the rare 
occasions when he is able to trap and kill an animal for food, he 
selectively eats the fat, not the meat. Based on your reading of this 
chapter, what is his reasoning for doing so? 


. As you read Section 8.7, you might have noticed that phospholipase 


Av, the enzyme found in rattlesnake venom, is also the enzyme that 
produces essential and beneficial lipid signals in most organisms. Ex- 
plain the differing actions of phospholipase As in these processes. 


. Visit a grocery store near you, stop by the rodent poison section, 


and examine a container of warfarin or a related product. From 
what you can glean from the packaging, how much warfarin would 
a typical dog (40 lbs) have to consume to risk hemorrhages and/or 
death? 


. Refer to Figure 8.13 and draw each of the structures shown and try 


to identify the isoprene units in each of the molecules. (Note that 
there may be more than one correct answer for some of these mol- 
ecules, unless you have the time and facilities to carry out MC la- 
beling studies with suitable organisms.) 

(Integrates with Chapter 3.) As noted in the Deeper Look box on 
polar bears, a polar bear may burn as much as 1.5 kg of fat resources 
per day. What weight of seal blubber would you have to ingest if you 
were to obtain all your calories from this energy source? 

If you are still at the grocery store working on problem 8, stop by 
the cookie shelves and choose your three favorite cookies from 
the shelves. Estimate how many calories of fat, and how many 
other calories from other sources, are contained in 100 g of each 
of these cookies. Survey the ingredients listed on each package, 
and describe the contents of the package in terms of (a) satu- 
rated fat, (b) cholesterol, and (c) trans fatty acids. (Note that 
food makers are required to list ingredients in order of decreas- 
ing amounts in each package.) 

Describe all of the structural differences between cholesterol and 
stigmasterol. 

Describe in your own words the functions of androgens, glucocorti- 
coids, and mineralocorticoids. 

Look through your refrigerator, your medicine cabinet, and your 
cleaning solutions shelf or cabinet, and find at least three commer- 
cial products that contain fragrant monoterpenes. Identify each 
one by its scent and then draw its structure. 

Our ancestors kept clean with homemade soap (page 222), often 
called “lye soap.” Go to http://www.wikihow.com/Make-Your-Own-Soap 


16 


TZ; 


18 


19 


20 


and read the procedure for making lye soap from vegetable oils and 
lye (sodium hydroxide). What chemical process occurs in the mak- 
ing of lye soap? Draw reactions to explain. How does this soap work 
as a cleaner? 

Mayonnaise is mostly vegetable oil and vinegar. So what’s the essen- 
tial difference between oil and vinegar salad dressing and mayon- 
naise? Learn for yourself: Combine a half cup of pure vegetable oil 
(olive oil will work) with two tablespoons of vinegar in a bottle, cap 
it securely, and shake the mixture vigorously. What do you see? Now 
let the mixture sit undisturbed for an hour. What do you see now? 
Add one egg yolk to the mixture, and shake vigorously again. Let 
the mixture stand as before. What do you see after an hour? After 
two hours? Egg yolk is rich in phosphatidylcholine. Explain why the 
egg yolk caused the effect you observed. 

The cholesterol-lowering benefit of stanol-ester margarine is only 
achieved after months of consumption of stanol esters (see graph, 
page 236). Suggest why this might be so. Suppose dietary sources 
represent approximately 25% of total serum cholesterol. Based on 
the data in the graph, how effective are stanol esters at preventing 
uptake of dietary cholesterol? 


Statins are cholesterol-lowering drugs that block cholesterol syn- 
thesis in the human liver (see Chapter 24). Would you expect the 
beneficial effects of stanol esters and statins to be duplicative or ad- 
ditive? Explain. 

If most plant-derived food products contain plant sterols and 
stanols, would it be as effective (for cholesterol-lowering purposes) 
to simply incorporate plant fats in one’s diet as to use a sterol- or 
stanol-fortified spread like Benecol? Consult a suitable reference 
(for example, http://Ipi.oregonstate.edu/infocenter/phytochemicals/sterols/ 
#sources at the Linus Pauling Institute) to compose your answer. 
Tetrahydrogestrinone is an anabolic steroid. It was banned by the 
U.S. Food and Drug Administration in 2003, but it has been used il- 
legally since then by athletes to increase muscle mass and strength. 
Nicknamed “The Clear,” it has received considerable attention in 
high-profile steroid-abuse cases among athletes such as baseball 
player Barry Bonds and track star Marion Jones. Use your favorite 
Web search engine to learn more about this illicit drug. How is it 
synthesized? Who is “the father of prohormones” who first synthe- 
sized it? Why did so many prominent athletes use The Clear (and its 
relative, “The Cream”) when less expensive and more commonly 
available anabolic steroids are in common use? (Hint: There are at 
least two answers to this last question.) 





Tetrahydrogestrinone 


Preparing for the MCAT Exam 


21. Make a list of the advantages polar bears enjoy from their nonpolar 


diet. Why wouldn’t juvenile polar bears thrive on an exclusively 
nonpolar diet? 


22. Snake venom phospholipase Ay causes death by generating 


membrane-soluble anionic fragments from glycerophospholipids. 
Predict the fatal effects of such molecules on membrane proteins 
and lipids. 
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Frog eggs are macroscopic facsimiles of microscopic 
cells. All cells are surrounded by a thin, ephemeral yet 
stable membrane. 
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It takes a membrane to make sense out of disorder 
in biology. 


Lewis Thomas 


The World's Biggest Membrane, 
The Lives of a Cell (1974) 
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Membranes 
and Membrane Transport 


ESSENTIAL QUESTION 


Membranes serve a number of essential cellular functions. They constitute the 
boundaries of cells and intracellular organelles, and they provide a surface where 
many important biological reactions and processes occur. Membranes have proteins 
that mediate and regulate the transport of metabolites, macromolecules, and ions. 
Hormones and many other biological signal molecules and regulatory agents exert 
their effects via interactions with membranes. Photosynthesis, electron transport, 
oxidative phosphorylation, muscle contraction, and electrical activity all depend on 
membranes and membrane proteins. For example, 30 percent of the genes of 
Mycoplasma genitalium are thought to encode membrane proteins. 


What are the properties and characteristics of biological membranes that ac- 
count for their broad influence on cellular processes and transport? 


Membranes are key structural and functional elements of cells. All cells have a cyto- 
plasmic membrane, or plasma membrane, that functions (in part) to separate the 
cytoplasm from the surroundings. The plasma membrane is also responsible for 
(1) the exclusion of certain toxic ions and molecules from the cell, (2) the accu- 
mulation of cell nutrients, and (3) energy transduction. It functions in (4) cell loco- 
motion, (5) reproduction, (6) signal transduction processes, and (7) interactions 
with molecules or other cells in the vicinity. 

Even the plasma membranes of prokaryotic cells are complex (Figure 9.1). With 
no intracellular organelles to divide and organize the work, bacteria carry out 
processes either at the plasma membrane or in the cytoplasm itself. Eukaryotic cells, 
however, contain numerous intracellular organelles that perform specialized tasks. 
Nucleic acid biosynthesis is handled in the nucleus; mitochondria are the site of 
electron transport, oxidative phosphorylation, fatty acid oxidation, and the tricar- 
boxylic acid cycle; and secretion of proteins and other substances is handled by the 
endoplasmic reticulum (ER) and the Golgi apparatus. This partitioning of labor is 
not the only contribution of the membranes in these cells. Many of the processes 
occurring in these organelles (or in the prokaryotic cell) actively involve mem- 
branes. Thus, some of the enzymes involved in nucleic acid metabolism are mem- 
brane associated. The electron transfer chain and its associated system for ATP syn- 
thesis are embedded in the mitochondrial membrane. Many enzymes responsible 
for aspects of lipid biosynthesis are located in the ER membrane. 

This chapter discusses the composition, structure, and dynamic processes of 
biological membranes. 


Qxl What Are the Chemical and Physical Properties 
of Membranes? 


Water’s tendency to form hydrogen bonds and share in polar interactions, and the 
hydrophobic effect, which promotes self-association of lipids in water to maximize 
entropy, are the basis for the interactions of lipids and proteins to form membranes. 
These forces drive amphiphilic glycerolipids, sphingolipids, and sterols to form 
membrane structures in water, and these forces facilitate the association of proteins 
(and thus myriad biological functions) with membranes. A symphony of molecular 
events over a range of times from picoseconds to many seconds results in the move- 
ment of lipids and proteins across and between membranes; catalyzes reactions at 
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(d) 


(b) 


FIGURE 9.1 Electron micrographs of several different membrane structures: (a) Plasma membrane of Menoid- 
ium, a protozoan. (b) Two plasma membranes from adjacent neurons in the central nervous system. (c) Golgi 
apparatus. (d) Many membrane structures are evident in pancreatic acinar cells. 


or in the membrane and the transport of ions, sugars, and amino acids across mem- 
branes; and organizes and directs hundreds of cell signaling events. 


The Composition of Membranes Suits Their Functions 


Biological membranes may contain as much as 75% to 80% protein (and only 20% 
to 25% lipid) or as little as 15% to 20% protein. Membranes that carry out many 
enzyme-catalyzed reactions and transport activities (the inner mitochondrial mem- 
brane, chloroplast membranes, and the plasma membrane of Escherichia coli, for 
example) are typically richer in protein, whereas membranes that carry out fewer 
protein-related functions (myelin sheaths, the protective coating around neurons, 
for example) are richer in lipid. 

Cellular mechanisms adjust lipid composition to functional needs. Thus, for ex- 
ample, the lipid makeup of red blood cell membranes is consistent across species, 
whereas the lipid complement of different (specialized) membranes within a par- 
ticular cell type (rat liver, Figure 9.2) reflects differences of function. Plasma mem- 
branes are enriched in cholesterol but do not contain diphosphatidylglycerol 
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FIGURE 9.2 The lipid composition of rat liver cell membranes, in weight percent. (Adapted from Andreoli, T. E., 1987. 
Membrane Physiology, 2nd ed. Chapter 27, Table Il, and Daum, G., 1985. Lipids of mitochondria. Biochimica et Biophysica Acta 
822:1-42.) 
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(cardiolipin), whereas mitochondria contain considerable amounts of cardiolipin 
(essential for some mitochondrial proteins) and no cholesterol. The protein com- 
ponents of membranes vary even more greatly than their lipid compositions. 


Lipids Form Ordered Structures Spontaneously in Water 


Monolayers and Micelles Amphipathic lipids spontaneously form a variety of struc- 
tures when added to aqueous solution. All these structures form in ways that minimize 
contact between the hydrophobic lipid chains and the aqueous milieu. For example, 
when small amounts of a fatty acid are added to an aqueous solution, a monolayer is 
formed at the air—water interface, with the polar head groups in contact with the wa- 
ter surface and the hydrophobic tails in contact with the air (Figure 9.3). Few lipid 
molecules are found as monomers in solution. 

Further addition of fatty acid eventually results in the formation of micelles. 
Micelles formed from an amphipathic lipid in water position the hydrophobic tails 
in the center of the lipid aggregation with the polar head groups facing outward. 
Amphipathic molecules that form micelles are characterized by a unique critical 
micelle concentration, or CMC. Below the CMC, individual lipid molecules pre- 
dominate. Nearly all the lipid added above the CMC, however, spontaneously forms 
micelles. Micelles are the preferred form of aggregation in water for detergents and 
soaps. Some typical CMC values are listed in Figure 9.4. 


Lipid Bilayers Lipid bilayers consist of back-to-back arrangements of monolayers 
(Figure 9.3). The nonpolar portions of the lipids face the middle of the bilayer, with 
the polar head groups arrayed on the bilayer surface. Phospholipid bilayers form 


(a) Monolayers and bilayers (b) Micelles (c) Unilamellar vesicle 
Monolayer 

Air 

Water 
Bilayer Inside-out 


Hat idan 


a V 





i 
| 


i Water 





Normal 


(d) Multilamellar vesicle (e) 





David Phillips/Visuals Unlimited 


FIGURE 9.3 Several spontaneously formed lipid structures. Drawings of (a) monolayers and bilayers, (b) mi- 
celles, (c) a unilamellar vesicle, (d) a multilamellar vesicle, and (e) an electron micrograph of a multilamellar 
Golgi structure. 
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FIGURE 9.4 The structures of some common detergents and their physical properties. Micelles formed by 
detergents can be quite large. Triton X-100, for example, typically forms micelles with a total molecular mass of 
90 to 95 kD. This corresponds to approximately 150 molecules of Triton X-100 per micelle. 


rapidly and spontaneously when phospholipids are added to water, and they are sta- 
ble structures in aqueous solution. As opposed to micelles, which are small, self- 
limiting structures of a few hundred molecules, bilayers may form spontaneously over 
large areas (10° nm? or more). Because exposure of the edges of the bilayer to solvent 
is highly unfavorable, extensive bilayers normally wrap around themselves and form 
closed vesicles (Figure 9.3). The nature and integrity of these vesicle structures are 
very much dependent on the lipid composition. Phospholipids can form either uni- 
lamellar vesicles (with a single lipid bilayer), known as liposomes, or multilamellar vesicles. 
These latter structures are reminiscent of the layered structure of onions. 

Liposomes are highly stable structures, a consequence of the amphipathic na- 
ture of the phospholipid molecule. Ionic interactions between the polar head 
groups and water are maximized, whereas hydrophobic interactions (see Chapter 
2) facilitate the association of hydrocarbon chains in the interior of the bilayer. 
The formation of vesicles results in a favorable increase in the entropy of the solu- 
tion, because the water molecules are not required to order themselves around the 
lipid chains. It is important to consider for a moment the physical properties of the 
bilayer membrane, which is the basis of vesicles and also of natural membranes. Bi- 
layers have a polar surface and a nonpolar core. This hydrophobic core provides a 
substantial barrier to ions and other polar entities. The rates of movement of such 
species across membranes are thus quite slow. However, this same core also pro- 
vides a favorable environment for nonpolar molecules and hydrophobic proteins. 
We will encounter numerous cases of hydrophobic molecules that interact with 
membranes and regulate biological functions in some way by binding to or em- 
bedding themselves in membranes. 


The Fluid Mosaic Model Describes Membrane Dynamics 


In 1972, S. J. Singer and G. L. Nicolson proposed the fluid mosaic model for mem- 
brane structure, which suggested that membranes are dynamic structures com- 
posed of proteins and phospholipids. In this model, the phospholipid bilayer is a 
fluid matrix, in essence, a two-dimensional solvent for proteins. Both lipids and pro- 
teins are capable of rotational and lateral movement. 

Singer and Nicolson also pointed out that proteins can be associated with the 
surface of this bilayer or embedded in the bilayer to varying degrees (Figure 9.5). 
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FIGURE 9.5 The fluid mosaic model of membrane structure proposed by S. J. Singer and G. L. Nicolson. In this 
model, the lipids and proteins are assumed to be mobile; they can diffuse laterally in the plane of the mem- 
brane. Transverse motion may also occur, but it is much slower. 


They defined two classes of membrane proteins. The first, called peripheral pro- 
teins (or extrinsic proteins), includes those that do not penetrate the bilayer to any 
significant degree and are associated with the membrane by virtue of ionic interac- 
tions and hydrogen bonds between the membrane surface and the surface of the 
protein. Peripheral proteins can be dissociated from the membrane by treatment 
with salt solutions or by changes in pH (treatments that disrupt hydrogen bonds 
and ionic interactions). Integral proteins (or intrinsic proteins), in contrast, possess 
hydrophobic surfaces that can readily penetrate the lipid bilayer itself, as well as sur- 
faces that prefer contact with the aqueous medium. These proteins can either insert 
into the membrane or extend all the way across the membrane and expose them- 
selves to the aqueous solvent on both sides. Singer and Nicolson also suggested that 
a portion of the bilayer lipid interacts in specific ways with integral membrane pro- 
teins and that these interactions might be important for the function of certain 
membrane proteins. Because of these intimate associations with membrane lipid, 
integral proteins can be removed from the membrane only by agents capable of 
breaking up the hydrophobic interactions within the lipid bilayer itself (such as 
detergents and organic solvents). The fluid mosaic model became the paradigm for 
modern studies that have advanced our understanding of membrane structure and 
function. 


The Thickness of a Membrane Depends on Its Components Electron micro- 
graphs of typical cellular membranes show the thickness of the entire membrane— 
including lipid bilayer and embedded protein—to be 50 Å or more. Electron mi- 
croscopy, NMR, and X-ray and neutron diffraction measurements have shown that 
membrane thickness is influenced by the particular lipids and proteins in the mem- 
brane. The thickness of a phospholipid bilayer made from dipalmitoyl phos- 
phatidylcholine, measured as the phosphorus-to-phosphorus spacing, is about 37 À, 
and the hydrophobic phase of such membranes is approximately 26 Å thick. Nat- 
ural membranes are thicker overall than simple lipid bilayers because many mem- 
brane proteins extend out of the bilayer significantly. 

Among the known membrane protein structures, there is considerable variation 
in the hydrophobic surface perpendicular to the membrane plane. If the hy- 
drophobic surface of the protein is larger or smaller than the lipid bilayer, the thick- 
ness of the lipid bilayer must be increased or decreased. The change in bilayer 
thickness due to membrane proteins can be as much as 5 A. 
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FIGURE 9.6 Membrane protein structures, by year published. (Data from the Web site Membrane Proteins of Known 
3D Structure at the laboratory of Stephen White, http://blanco.biomol.uci.edu/Membrane_Proteins_xtal.html, and from the 
Web site of Hartmut Michel, http://www.mpibp-frankfurt.mpg.de/michel/public/memprotstruct.html.) 


Lipid Chains May Bend and Tilt in the Membrane The long hydrocarbon chains 
of lipids are typically portrayed as more or less perpendicular to the membrane 
plane (Figure 9.3). In fact, the hydrocarbon tails of phospholipids may tilt and bend 
and adopt a variety of orientations. Typically, the portions of a lipid chain near the 
membrane surface lie most nearly perpendicular to the membrane plane, and lipid 
chain ordering decreases toward the end of the chain (toward the middle of the 
bilayer). 


Membranes Are Crowded with Many Different Proteins Membranes are crowded 
places, with a large number of proteins either embedded or associated in some way. 
The E. coli genome codes for more than a thousand membrane proteins. Moreover, 
as more membrane protein structures are determined (Figure 9.6), it has become 
apparent that many membrane proteins have large structures extending outside the 
lipid bilayer that share steric contacts and other interactions. Donald Engelman has 
suggested that most membranes are more crowded than first portrayed in Singer 
and Nicolson’s model (Figure 9.7). 





FIGURE 9.7 An updated model for membrane structure, 
as proposed by Donald Engelman. (Adapted from 
Engelman, D., 2005. Membranes are more mosaic than fluid. 
Nature 438:578-580.) 
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9:25 What Are the Structure and Chemistry 
of Membrane Proteins? 


Although the lipid bilayer constitutes the fundamental structural unit of all biolog- 
ical membranes, proteins carry out essentially all of the active functions of mem- 
branes. Singer and Nicolson defined peripheral proteins as globular proteins that 
interact with the membrane mainly through electrostatic and hydrogen-bonding 
interactions, and integral proteins as those that are strongly associated with the lipid 
bilayer. Another class of proteins not anticipated by Singer and Nicolson, the lipid- 
anchored proteins, is important in a variety of functions in different cells and tis- 
sues. These proteins associate with membranes by means of a variety of covalently 
linked lipid anchors. 


Peripheral Membrane Proteins Associate Loosely with the Membrane 


Peripheral proteins can bind to membranes in several ways (Figure 9.8). They may 
form ionic interactions and hydrogen bonds with polar head groups of membrane 
lipids or with other (integral) proteins, or they may interact with the nonpolar 
membrane core by inserting a hydrophobic loop or an amphipathic a-helix. Exam- 
ples of each of these interaction types are shown in Figure 9.9. 


Integral Membrane Proteins Are Firmly Anchored in the Membrane 


Hundreds of structures of integral membrane proteins are now available in the Pro- 
tein Data Bank, and the number of membrane protein structures is doubling about 
every 3 years. The known structures show a surprising diversity, but in all cases the 
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FIGURE 9.8 Four possible modes for the binding of peripheral membrane proteins. 
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FIGURE 9.9 Models for membrane association of peripheral proteins. (a) Bee venom phospholipase Az (pdb id 
= 1POC), (b) p40 phox PX domain of NADH oxidase (pdb id = 1H6H), and (c) PH domain of phospholipase Cé 
(pdb id = 1MAN. 
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portions of the protein in contact with the nonpolar core of the lipid bilayer are 
dominated by a-helices or B-sheets, because these secondary structures neutralize 
the highly polar N—H and C=O functions of the peptide backbone through H- 
bond formation. 


Proteins with a Single Transmembrane Segment In proteins that are anchored 
by a single hydrophobic segment, that segment typically takes the form of an 
a-helix. One of the best examples is glycophorin. Most of glycophorin’s mass is ori- 
ented on the outside surface of the red blood cell, exposed to the aqueous milieu 
(Figure 9.10). Hydrophilic oligosaccharide units are attached to this extracellular 
domain. These oligosaccharide groups constitute the ABO and MN blood group 
antigenic specificities of the red cell. Glycophorin has a total molecular weight of 
about 31,000 and is approximately 40% protein and 60% carbohydrate. The gly- 
cophorin primary structure consists of a segment of 19 hydrophobic amino acid 
residues with a short hydrophilic sequence on one end and a longer hydrophilic 
sequence on the other end. The 19-residue sequence is just the right length to span 
the cell membrane if it is coiled in the shape of an a-helix. 

Monoamine oxidase from the mitochondrial outer membrane is another typical 
single transmembrane-segment protein (Figure 9.11); this enzyme is the target for 
many antidepressant and neuroprotective drugs. Each monomer of the dimeric 
protein binds to the membrane through a C-terminal transmembrane a-helix. 
Residues in two loops (Pro-109 and Ile-110 in the 99-112 loop and Phe-481, Leu- 
482, Leu-486, and Pro-487 in the 481-488 loop) also provide nonpolar residues that 
participate in membrane binding. 

Approximately 10% to 30% of transmembrane proteins have a single helical 
transmembrane segment. In animals, many of these function as cell surface recep- 
tors for extracellular signaling molecules or as recognition sites that allow the im- 
mune system to recognize and distinguish cells of the host organism from invading 
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FIGURE 9.10 Glycophorin A spans the membrane of the 
human erythrocyte via a single a-helical transmem- 
brane segment. The C-terminus of the peptide faces the 
cytosol of the erythrocyte; the N-terminal domain is 
extracellular. Points of attachment of carbohydrate 
groups are indicated by triangles. 
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FIGURE 9.11 (a) Major histocompatibility antigen 
HLA-A2 (pdb id = 1JF1) and (b) monoamine oxidase 
(pdb id = 1GOS) are membrane-associated proteins 
with a single transmembrane helical segment. 
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foreign cells or viruses. The proteins that represent the major transplantation antigens 
H2in mice (Figure 9.11) and human leukocyte associated (HLA) proteins in humans are 
members of this class. Other such proteins include the surface immunoglobulin recep- 
tors on B lymphocytes and the spike proteins of many membrane viruses. The function 
of many of these proteins depends primarily on their extracellular domain; thus, 
the segment facing the intracellular surface is often a shorter one. 


Proteins with Multiple Transmembrane Segments Most integral transmembrane 
proteins cross the lipid bilayer more than once. These multi-spanning membrane 
proteins typically have 2 to 12 transmembrane segments, and they carry out a vari- 
ety of cellular functions (Figure 9.12). A well-characterized example of such a pro- 
tein is bacteriorhodopsin (Figure 9.13), which clusters in purple patches in the 
membrane of the archaeon Halobacterium halobium. The name Halobacterium refers 
to the fact that this prokaryote thrives in solutions having high concentrations of 
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FIGURE 9.12 Most membrane proteins possess 2 to 12 transmembrane segments. Those involved in transport 
functions have between 6 and 12 transmembrane segments. (Adapted from von Heijne, G., 2006. Membrane-protein 
topology. Nature Reviews Molecular Cell Biology 7:909-918.) 
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sodium chloride, such as the salt ponds of San Francisco Bay. Halobacterium carries 
out a light-driven proton transport by means of bacteriorhodopsin, named in ref- 
erence to its spectral similarities to rhodopsin in the rod outer segments of the 
mammalian retina. The amino acid sequence of bacteriorhodopsin contains seven 
different segments, each about 20 nonpolar residues in length—just the right size 
for an a-helix that could span a bilayer membrane. (Twenty residues times 1.5 A per 
residue equals 30 A) 

Bacteriorhodopsin clusters in symmetric, repeating arrays in the purple mem- 
brane patches of Halobacterium, and it was this orderly, repeating arrangement of 
proteins in the membrane that enabled Nigel Unwin and Richard Henderson in 
1975 to determine the bacteriorhodopsin structure. The polypeptide chain crosses 
the membrane seven times, in seven a-helical segments, with very little of the pro- 
tein exposed to the aqueous milieu. The bacteriorhodopsin structure became a 
model of globular membrane protein structure. Many other integral membrane 
proteins contain numerous hydrophobic sequences that, like those of bacterio- 
rhodopsin, form a-helical transmembrane segments. 


Membrane Protein Topology Can Be Revealed by Hydropathy Plots The topol- 
ogy of a membrane protein is a specification of the number of transmembrane seg- 
ments and their orientation across the membrane. The topology of a transmem- 
brane helical protein can be revealed by a hydropathy plot based on its amino acid 
sequence. If a measure of hydrophobicity is assigned to each amino acid (Table 
9.1), then the overall hydrophobicity of a segment of a polypeptide chain can be es- 
timated. The hydropathy index for any segment is an average of the hydrophobic- 
ity values for its residues. 

The hydropathy index can be calculated at any residue in a sequence by averag- 
ing the hydrophobicity values for a segment surrounding that residue. Typically, 
segment sizes for such calculations can be 7 to 21 residues. With a 7-residue seg- 
ment size, the calculation of hydropathy index at residue 10 would average the val- 
ues for residues 7 through 13. The calculation for a 21-residue segment around 
residue 100 would include residues 90 to 110. A polypeptide segment approxi- 
mately 20 residues long with a high hydropathy index is likely to be an a-helical 
transmembrane segment. A hydropathy plot for glycophorin (Figure 9.14a) reveals 
a single region of high hydropathy index between residues 73 and 93, the location 
of the a-helical segment in this transmembrane protein (Figure 9.10). A hydropa- 
thy plot for rhodopsin (Figure 9.14b) reveals the locations of its seven a-helical 
transmembrane segments. Rhodopsin, the light-absorbing pigment protein of the 
eye, is a member of the G-protein—coupled receptor (GPCR) family of membrane 
proteins (see Chapter 32). 


Proline Residues Can Bend a Transmembrane a-Helix Transmembrane a-helices 
often contain distortions and “kinks’—more so than for water-soluble proteins. As 
more integral membrane protein structures have been determined, it has become 
clear that most transmembrane a-helices contain significant distortions from ideal 
helix geometry. Helix distortions may have evolved in membrane proteins because 
(1) helices, even distorted ones, are highly stable in the membrane environment, 
and (2) helix distortions may be one way to create structural diversity from the sim- 
ple helix building blocks of most membrane proteins. 

About 60% of known membrane helix distortions are kinks at proline residues 
(Figure 9.13). Proline distorts the ideal a-helical geometry because of steric conflict 
with the preceding residue and because of the loss of a backbone H bond. Proline- 
induced kinks create weak points in the helix, which may facilitate movements re- 
quired for transmembrane transport channels. 


Amino Acids Have Preferred Locations in Transmembrane Helices Transmem- 
brane protein sequences and structures are adapted to the transition from water on 
one side of the membrane, to the hydrocarbon core of the membrane, and then to 
water on the other side of the membrane. The amino acids that make up trans- 
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FIGURE 9.13 Bacteriorhodopsin is composed of seven 
transmembrane a-helical segments connected by short 
loops (pdb id = 1MOM). Nearly all of this protein is 
embedded in the membrane. Only the short loops con- 
necting helices are exposed to solvent. A retinal chro- 
mophore (a light-absorbing molecule, shown in blue) 
lies approximately parallel to the membrane and 
between the helical segments. A proline residue (red) 
induces a kink in one of the helical segments (green). 
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Side Chain 

Isoleucine 4.5 
Valine 4.2 
Leucine 3.8 
Phenylalanine 2.8 
Cysteine 2.5 
Methionine 1.9 
Alanine 1.8 
Glycine —0.4 
Threonine —0.7 
Serine —0.8 
Tryptophan —0.9 
Tyrosine =1.3 
Proline —1.6 
Histidine = 3.2 
Glutamic acid —3.5 
Glutamine =3.5 
Aspartic acid —3.5 
Asparagine =3:5 
Lysine =3.9 
Arginine —4.5 








*From Kyte, J., and Doolittle, R., 1982. A simple method for dis- 
playing the hydropathic character of a protein. Journal of 
Molecular Biology 157:105-132. 
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FIGURE 9.14 Hydropathy plots for (a) glycophorin and (b) rhodopsin. Hydropathy index is plotted versus 
residue number. At each position in the polypeptide chain, the average of hydropathy indices for a certain 
number of adjacent residues (eight, in this case) is calculated and plotted on the y-index, and the number of 
the residue in the middle of this “window” is shown on the x-axis. 


membrane segments reflect these transitions. Hydrophobic amino acids (Ala, Val, 
Leu, Ile, and Phe) are found most often in the hydrocarbon interior, where charged 
and polar amino acid almost never reside (Figure 9.15b). Charged residues (Figure 
9.15a) occur commonly at the lipid-water interface, but positively charged residues 
are found more often on the cytoplasmic face of transmembrane proteins. Gunnar 
von Heijne has termed this the “positive inside rule.” Tryptophan, histidine, and 
tyrosine are special cases (Figure 9.15c). These residues have a mixed character, 
with nonpolar aromatic rings that also contain polar parts (the ring N—H of Trp 
and the substituent —OH of Tyr). As such, Trp and Tyr are found commonly at the 
lipid—water interface of transmembrane proteins. 

The amino acids Lys and Arg frequently behave in novel ways at the lipid—water 
interface. Both of these residues possess long aliphatic side chains with positively 
charged groups at the end. In many membrane proteins, the aliphatic chain of Lys 
or Arg is associated with the hydrophobic portion of the bilayer, with the positively 
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FIGURE 9.15 Amino acids have distinct preferences for different parts of the membrane. The graphs show rela- 
tive stabilization energies as a function of location in the membrane for (a) Arg, Asp, Glu, Lys, Asn, Gln, and Pro; 
(b) Ala, Gly, Ile, Leu, Met, Phe, and Val; and (c) His, Tyr, and Trp. Polar and charged residues are less stable in the 
membrane interior, whereas nonpolar residues tend to be more stable in the membrane interior. The stability 
profiles for His, Tyr, and Trp are more complex. (Adapted from von Heijne, G., 2006. Membrane-protein topology. Nature 
Reviews Molecular and Cell Biology 7:909-918.) 
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charged groups (amino or guanidinium) extending beyond to associate with nega- 
tively charged phosphate groups. This behavior, with the side chain pointing up out 
of the membrane core, has been termed snorkeling (Figure 9.16). If a Phe residue 
occurs near the lipid—water interface, it is typically arranged with the aromatic ring 
oriented toward the membrane core. This is termed antisnorkeling. 


Membrane Protein Structures Show Many Variations on the Classical Themes 
Although it revealed many insights of membrane protein structure, bacterio- 
rhodopsin gave a relatively limited view of the structural landscape. Many mem- 
brane protein structures obtained since bacteriorhodopsin (and a few others) have 
provided a vastly more complex picture to biochemists. For example, the structures 
of a homodimeric chloride ion transport protein and a glutamate transport protein 
show several novel structural features (Figure 9.17). In addition to several trans- 
membrane helices that lie perpendicular to the membrane plane (like those of bac- 
teriorhodopsin), these structures each contain several long, severely tilted helices that 
span the membrane. Both these proteins also contain several reentrant loops, con- 
sisting of a pair of short O-helices and a connecting loop that together penetrate 
part way into the membrane core. There are also regions of nonhelical polypeptide 
deep in the membrane core of these proteins, with helical segments on either side 
that extend to the membrane surface (Figure 9.17). 

Finally, most membrane protein structures are relatively stable; that is, trans- 
membrane helices do not flip in and out of the membrane, and they do not flip 
across the lipid bilayer, inverting their orientation. However, a few membrane pro- 
teins can in fact change their membrane orientation. Aquaporin-1 is a protein that 
functions normally with six transmembrane a-helices. When this protein is first in- 
serted into its membrane, it has only four transmembrane a-helices (Figure 9.18a). 
One of these, the third transmembrane helix (TM3), reorients across the mem- 
brane, pulling helices 2 and 4 into the membrane. Similarly, a glycoprotein of the 
hepatitis B virus is initially inserted into the viral membrane with its N-terminal do- 
main lying outside. During the viral maturation process, about half of these glyco- 
proteins rearrange (Figure 9.18b), with the N-terminal segment moving across the 
membrane as TM4 creates a new transmembrane segment. 


Some Proteins Use -Strands and B-Barrels To Span the Membrane The a-helix 
is not the only structural motif by which a protein can cross a membrane. Some in- 
tegral transmembrane proteins use structures built from -strands and -sheets to 
diminish the polar character of the peptide backbone as it crosses the nonpolar 
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FIGURE 9.16 Snorkeling and antisnorkeling behavior in 
membrane proteins. The SdhC subunit of succinate 
dehydrogenase (pdb id = 1NEK). Lys!!! snorkels away 
from the membrane core and Phe” antisnorkels toward 
the membrane core. (Adapted from Liang, J, Adamian, L., and 
Jackups, R., Jr, 2005. The membrane-water interface region of 
membrane proteins: Structural bias and the anti-snorkeling 
effect. Trends in Biochemical Sciences 30:355-357.) 





FIGURE 9.17 Not all the embedded segments of membrane proteins are transmembrane and oriented per- 
pendicular to the membrane plane. (a) The glutamate transporter homolog (pdb id = 1XFH).”Reentrant” 
helices (orange) and interrupted helices (red) are shown. Several of the transmembrane helices deviate signifi- 
cantly from the perpendicular. (b) The E. coli CIC chloride transporter (pdb id = 1KPK). Few of the transmem- 
brane helices are perpendicular to the membrane plane. 
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(a) Aquaporin-1 





(b) Hepatitis B virus 





FIGURE 9.18 Dynamic insertion of helical segments of membrane proteins. (a) Aquaporin-1.The second and 
fourth transmembrane helices insert properly across the membrane only after reorientation of the third trans- 
membrane helix. (b) The large envelope glycoprotein of the hepatitis B virus. The N-terminal “pre-S” domain 
translocates across the endoplasmic reticulum membrane in a slow process in 50% of the molecules. (Adapted 
from von Heijne, G., 2006. Nature Reviews Molecular and Cell Biology 7:909-918.) 
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FIGURE 9.19 Some proteins traverse the membrane with B-barrel structures. 
Several examples are shown, including (a) maltoporin from S. typhimurium 

(pdb id = 2MPR), (b) ferric enterobactin receptor (pdb id = 1FEP), (c) ToIC, 

an outer membrane protein from E. coli (pdb id = 1EK9), (d) the translocator 
domain of the NalP autotransporter of N. meningitides (pdb id = 1UYN), (e) the 
translocator domain of the Hia autotransporter from H. influenzae (pdb id = 
2GR8), (f) the outer membrane cobalamin transporter from E. coli, in a complex 
with the 100 A coiled coil of colicin E3 (pdb id = 1UJW), (g) the fatty acid trans- 
porter FadL from £. coli (odb id = 1716). 
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membrane core. These -barrel structures (Figure 9.19) maximize hydrogen bond- 
ing and are highly stable. The barrel interior is large enough to accommodate wa- 
ter molecules and often structures as large as peptide chains, and most barrels are 
literally water filled. 

How does the B-barrel structure tolerate water on one surface (the inside) and 
the nonpolar membrane core on the other? In all transmembrane f-barrels, polar 
and nonpolar residues alternate along the 6-strands, with polar residues facing the 
center of the barrel and nonpolar residues facing outward, where they can interact 
with the hydrophobic lipid milieu of the membrane. 

Porin proteins found in the outer membranes (OMs) of Gram-negative bacte- 
ria such as Escherichia coli, and also in the outer mitochondrial membranes of eu- 
karyotic cells, span their respective membranes with large B-barrels. A good ex- 
ample is maltoporin, also known as LamB protein or lambda receptor, which 
participates in the entry of maltose and maltodextrins into E. coli. Maltoporin is 
active as a trimer. The 421-residue monomer forms an 18-strand -barrel with 
antiparallel 6-strands connected to their nearest neighbors either by long loops 
or by -turns (Figure 9.20; see also Figure 9.19a). The long loops are found at 
the end of the barrel that is exposed to the cell exterior, whereas the turns are 
located on the intracellular face of the barrel. Three of the loops fold into the 
center of the barrel. 

B-barrels can also be constructed from multiple subunits. The a-hemolysin toxin 
(Figure 9.21) forms a 14-stranded -barrel with seven identical subunits that each 
contribute two antiparallel 6-strands connected by a short loop. Staphylococcus aureus 
secretes monomers of this toxin, which bind to the plasma membranes of host 





FIGURE 9.20 The arrangement of the peptide chain in 
maltoporin from E. coli. 


CENGAGENOW" Go to CengageNOW at 
www.cengage.com/login and click 
Biochemistrylnteractive to discover how a 
B-sheet is expoilted by maltoporin. 


CENGAGENOW” ACTIVE FIGURE 9.21 The structure of 
the heptameric channel formed by Staphylococcus 
aureus a-hemolysin. Each of the seven subunits con- 
tributes a B-sheet hairpin to the transmembrane chan- 
nel (pdb id = 7AHL). Test yourself on the concepts in 
this figure at www.cengage.com/login. 
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blood cells. Upon binding, the monomers oligomerize to form the 7-subunit struc- 
ture. The channel thus formed facilitates uncontrolled permeation of water, ions, 
and small molecules, destroying the host cell. 

Why have certain proteins evolved to use -strands instead of a-helices as 
membrane-crossing devices? Among other reasons, there is an advantage of genetic 
economy in the use of -strands to traverse the membrane instead of a-helices. An 
a-helix requires 21 to 25 amino acid residues to span a typical biological membrane; a 
B-strand can cross the same membrane with 9 to 11 residues. Therefore, a given 
amount of genetic information could encode a larger number of membrane-spanning 
segments using a -strand motif instead of a-helical arrays. 


Transmembrane Barrels Can also Be Formed with a-Helices Many bacteria, in- 
cluding E. coli, produce extracellular polysaccharides, some of which form a discrete 
structural layer—the capsule, which shields the cell, allowing it to evade or counter- 
act host immune systems. In £. coli, the components of this polysaccharide capsule are 
synthesized inside the cell and then transported outward through an octameric outer 
membrane protein called Wza. To cross the outer membrane, Wza uses a novel 
a-helical barrel (Figure 9.22). Wza is composed of three novel domains that, with the 
a-helical barrel, form a large central cavity that accommodates the transported poly- 
saccharides. The transmembrane a-helices of Wza are amphiphilic, with hydrophobic 
outer surfaces that face the lipid bilayer and hydrophilic inner surfaces that face the 
water-filled pore. 


Lipid-Anchored Membrane Proteins Are Switching Devices 


Certain proteins are found to be covalently linked to lipid molecules. For many of 
these proteins, covalent attachment of lipid is required for association with a mem- 
brane. The lipid moieties can insert into the membrane bilayer, effectively anchor- 
ing their linked proteins to the membrane. Some proteins with covalently linked 
lipid normally behave as soluble proteins; others are integral membrane proteins 
and remain membrane associated even when the lipid is removed. Covalently 
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FIGURE 9.22 The structure of Wza, an octameric membrane protein that anchors the peptidoglycan layer and 


the outer membrane of Gram-negative bacteria. The structure contains a central barrel constructed from 
a-helical segments (pdb id = 2558). 
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Exterminator Proteins—Biological Pest Control at the Membrane 


Control of biological pests, including mosquitoes, houseflies, gnats, 
and tree-consuming predators like the eastern tent caterpillar, is 
frequently achieved through the use of microbial membrane pro- 
teins. For example, several varieties of Bacillus thuringiensis produce 
proteins that bind to cell membranes in the digestive systems of in- 
sects that consume them, creating transmembrane ion channels. 
Leakage of Na‘, Kt, and H* ions through these membranes in the 
insect gut destroys crucial ion gradients and interferes with diges- 
tion of food. Insects that ingest these toxins eventually die of star- 
vation. B. thuringiensis toxins account for more than 90% of sales of 
biological pest control agents. 

B. thuringiensis is a common Gram-positive, spore-forming soil 
bacterium that produces inclusion bodies, microcrystalline clus- 
ters of many different proteins. These crystalline proteins, called 
6-endotoxins, are the ion channel toxins that are sold commer- 
cially for pest control. Most such endotoxins are protoxins, which 
are inactive until cleaved to smaller, active proteins by proteases in 
the gut of a susceptible insect. One such crystalline protoxin, 


lethal to mosquitoes, is a 27-kD protein, which is cleaved to form 
the active 25-kD toxin in the mosquito. This toxin has no effect on 
membranes at neutral pH, but at pH 9.5 (the pH of the mosquito 
gut) the toxin forms cation channels in the gut membranes. 

This 25-kD protein is not toxic to tent caterpillars, but a larger, 
130-kD protein in the B. thuringiensis inclusion bodies is cleaved by 
a caterpillar gut protease to produce a 55-kD toxin that is active in 
the caterpillar. Remarkably, the strain of B. thuringiensis known as 
azawai produces a protoxin with dual specificity: In the caterpillar 
gut, this 130-kD protein is cleaved to form a 55-kD toxin active in 
the caterpillar. However, when the same 130-kD protoxin is con- 
sumed by mosquitoes or houseflies, it is cleaved to form a 53-kD 
protein (15 amino acid residues shorter than the caterpillar toxin) 
that is toxic to these latter organisms. Understanding the molecu- 
lar basis of the toxicity and specificity of these proteins and the 
means by which they interact with membranes to form lethal ion 
channels is a fascinating biochemical challenge with far-reaching 
commercial implications. 
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bound lipid in these latter proteins can play a role distinct from membrane an- 
choring. In many cases, attachment to the membrane via the lipid anchor serves to 
modulate the activity of the protein. 

Another interesting facet of lipid anchors is that they are transient. Lipid anchors 
can be reversibly attached to and detached from proteins. This provides a “switch- 
ing device” for altering the affinity of a protein for the membrane. Reversible lipid 
anchoring is one factor in the control of signal transduction pathways in eukaryotic 
cells (see Chapter 32). 

Four different types of lipid-anchoring motifs have been found to date. These are 
amide-linked myristoyl anchors, thioester-linked fatty acyl anchors, thioether-linked 
prenyl anchors, and amide-linked glycosyl phosphatidylinositol anchors. Each of 
these anchoring motifs is used by a variety of membrane proteins, but each 
nonetheless exhibits a characteristic pattern of structural requirements. 


Amide-Linked Myristoyl Anchors Myristic acid may be linked via an amide bond 
to the a-amino group of the N-terminal glycine residue of selected proteins (Figure 
9.23a). The reaction is referred to as Nmyristoylation and is catalyzed by 
myristoyl—CoA: protein N-myristoyltransferase, known simply as NMT. N-Myristoyl- 
anchored proteins include the catalytic subunit of cAMP-dependent protein kinase, the 
ppoO”™ tyrosine kinase, the phosphatase known as calcineurin B, the a-subunit of G pro- 
teins (involved in GTP-dependent transmembrane signaling events), and the gag pro- 
teins of certain retroviruses (including the HIV-1 virus that causes AIDS). 


Thioester-Linked Fatty Acyl Anchors A variety of cellular and viral proteins contain 
fatty acids covalently bound via ester linkages to the side chains of cysteine and some- 
times to serine or threonine residues within a polypeptide chain (Figure 9.23b). This 
type of fatty acyl chain linkage has a broader fatty acid specificity than M-myristoylation. 
Myristate, palmitate, stearate, and oleate can all be esterified in this way, with the C16 
and Cs chain lengths being most commonly found. Proteins anchored to membranes 
via fatty acyl thioesters include G-protein—coupled receptors, the surface glycoproteins of sev- 
eral viruses, the reggie proteins of nerve axons, and the transferrin receptor protein. 


Thioether-Linked Prenyl Anchors As noted in Chapter 8, polyprenyl (or simply 
prenyl) groups are long-chain polyisoprenoid groups derived from isoprene units. 
Prenylation of proteins destined for membrane anchoring can involve either far- 
nesyl or geranylgeranyl groups (Figure 9.23c and d). The addition of a prenyl group 
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FIGURE 9.23 Certain proteins are anchored to biological 
membranes by lipid anchors. Shown are (a) the 
N-myristoyl motif, (b) the S-palmitoy! motif, (c) the far- 
nesyl motif, (d) the geranylgeranyl motif, and (e) several 
cases of the glycosyl phosphatidylinositol (GPI) motif. 
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Prenylation Reactions as Possible Chemotherapy Targets 


The protein called p21", or simply Ras, is a small GTP-binding 
protein involved in cell signaling pathways that regulate growth 
and cell division. Mutant forms of Ras cause uncontrolled cell 
growth, and Ras mutations are involved in one-third of all human 
cancers. Because the signaling activity of Ras is dependent on 
prenylation, the prenylation reaction itself, as well as the proteo- 
lysis of the -AAX motif and the methylation of the prenylated Cys 
residue, have been considered targets for development of new 
chemotherapy strategies. 

Farnesyl transferase from rat cells is a heterodimer consisting 
of a 48-kD a-subunit and a 46-kD B-subunit. In the structure shown 
here, helices 2 to 15 of the a-subunit are folded into seven short, 
coiled coils that together form a crescent-shaped envelope par- 
tially surrounding the B-subunit. Twelve helices of the B-subunit 
form a novel barrel motif that creates the active site of the enzyme. 
Farnesyl transferase inhibitors, one of which is shown here, are po- 
tent suppressors of tumor growth in experimental animals. 





The structure of the farnesyl transferase heterodimer (pdb id = 
1JCQ). A novel barrel structure is formed from 12 helical segments 
in the B-subunit (yellow). The a-subunit (green) consists largely of 
seven successive pairs of a-helices that form a series of right-handed 
antiparallel coiled coils running along the bottom of the structure. 
These “helical hairpins” are arranged in a double-layered, right- 
handed superhelix resulting in a crescent-shaped subunit that 
envelopes part of the subunit. 
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2(S)-{(S)-[2(R)-amino-3-mercapto ] propylamino- 
3(S)-methyl}pentyloxy-3-phenylpropionyl- 
methioninesulfone methyl ester 
This substance, also known as I-739,749, is a farnesyl transferase 
inhibitor that is a potent tumor growth suppressor. 





Mutations that inhibit prenyl transferases cause defective 
growth or death of cells, raising questions about the usefulness of 
prenyl transferase inhibitors in chemotherapy. However, Victor 
Boyartchuk and his colleagues at the University of California, 
Berkeley, and Acacia Biosciences have shown that the protease 
that cleaves the -AAX motif from Ras following the prenylation re- 
action may be a better chemotherapeutic target. They have iden- 
tified two genes for the prenyl protein protease in the yeast Sac- 
charomyces cerevisiae and have shown that deletion of these genes 
results in loss of proteolytic processing of prenylated proteins, in- 
cluding Ras. Interestingly, normal yeast cells are unaffected by this 
gene deletion. However, in yeast cells that carry mutant forms of 
Ras and that display aberrant growth behaviors, deletion of the 
protease gene restores normal growth patterns. If these remark- 
able results translate from yeast to human tumor cells, inhibitors 
of CAAX proteases may be more valuable chemotherapeutic 
agents than prenyl transferase inhibitors. 
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A The farnesylation and subsequent processing of the Ras protein. Follow- 
ing farnesylation by the FTase, the carboxy-terminal VLS peptide is removed 
by a prenyl protein-specific endoprotease (PPSEP) in the ER; then a prenyl- 
protein-specific methyltransferase (PPSMT) donates a methyl group from 
S-adenosylmethionine (SAM) to the carboxy-terminal S-farnesylated cys- 
teine. In addition, palmitates are added to cysteine residues near the 
C-terminus of the protein (not shown). 
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typically occurs at the cysteine residue of a carboxy-terminal CAAX sequence of the 
target protein, where C is cysteine, A is any aliphatic residue, and X can be any 
amino acid. As shown in Figure 9.23c and d, the result is a thioether-linked farnesyl 
or geranylgeranyl group. Once the prenylation reaction has occurred, a specific 
protease cleaves the three carboxy-terminal residues, and the carboxyl group of the 
now terminal Cys is methylated to produce an ester. All of these modifications ap- 
pear to be important for subsequent activity of the prenyl-anchored protein. Pro- 
teins anchored to membranes via prenyl groups include yeast mating factors, the 
p21 protein (the protein product of the ras oncogene; see Chapter 32), and the nu- 
clear lamins, structural components of the lamina of the inner nuclear membrane. 


Glycosyl Phosphatidylinositol Anchors Glycosyl phosphatidylinositol, or GPI, 
groups are structurally more elaborate membrane anchors than fatty acyl or prenyl 
groups. GPI groups modify the carboxy-terminal amino acid of a target protein via an 
ethanolamine residue linked to an oligosaccharide, which is linked in turn to the in- 
ositol moiety of a phosphatidylinositol (Figure 9.23e). The oligosaccharide typically 
consists of a conserved tetrasaccharide core of three mannose residues and a glu- 
cosamine, which can be altered by addition of galactosyl side chains of various sizes 
and extra phosphoethanolamines, Macetylgalactose, or mannosyl residues (Figure 
9.23e). The inositol moiety can also be modified by an additional fatty acid, and a va- 
riety of fatty acyl groups are found linked to the glycerol group. GPI groups anchor a 
wide variety of surface antigens, adhesion molecules, and cell surface hydrolases to plasma 
membranes in various eukaryotic organisms. GPI anchors have not yet been observed 
in prokaryotic organisms or plants. 


9:35 How Are Biological Membranes Organized? 


Membranes Are Asymmetric and Heterogeneous Structures 


Biological membranes are asymmetric and heterogeneous structures. The two mono- 
layers of the lipid bilayer have different lipid compositions and different comple- 
ments of proteins. The membrane composition is also different from place to place 
across the plane of the membrane. There are clusters of particular kinds of lipids, par- 
ticular kinds of proteins, and a variety of specific lipid-protein associations and 
aggregates, all of which serve the functional needs of the cell. We say that both the 
lipids and the proteins of membranes exhibit lateral heterogeneity and transverse 
asymmetry. Lateral heterogeneity arises when lipids or proteins of particular types 
cluster in the plane of the membrane. Transverse asymmetry refers to different lipid 
or protein compositions in the two leaflets or monolayers of a bilayer membrane. 

Many properties of a membrane depend on its two-sided nature. Properties that are 
a consequence of membrane “sidedness” include membrane transport, which is driven 
in one direction only; the effects of hormones at the outsides of cells; and the 
immunological reactions that occur between cells (necessarily involving only the out- 
side surfaces of the cells). The proteins involved in these and other interactions must 
be arranged asymmetrically in the membrane. 

Lipid transverse asymmetry can be seen in the typical animal cell, where the 
amine-containing phospholipids are enriched in the cytoplasmic leaflet of the 
plasma membrane, and the choline-containing phospholipids and sphingolipids 
are enriched in the outer leaflet (Figure 9.24). In the erythrocyte, for example, 
phosphatidylcholine (PC) comprises about 29% of the total phospholipid in the 
membrane. Of this amount, 76% is found in the outer monolayer and 24% is found 
in the inner monolayer. 

Asymmetric lipid distributions are important to cells in several ways. The carbohy- 
drate groups of glycolipids (and of glycoproteins) always face the outside of plasma 
membranes, where they participate in cell recognition phenomena. Asymmetric lipid 
distributions may also be important to various integral membrane proteins, which may 
prefer particular lipid classes in the inner and outer monolayers. 
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FIGURE 9.24 Phospholipids are distributed asymmetrically in most membranes, including the human erythro- 
cyte membrane, as shown here. (a) The distribution of phospholipids across the inner and outer leaflets of 
human erythrocytes. The x-axis values show, for each lipid type, its percentage of the total phospholipid in the 
membrane. (b) The phospholipid compositions of the inner and outer leaflets. All percentages in (a) and (b) 
are weight percentages. (Adapted from Zachowski, A., 1993. Phospholipids in animal eukaryotic membranes: Transverse 
asymmetry and movement. Biochemical Journal 294:1-14; and from Andreoli, T. E., 1987. Membrane Physiology, 2nd ed. Chapter 
27, Table |. New York: Springer.) 


Loss of transverse lipid asymmetry has dramatic (and often severe) consequences 
for cells and organisms. For example, appearance of PS in the outer leaflet of the 
plasma membrane triggers apoptosis, the programmed death of the cell. Similarly, ag- 
ing erythrocytes and platelets slowly externalize PS, culminating in engulfment by 
macrophages. Many disease states, including diabetes and malaria, involve microvas- 
cular occlusions that may result in part from alterations of transverse lipid asymmetry. 


9:4» What Are the Dynamic Processes That Modulate 
Membrane Function? 


Lipids and Proteins Undergo a Variety of Movements in Membranes 


Motions of lipids and proteins in membranes underlie many cell functions. Lipid 
movements (Figure 9.25) range from bond vibrations (at 10! per sec), to bilayer 
undulations (1 to 10° per sec), to transverse motion—called “flip-flop” (roughly one 
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FIGURE 9.25 Lipid motions in the membrane and their 
characteristic frequencies. (Adapted from Gawrisch, K., 2005. 
The dynamics of membrane lipids. In The Structure of Biological 
Membranes, Chapter 4, Figure 4.1, Yeagle, P. L., ed., 2005. Boca 
Raton: CRC Press.) 
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(a) 


per day to one per sec). Lateral movement of lipids (in the plane of the membrane) 
is rapid. Adjacent lipids can change places with each other on the order of 107/sec. 
Thus, a typical phospholipid can diffuse laterally in a membrane at a linear rate of 
several microns per second. At that rate, a phospholipid molecule travels from one 
end of a bacterial cell to the other in less than a second or traverse a typical animal 
cell in a few minutes. 

Many membrane proteins move laterally (through the plane of the membrane) 
at a rate of a few microns per minute. On the other hand, some integral membrane 
proteins are more restricted in their lateral movement, with diffusion rates of about 
10 nm per sec or even slower. Slower protein motion is likely for proteins that asso- 
ciate and bind with each other and for proteins that are anchored to the cytoskele- 
ton, a complex latticelike structure that maintains the cell’s shape and assists in the 
controlled movement of various substances through the cell. 


Flippases, Floppases, and Scramblases: Proteins That Redistribute Lipids Across 
the Membrane Proteins that can “flip” and “flop” phospholipids from one side of 
a bilayer to the other have also been identified in several tissues (Figure 9.26). 
Three classes of such proteins are known: 


1. ATP-dependent flippases that transport PS, and to a lesser extent PE, from the 
outer leaflet to the inner leaflet of the plasma membrane 

2. ATP-dependent floppases that transport a variety of amphiphilic lipids, especially 
cholesterol, PC, and sphigomyelin from the inner leaflet to the outer leaflet 

3. bidirectional, Ca?'-activated (but ATP-independent) scramblases that function to 
randomize lipids and thus degrade transverse asymmetry 


These proteins reduce the half-time for phospholipid movement across a mem- 
brane from days to a few minutes or less. Approximately one ATP is consumed per 
lipid transported by flippases and floppases. Energy-dependent lipid flippase activ- 
ity is essential for the creation and maintenance of transverse lipid asymmetries. A 
number of diseases have been linked to defects in flippases and floppases. Tangier 
disease causes accumulation of high concentrations of cholesterol in various tissues 
and leads to cardiovascular problems. Infants with respiratory distress syndrome 
produce low amounts of lung surfactant (a mix of lipids) and typically die a few days 
after birth. Both of these diseases involve flippase or floppase defects. 


Membrane Lipids Can Be Ordered to Different Extents 


The phospholipids and sterols of membranes can adopt different structures de- 
pending on the exact lipid and protein composition of the membrane and on the 
temperature. At low temperatures, bilayer lipids are highly ordered, forming a gel 
phase with the acyl chains nearly perpendicular to the plane of the membrane plane 
(Figure 9.27). In this state—called the solid-ordered state (or S, state)—the lipid 
chains are tightly packed and undergo relatively little motion. The lipid chains are 
in their fully extended conformation, the surface area per lipid is minimal, and the 
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CENGAGENOW” ANIMATED FIGURE 9.26 (a) Phospholipids can be flipped, flopped, or scrambled across a 
bilayer membrane by the action of flippase, floppase, and scramblase proteins. (b) When, by normal diffusion 
through the bilayer, the lipid encounters one of these proteins, it can be moved quickly to the other face of 
the bilayer. See this figure animated at www.cengage.com/login. 
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CENGAGENOW” ANIMATED FIGURE 9.27 The gel-to-liquid crystalline phase transition, which occurs when a 
membrane is warmed through the transition temperature, Tm. In the transition, the surface area increases, the 
membrane thickness decreases, and the mobility of the lipid chains increases dramatically. Membrane phase 
transitions can be characterized by measuring the rate of heat absorption by a membrane sample in a 
calorimeter. Pure, homogeneous bilayers (containing only a single lipid component) give sharp calorimetric 
peaks. As membrane heterogeneity increases, the calorimetric peaks broaden. Below phase transitions, lipid 
chains primarily adopt the anti conformation. Above the phase transition, lipid chains have absorbed a sub- 
stantial amount of heat. This is reflected in the adoption of higher-energy conformations, including the gauche 
conformations shown. See this figure animated at www.cengage.com/login. 


bilayer thickness is maximal. At higher temperatures, the acyl chains undergo much 
more motion, with rotations around the acyl chain C-C bonds and significant large- 
scale bending of the acyl chains. The membrane is then said to be in a liquid crys- 
talline phase or liquid-disordered state (La state) (Figure 9.27). In this less ordered 
state, the surface area per lipid increases and the bilayer thickness decreases by 10% 
to 15%. Under most conditions, the transition from the gel phase to the liquid crys- 
talline phase is a true phase transition, and the temperature at which this change 
occurs is referred to as a transition temperature or melting temperature (T,,). 

The sharpness of the transition in pure lipid preparations shows that the phase 
change is a cooperative behavior. This is to say that the behavior of one or a few mol- 
ecules affects the behavior of many other molecules in the vicinity. The sharpness 
of the transition then reflects the number of molecules that are acting in concert. 
Sharp transitions involve large numbers of molecules all “melting” together. 

Phase transitions have been characterized in a number of different pure and 
mixed lipid systems. Table 9.2 shows a comparison of the transition temperatures ob- 
served for several different phosphatidylcholines with different fatty acyl chain com- 
positions. General characteristics of bilayer phase transitions include the following: 


1. The transitions are always endothermic; heat is absorbed as the temperature in- 
creases through the transition (Figure 9.27). 
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FIGURE 9.28 (a) A model for a membrane raft. Relative 
to other parts of the membrane, rafts are presumed to 
be enriched in cholesterol, fatty acyl-anchored proteins, 
and GPl-anchored proteins. Sphingolipids are found 
predominantly in the outer leaflet of the raft bilayer. 

(b) Rafts are postulated to “grow” by accumulation of 
these components as they diffuse through the plane of 
the membrane. They become increasingly stable as they 
grow. Green circles represent GPl-anchored proteins, 
which accumulate in lipid rafts as they grow in size. 
(Adapted from Hancock, J. F., 2006. Lipid rafts: Contentious only 
from simplistic standpoints. Nature Reviews Molecular Cell Biology 
7:456-462, and Parton, R. G., and K. Simons, 2007. The multiple 
faces of caveolae. Nature Reviews Molecular Cell Biology 
8:185-194.) 











TABLE 9.2 Pi 

Transition Temperature 
Phospholipid (Tm), °C 
Dilauroyl phosphatidylcholine (Di 14:0) 23.6 
Dipalmitoyl phosphatidylcholine (Di 16:0) 41.4 
Distearoyl phosphatidylcholine (Di 18:0) 58 
1-Stearoyl-2-oleoyl-phosphatidylcholine 3 
(1-18:0, 2-18:1 PC) 
Dioleoyl phosphatidylcholine (Di 18:1 PC) =22 
Egg phosphatidylcholine (Egg PC) —15 
Dipalmitoyl phosphatidic acid (Di 16:0 PA) 67 
Dipalmitoyl phosphatidylethanolamine (Di 16:0 PE) 63.8 
Dipalmitoyl phosphatidylglycerol (Di 16:0 PG) 41.0 











Adapted from Jain, M., and Wagner, R. C., 1980. Introduction to Biological Membranes. New York: John Wiley and Sons; and 
Martonosi, A., ed., 1982. Membranes and Transport, Vol. 1. New York: Plenum Press. 


2. Particular phospholipids display characteristic transition temperatures (Tm). As 
shown in Table 9.2, Tn increases with chain length, decreases with unsaturation, 
and depends on the nature of the polar head group. 

3. For pure phospholipid bilayers, the transition occurs over a narrow temperature 
range. The phase transition for dimyristoyl lecithin has a peak width of about 
0.2°C. 
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4. Native biological membranes also display characteristic phase transitions, but 
these are broad and strongly dependent on the lipid and protein composition of 
the membrane. 

5. With certain lipid bilayers, a change of physical state referred to as a pretransition 
occurs 5° to 15°C below the phase transition itself. These pretransitions involve 
a tilting of the hydrocarbon chains. 

. Avolume change is usually associated with phase transitions in lipid bilayers. 

7. Bilayer phase transitions are sensitive to the presence of solutes that interact with 
lipids, including multivalent cations, lipid-soluble agents, peptides, and proteins. 


rn 


Cells adjust the lipid composition of their membranes to maintain proper fluidity 
as environmental conditions change. 


The Evidence for Liquid Ordered Domains and Membrane Rafts In addition to 
the solid ordered (S,) and liquid disordered (La) states, model lipid bilayers can 
exhibit a third structural phase if the membrane contains sufficient cholesterol. 
The liquid-ordered (L,) state is characterized by a high degree of acyl chain order- 
ing (like the S, state) but has the translational disorder characteristic of the Lg state. 
Lipid diffusion in the L, phase is about twofold to threefold slower than in the 
La phase. 

Biological membranes are hypothesized to contain regions equivalent to the 
Lo phase of model membranes. These microdomains are postulated to be aggre- 
gates of cholesterol and glycosphingolipids with long, saturated fatty acyl chains 
(particularly ceramides and gangliosides), and they are termed membrane rafts. 
The physical evidence for membrane rafts is indirect; thus, their existence is a mat- 
ter of debate among membrane biochemists. Direct measurements of rafts are dif- 
ficult, because they are small (with postulated diameters of 10 to 50 nm) and be- 
cause they are apparently transient, with lifetimes from a tenth of a millisecond or 
less to a few seconds or more. The most likely scenario, based on existing data, is 
shown in Figure 9.28. 

Many of the proteins that appear to associate with and stabilize rafts are cell sur- 
face receptor proteins and other proteins involved in cell signaling processes (see 
Chapter 32). Association in rafts may be advantageous for the functioning of these 
proteins. 


Lateral Membrane Diffusion Is Restricted by Barriers and Fences A variety of 
studies of lateral diffusion rates in membranes have shown that membrane proteins 
and lipids in plasma membranes diffuse laterally at a rate 5 to 50 times slower than 
those of artificial lipid membranes. Why should this be? Part of the answer has come 
from single particle tracking experiments (Figure 9.29), which reveal that lipids and 
at least some membrane proteins tend to undergo hop diffusion, such that they can 
diffuse freely within a membrane “compartment” for a time and then hop to an ad- 
jacent compartment, where the process repeats. Akihiro Kusumi and colleagues have 
proposed the membrane-skeleton fence model to explain this behavior, suggesting 
that certain proteins that comprise the cytoskeleton—a network of proteins on the 
cytoplasmic face of the plasma membrane—restrict the lateral diffusion of other 
membrane proteins (Figure 9.30). The “fence” proteins may include spectrin, a fila- 
mentous cytoskeletal protein in red blood cells, and actin, a cytoskeletal protein 
found in many other eukaryotic cells. The single particle tracking experiments show 
that lipid molecules are typically confined within fenced compartments for approxi- 
mately 13 to 15 msec, whereas transmembrane proteins are typically confined for 45 
to 65 msec. Even lipids in the outer leaflet of the plasma membrane undergo hop 
diffusion, leading Kusumi and colleagues to postulate that transmembrane proteins 
act as rows of “pickets” extending across both monolayers in these membrane fences. 
Fences thus define regions of relatively unrestricted lipid diffusion. 


Lipids and Proteins Direct Dynamic Membrane Remodeling and Curvature The 
complex shapes of cells and organelles are the result of forces that operate on their 
membranes, and these forces in turn are orchestrated by lipids and proteins. 





1pm 


FIGURE 9.29 Motions of a single (fluorescently labeled) 
lipid molecule on the surface of a cell can be measured 
by video fluorescence microscopy. Video recording at 
40,000 frames per second yields a time resolution of 

25 microseconds. Data collected over 62 milliseconds 

(a total of 2500 frames) show that a lipid diffuses rapidly 
within small domains (defined by colors) and occasional- 
ly jumps or hops to an adjacent region (shown as a color 
change). (Adapted from Kusumi, A, et al, 2005. Paradigm shift of 
the plasma membrane concept from the two-dimensional con- 
tinuum fluid to the partitioned fluid: High-speed single-molecule 
tracking of membrane molecules. Annual Review of Biophysics 
and Biomolecular Structure 34:35 1-378.) 
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FIGURE 9.30 Studies of single lipid molecule movement 
in membranes (Figure 9.29) are consistent with a“com- 
partmentalized” model for the membrane, in which 
lipids and proteins undergo short-term diffusion within 
“fenced” compartments, with occasional hops to adja- 
cent compartments. Akihiro Kusumi has suggested that 
elements of the cytoskeleton may define the fence 
boundaries at the membrane. 





Membranes change their shapes in special ways during movement, cell division, and 
other cellular events. This dynamic membrane remodeling is also accomplished by 
the interplay of lipids and proteins. The various membrane subdomains with par- 
ticular curvatures have precise and specialized biological properties and functions. 

There are several ways to induce curvature in a membrane (Figure 9.31). Lipids 
can influence or accommodate membrane curvature, either because of lipid mole- 
cule geometry or because of an imbalance in the number of lipids in the inner and 
outer leaflets of the bilayer. (In a liposome of 50-nm diameter, there is 56% more 
lipid in the outer leaflet than in the inner leaflet.) Integral membrane proteins with 
conical shapes can promote membrane curvature. The structure of a voltage-gated 
K* channel is an example of a shape conducive to membrane curvature (see Figure 
9.41). Proteins of the cytoskeleton, such as actin, typically contact the plasma mem- 
brane and can generate curvature by rearrangements of their own structure. More- 
over, motor proteins (see Chapter 16) moving along filaments of the cytoskeleton 
can generate curvature in the membrane. Scaffolding proteins, which can bind on 
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FIGURE 9.31 Membrane curvature can occur by several different mechanisms, including (a) changes in lipid 
composition, (b) insertion of membrane proteins that have intrinsic curvature or that oligomerize, (c) insertion 
of amphipathic helices into one leaflet of the bilayer, (d) interaction of the bilayer with molecular scaffolding 
proteins, or (e) changes induced by the cytoskeletal filaments inside the cell. (Adapted from McMahon, H.T. and 
Gallop, J. L., 2005. Membrane curvature and mechanisms of dynamic cell membrane remodeling. Nature 438:590-596.) 


9.4 What Are the Dynamic Processes That Modulate Membrane Function? 267 





BAR domain dimer 


Helix 1 insert (H1I) 






a N-terminal amphipathic 
helix 


either side of the plasma membrane, can influence membrane curvature in many 
ways. For example, BAR domains are dimeric, banana-shaped structures (Figure 
9.32) that bind preferentially to and stabilize curved regions of the plasma mem- 
brane. Finally, amphipathic a-helices can insert into bilayers, parallel to the mem- 
brane surface, thus forcing curvature on the membrane. N-BAR domains are BAR 
domains that have an N-terminal a-helix preceding the BAR domain. The helix typ- 
ically inserts to induce curvature, and the BAR domain binds to stabilize the curved 
structure. Harvey McMahon and his colleagues have proposed a structure (Figure 
9.32b) for N-BAR domain-mediated membrane curvature by endophilin-Al, a pro- 
tein found at synapses and implicated in the formation of synaptic vesicles. Mem- 
brane curvature is essential to a variety of cellular functions, including cell division, 
viral budding, and the processes of endocytosis and exocytosis, described in the 
next section. 


Caveolins and Caveolae Respond to Plasma Membrane Changes _ Caveolae are 
flask-shaped indentations in plasma membranes. Caveolae (Figure 9.33) are rich in 
cholesterol, sphingolipids, and caveolin, an integral membrane protein of 22,000 
MW. There are three members of the caveolin family. CAV] and CAV2 are found in 
endothelial, fibrous, and adipose (fat) tissue, whereas CAV3 is unique to skeletal 
muscle. Caveolins form homodimers in the plasma membrane, with both N- and 
C-termini oriented toward the cytosolic face of the membrane. The C-terminal 
domain has several palmitoyl lipid anchors and is separated from the N-terminal 
oligomerization domain by a 33-residue intramembrane hairpin domain. A typical 
caveolar structure consists of about 144 caveolin molecules, with up to 20,000 cho- 
lesterol molecules. Caveolae participate in mechanosensation, the detection and 
sensing of mechanical forces at the membrane, and mechanotransduction, the con- 
version of mechanical forces into biochemical signals that result in cell responses 
that regulate cell growth, differentiation, cell shape, and cell death. 


Vesicle Formation and Fusion Are Essential Membrane Processes The membranes 
of cells are not static. Normal cell function requires that the various membrane- 
enclosed compartments of the cell constantly reorganize and exchange proteins, 
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FIGURE 9.34 Vesicle-mediated transport in cells involves 
budding of vesicles from a donor membrane, followed 
by fusion of the vesicle membrane with the membrane 
of a target compartment, a process that transfers the 
contents of the donor compartment, as well as selected 
membrane proteins. (Adapted from Alberts, B, 2007. Molecular 
Biology of the Cell, 5th edition. New York, Garland Science.) 
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FIGURE 9.33 (a) Caveolin possesses a central hydrophobic segment flanked by three covalently bound fatty 
acyl anchors on the C-terminal side and a scaffolding domain on the N-terminal side. (b) Approximately 

144 molecules of caveolin combine to force curvature in the lipid bilayer and form a caveolar structure. A 
caveola may also contain as many as 20,000 cholesterol molecules. (Adapted from Parton, R. G., and K. Simons, 2007. 
The multiple faces of caveolae. Nature Reviews Molecular Cell Biology 8:185-194.) 


lipids, and other materials. These processes, and others such as cell division, exocyto- 
sis, endocytosis, and viral infection, all involve either fusion of one membrane with an- 
other or budding and separation of a vesicle from a membrane. Eukaryotic organelles 
communicate with one another by the exchange of “trafficking vesicles.” Vesicles are 
generated at a precursor membrane, transported to their destination, and then fused 
with the target compartment (Figure 9.34). Although the organelles involved in these 
processes are diverse—endoplasmic reticulum, Golgi, endosomes, and others—the ba- 
sic reactions of budding and fusion are accomplished by protein families and multi- 
protein complexes that have been conserved throughout eukaryotic evolution. The 
molecular events of exocytotic release of neurotransmitters into the synapses of nerve 
cells are good examples of such processes. 

Neurons communicate with one another by converting electrical signals into 
chemical signals and back again. When electrical signals arrive at the synapse, vesi- 
cles containing neurotransmitters (such as acetylcholine—see Chapter 32) fuse 
with the plasma membrane, releasing the neurotransmitters into the synaptic cleft. 
Binding of neurotransmitters to receptors on an adjacent neuron generates an elec- 
trical signal that is passed along. The fusion of vesicles with the plasma membrane 
is directed by SNAREs—a family of proteins that “snare” vesicles to initiate the fu- 
sion process. (The acronym, a somewhat strained effort to describe their function 
cleverly, stands for soluble Methylmaleimide-sensitive factor attachment protein 
receptor.) SNAREs are small proteins with a simple domain structure (Figure 9.35) 
that includes a SNARE motif, consisting of 60 to 70 residues of classical 7-residue 
repeats (see Chapter 6). The N-terminal domains are variable across the SNARE 
family, but at their C-termini most SNAREs have a single transmembrane domain 
joined to the SNARE motif by a short linker. Q,,SSNARE and Qpe SNARE are named 
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for a conserved glutamine (Q) residue, whereas R-SNARE is named for a conserved 
arginine (R). Q,-SNARE, also known as SNAP-25, consists of two SNARE domains 
joined by a linker with two palmitoyl lipid anchors. 

Q-SNAREs are organized in clusters on the plasma membrane and can form 
acceptor complexes (Figure 9.36). When a neurotransmitter-laden vesicle ap- 
proaches the plasma membrane, Q,-"SNARE and Q -SNARE on the plasma mem- 
brane join with R-SNARE on the vesicle to form a loose trans-complex through the 
N-terminal ends of their SNARE motifs. The four SNARE motifs in these three pro- 
teins “zip up” to form an increasingly stable helical complex (Figure 9.36), pulling 
the two membranes together and inducing the binding of complexin, a small heli- 
cal protein. Complexin binding “clamps” the complex so that it is poised for mem- 
brane fusion but is unable to complete the process. Arrival of an action potential 
(electrical signal—see Chapter 32) triggers flow of Ca** ions into the cell through 
channel proteins, and binding of Ca** ion to synaptotagmin displaces complexin 
and promotes joining of the membranes (to form the cis-complex) and the forma- 
tion of a fusion pore. The complexin clamp (Figure 9.36) ensures that neurotrans- 
mitter release can occur in an instant following Ca** influx, because the slow steps 
of SNARE assembly have already been completed. 


9:51 How Does Transport Occur Across Biological Membranes? 


Transport processes are vitally important to all life forms, because all cells must ex- 
change materials with their environment. Cells obviously must have ways to bring nu- 
trient molecules into the cell and ways to send waste products and toxic substances 
out. Also, inorganic electrolytes must be able to pass in and out of cells and across or- 
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FIGURE 9.35 (a) The domain structure of the SNARE 
protein families. A variety of N-terminal domains are 
found in Qa SNARE proteins, including the three-helix 
bundle of syntaxin-1 (pdb id = 1BRO); (b) Qu. SNAREs 
are anchored in the membrane by palmitic acid lipid 
anchors; (c) Many R SNAREs contain small globular 
N-terminal domains such as Vam7, a PX-homology 
domain (pdb id = 10CS). (Adapted from Jahn, R., and R.H. 
Scheller, 2006. SNAREs—engines for membrane fusion. Nature 
Reviews Molecular Cell Biology 7:631-643.) 
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FIGURE 9.36 SNARE complex assembly and its control. Step 1: Q SNARES, organized in clusters, assemble into 
acceptor complexes in the plasma membrane. Step 2: Acceptor complexes interact with R-SNAREs in an 
approaching vesicle through the N-terminal end of the SNARE motifs, forming a four-helical transcomplex. 
Step 3: The transcomplex tightens or “zips up,” but membrane fusion and pore formation is prevented by bind- 
ing of complexin. Step 4: Arrival of an action potential (nerve impulse) triggers displacement of complexin by 
synaptotagmin, initiating fusion and pore formation. Step 5: Upon completion of the fusion process, the 
transcomplex relaxes. Step 6: SNARES are redistributed to their respective membrane domains and vesicles are 
reformed. (Adapted from Jahn, R, and R. H. Scheller, 2006. SNAREs—engines for membrane fusion. Nature Reviews Molecular 
Cell Biology 7:631-643.) 


ganelle membranes. All cells maintain concentration gradients of various metabolites 
across their plasma membranes and also across the membranes of intracellular or- 
ganelles. By their very nature, cells maintain a very large amount of potential energy 
in the form of such concentration gradients. Sodium and potassium ion gradients 
across the plasma membrane mediate the transmission of nerve impulses and the nor- 
mal functions of the brain, heart, kidneys, and liver, among other organs. Storage and 
release of calcium from cellular compartments controls muscle contraction, as well as 
the response of many cells to hormonal signals. High acid concentrations in the stom- 
ach are required for the digestion of food. Extremely high hydrogen ion gradients are 
maintained across the plasma membranes of the mucosal cells lining the stomach in 
order to maintain high acid levels in the stomach. 

We shall consider the molecules and mechanisms that mediate these transport 
activities. In nearly every case, the molecule or ion transported is water soluble, yet 
moves across the hydrophobic, impermeable lipid membrane at a rate high enough 
to serve the metabolic and physiological needs of the cell. This perplexing problem 
is solved in each case by a specific transport protein. The transported species either 
diffuses through a channel-forming protein or is carried by a carrier protein. Trans- 
port proteins are all classed as integral membrane proteins. 

From a thermodynamic and kinetic perspective, there are only three types of 
membrane transport processes: passive diffusion, facilitated diffusion, and active trans- 
port. To be thoroughly appreciated, membrane transport phenomena must be con- 


sidered in terms of thermodynamics. Some of the important kinetic considerations 
also will be discussed. 


9:6) What Is Passive Diffusion? 


Passive diffusion is the simplest transport process. In passive diffusion, the trans- 
ported species moves across the membrane in the thermodynamically favored direc- 
tion without the help of any specific transport system/molecule. For an uncharged 
molecule, passive diffusion is an entropic process, in which movement of molecules 
across the membrane proceeds until the concentration of the substance on both 
sides of the membrane is the same. For an uncharged molecule, the free energy dif- 
ference between side 1 and side 2 of a membrane (Figure 9.37) is given by 


[Cə] 
[Ci] 


The difference in concentrations, [C2] — [C1], is termed the concentration gradient, 
and AG here is the chemical potential difference. 





AG= G— G = RT]n (9.1) 


Charged Species May Cross Membranes by Passive Diffusion 


For a charged species, the situation is slightly more complicated. In this case, the 
movement of a molecule across a membrane depends on its electrochemical po- 
tential. This is given by 





[C2] 
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(9.2) 
where Z is the charge on the transported species, F is Faraday’s constant (the 
charge on 1 mole of electrons = 96,485 coulombs/mol = 96,485 joules/volt-mol, 
since 1 volt = 1 joule/coulomb), and Ay is the electric potential difference (that is, 
voltage difference) across the membrane. The second term in the expression thus 
accounts for the movement of a charge across a potential difference. Note that the 
effect of this second term on AG depends on the magnitude and the sign of both Z 
and Aw. 

For example, as shown in Figure 9.38, if side 2 has a higher potential than side 1 
(so that Ay is positive), for a negatively charged ion the term Z¥AW makes a nega- 
tive contribution to AG. In other words, the negative charge is spontaneously at- 
tracted to the more positive potential—and AG is negative. In any case, if the sum 
of the two terms on the right side of Equation 9.2 is a negative number, transport 
of the ion in question from side 1 to side 2 would occur spontaneously. The driving 
force for passive transport is the AG term for the transported species itself. 


9:7) How Does Facilitated Diffusion Occur? 


The transport of many substances across simple lipid bilayer membranes via passive 
diffusion is far too slow to sustain life processes. On the other hand, the transport 
rates for many ions and small molecules across actual biological membranes are 
much higher than anticipated from passive diffusion alone. This difference is due 
to specific proteins in the cell membranes that facilitate transport of these species 
across the membrane. Proteins capable of effecting facilitated diffusion of a variety 
of solutes are present in essentially all natural membranes. Such proteins have two 
features in common: (1) They facilitate net movement of solutes only in the ther- 
modynamically favored direction (that is, AG < 0), and (2) they display a measur- 
able affinity and specificity for the transported solute. Consequently, facilitated dif- 
fusion rates display saturation behavior similar to that observed with substrate 
binding by enzymes (see Chapter 13). Such behavior provides a simple means for 
distinguishing between passive diffusion and facilitated diffusion experimentally. 
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CENGAGENOW” ACTIVE FIGURE 9.37 Passive diffusion 
of an uncharged species across a membrane depends 
only on the concentrations (C, and G) on the two sides 
of the membrane. Test yourself on the concepts in 
this figure at www.cengage.com/login. 
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CENGAGENOW” ACTIVE FIGURE 9.38 The passive diffu- 
sion of a charged species across a membrane depends 
on the concentration and also on the charge of the par- 
ticle, Z, and the electrical potential difference across the 
membrane, Aw. Test yourself on the concepts in this 
figure at www.cengage.com/login. 
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FIGURE 9.39 Passive diffusion and facilitated diffusion 
may be distinguished graphically. The plots for facili- 
tated diffusion are similar to plots of enzyme-catalyzed 
processes (see Chapter 13), and they display saturation 
behavior. 


The dependence of transport rate on solute concentration takes the form of a rec- 
tangular hyperbola (Figure 9.39), so the transport rate approaches a limiting value, 
Vinax» at very high solute concentration. Figure 9.39 also shows the graphical behav- 
ior exhibited by simple passive diffusion. Because passive diffusion does not involve 
formation of a specific solute:protein complex, the plot of rate versus concentration 
is linear, not hyperbolic. 


Membrane Channel Proteins Facilitate Diffusion 


The structures of hundreds of membrane proteins have been determined by 
X-ray diffraction and NMR spectroscopy. Many of these proteins are membrane 
transport channels that carry out facilitated diffusion (Figure 9.40). In contrast to 
active transport systems (or “pumps”) like Na*, K*-ATPase and Ca?*-ATPase, chan- 
nels simply enable the (energetically passive) downhill movement of ions and other 
molecules. However, active pumps and most channels share one fundamental prop- 
erty: an ability to transport species in a selective manner. Molecular selectivity is cru- 
cial to the operation of both pumps and channels. 

The membrane channel structures determined to date have revealed some of 
nature’s strategies for moving ions and molecules across biological membranes. 
Channel composition can take several forms. A single channel pore can be 
formed from dimers, trimers, tetramers, or pentamers of protein subunits (for ex- 
ample, channels for Nat, K*, Mg?*, and glutamate; see Table 9.3). On the other 
hand, multimeric assemblies in which each subunit has its own pore are known 
(in channels for Cl”, NHs, water, and glycerol). Figure 9.40 presents several of the 
known channel structures, including channels for K+, C17, NH3, HO, glycerol, 
glutamate, and proteins themselves. Several recurring themes are apparent from 
these structures: 


e Each of these channels possesses a selectivity filter—a group of amino acid 
residues that selects for and binds the transported species as a prelude to transport. 

e In several of these channels (for example, the K* and glutamate channels, as well 
as the Na* channel from B. cereus), the protein creates an aqueous cavity or 
vestibule (sometimes reaching more than halfway across the bilayer) so that the 
transported species can reach the selectivity filter deep in the membrane by sim- 
ple diffusion. 

e Other channels do not possess large aqueous vestibules. The chloride, water, 
glycerol, and ammonia channels employ “funnels” on either side of the mem- 
brane. These funnels lead to narrow constrictions—the selectivity filters—at the 
middle of the bilayer. When viewed parallel to the membrane, the two funnels 
are seen to be related by a pseudo-twofold axis of symmetry. 

e Selectivity filters often consist of a channel that binds multiple transported 
species. Thus, the chloride channel binds two Cl” ions, the ammonia channel 
binds three ammonia molecules, and the potassium channel binds four K* ions. 

e Most membrane channels are “gated”—that is, in response to a triggering signal, 
they undergo a conformational change that opens the channel. Gating may be 
signaled by binding of an ion, a small organic molecule, or even a protein. Some 
channels are voltage-gated and open and close in response to a change of mem- 
brane electrical potential (that is, voltage). The conformation change that gates 
a channel can be a substantial rearrangement of the protein structure or merely 
a movement of a single residue. 


These recurring themes are illustrated particularly well by the K* channels charac- 
terized by Roderick MacKinnon and his colleagues. 


Potassium Channels Combine High Selectivity with High Conduction Rates 
Potassium transport (that is, conduction) is essential for many cell processes, in- 
cluding regulation of cell volume, electrical impulse formation (in electrically ex- 
citable cells, such as neurons), and secretion of hormones; all cells thus conduct K* 
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FIGURE 9.40 The structures of channel proteins that transport (a) glycerol (pdb id = 1FX8), (b) glutamate (pdb id 
= 1XFH), (c) ammonia (pdb id = 1U7G), (d) chloride (pdb id = 10TS), (e) potassium (pdb id = 2A79), (f) water 
(pdb id = 1J4N), and (g) proteins (pdb id = 1RHZ).In all cases, the view is in the plane of the membrane. Note the 
transmembrane a-helices in the membrane-spanning part of each structure. 
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FIGURE 9.42 Model for outward and inward transport 
through the KcsA potassium channel. The selectivity fil- 
ter in the channel contains four K*-binding sites, only 
two of which are filled at any time. 


(c) 





FIGURE 9.41 Structure of the KcsA potassium channel from Streptococcus lividans (pdb id = 1K4C). (a) The four 
identical subunits of the channel, which surround a central pore, are shown in different colors. (b) Each subunit 
contributes three a-helices (blue, green, red) to the tetramer structure. (c) The selectivity filter is made from 
loops from each of the subunits, two of which are shown here. (d) The tetrameric channel, as viewed through 
the pore. 


ions across the cell membrane. Potassium channels are facilitated diffusion devices, 
conducting K* down the electrochemical gradient for K*. Whether found in bacte- 
ria, archaea, plants, or animals, all known potassium channels are members of a sin- 
gle protein family. Potassium channels have two important characteristics: They are 
highly selective for K* ions over Na* ions, and they conduct K* ions at very high 
rates (almost as fast as any entity can diffuse in water—the so-called diffusion limit). 

All K* channels have two essential structural features: (1) a selectivity filter, a 
structural element that allows K* to pass through the channel but prevents passage 
of Na‘, and (2) a gate, a structure that opens and closes the channel. Some K* 
channels are ligand-gated, such that an ion, a small organic molecule, or even an- 
other protein can open the gate by binding to the channel. Other K* channels are 
voltage-gated, in which a portion of the channel protein is able to move (and open 
or close the channel) in response to a change in voltage across the membrane. The 
selectivity filter is conserved and nearly identical across all organisms, whereas gat- 
ing mechanisms are diverse and varied. 

The structure of KesA, the K* channel from Streptococcus lividans, is typical 
(Figure 9.41). The structure consists of four identical subunits, and, facing the cy- 
tosol, it has a water-filled pore that traverses more than half of the membrane bi- 
layer, ending at the selectivity filter. A hydrated K* ion is suspended in the center of 
the pore. Each subunit contributes three a-helices to the pore structure: two trans- 
membrane helices (M1 and M2, the outer and inner helices, respectively) and one 
helix that extends only halfway across the membrane, with its C-terminal end (with 
a partial negative charge) facing the center of the pore. 

The selectivity filter in KcsA consists of four pentapeptides, one from each sub- 
unit, with the sequence TVGYG. The backbone carbonyls of the first four residues 
and the threonine side-chain oxygen—evenly spaced—face the center of the pore. 
These oxygens create four possible K*-binding sites. In each site, a bound, dehy- 
drated K* is surrounded by eight oxygens from the protein: four above and four be- 
low. The arrangement of protein oxygens at each site is very similar to the arrange- 
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ment of water molecules around a hydrated K*. This simple structure is strikingly 
selective for K*. The physical basis for selection between K* and Na* is the atomic 
radius—1.33 A for K* and 0.95 A for Na’. Still, Kt channels select for Kt over Nat 
by a factor of more than a thousand! 

As K* moves through the KcsA channel, there are, on average, two K* ions bound 
in the selectivity filter at any given time, either in positions 1 and 3 or positions 2 and 
4 (with water molecules occupying the other positions). Ions can move in either di- 
rection across the channel, depending on the existing electrochemical gradient. 
One K* enters the channel from one side as a different ion exits on the other side. 
The cycle of steps for inward or outward movement is shown in Figure 9.42. 

High selectivity, along with high conduction rates, seems at first paradoxical. If 
K* ions bind too tightly in the filter, they could not move quickly through the pore. 
Two factors keep the binding just tight enough, but not too tight: (1) repulsion be- 
tween the closely spaced K* ions at their two sites and (2) a conformational change 
induced by K* binding. At low K* concentrations, the filter conformation is very dif- 
ferent and only one K* can bind at a time. When K* concentration increases, some 
ion-binding energy is used to induce the conformation change that creates a more 
symmetric pore, weakening K* binding. Weaker binding makes higher conduction 
rates possible. 

The KcsA channel is gated by intracellular pH. It is closed at neutral pH and 
above, and it opens at acidic pH. What is the conformational change that opens this 
and other K* channels? After comparing the closed pore conformation of KcsA 
with the opened pore conformation of the related MthK channel (Figure 9.43), 
MacKinnon has proposed that helix bending and rearrangement deep in the mem- 
brane opens K* channels. The inner helices obstruct the central pore in the closed 
conformation. However, bending at a glycine residue near the center of the mem- 
brane splays the inner helices outward from the channel center, allowing free access 
for ions between the cytosol and the selectivity filter. This critical Gly residue is con- 
served in most K* channel sequences, making this a likely gating mechanism for 
most K* channels. 


The B. cereus NaK Channel Uses a Variation on the K* Selectivity Filter 


Could the K* channel selectivity filter be modified to accommodate other ions, for 
instance Na*? Comparison of amino acid sequences from a variety of ion channels 
(Figure 9.44) shows that this is indeed the case. Variations on the TVGYG filter se- 
quence are found in ion channels with a range of selectivities for K*, Na*, and even 
Ca?*. Bacillus cereus contains an ion channel with equal preference for Nat and K* 
that is similar to the transient receptor potential (TRP) channels found widely in eu- 
karyotes. The structure of this channel (Figure 9.45) is similar in many ways to the 
K* channels, but the selectivity filter sequence of this NaK channel is TVGDG. The 
substitution of D for Y changes the filter in several ways. Binding sites 1 and 2, the 
sites most selective for K*, are eliminated, leaving a “pore vestibule” that can 
accommodate an ion but not bind it tightly. The remaining sites, binding sites 3 and 
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FIGURE 9.43 Comparison of the closed (pdb id = 1K4C) 
and open (pdb id = 1LNQ) states of the potassium 
channel. 


Sequence of selectivity filter Selective for 


WV GWG ID IL We Kt 
T G GNFSP Nat, Kt 
LTG DWNSV Cadi 


FIGURE 9.44 lon selectivity in cation channels is a func- 
tion of peptide sequence of the selectivity filter. 
Conserved glycines in the TVGYG motif of the KcsA 
potassium channel are shown in red. Amino acids that 
are chemically similar are yellow. From top to bottom, 
the selectivity for KT over Na* decreases and the selec- 
tivity for Ca?* increases. (Adapted from Zagotta, W. N., 2006. 
Permutations of permeability. Nature 440:427-428.) 
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FIGURE 9.46 The structure of the pentameric CorA Mg?* 
channel from Thermus maritima (pdb id = 2HN2). The 
transmembrane pore is formed from five short a-helices 
(red) and stabilized by four longer a-helices (aqua). The 
pore entrance from the periplasm is gated closed by a 
ring of 5 Asn residues (gold), and a ring of 5 Leu (orange) 
and 5 Met (beige) residues narrows the pore to 2.5 A. 
The large cytosolic domain includes a basic sphincter of 
20 Lys residues (blue), and a ring of 50 Asp and Glu 
residues (green) at the tips of the willow helices. Asp*? 
and Asp?” (purple) participate in Mg?* binding. 


(a) (b) (c) 


Extracellular 





Intracellular ° 


FIGURE 9.45 Structure of the channel from Bacillus cereus (pdb id = 2AHY), which has equal preference for Na* 
and K+. (a) One subunit of the tetramer has been removed to reveal the five ion binding sites in the center of 
the channel. A Ca* ion is bound at the extracellular entrance to the channel (aqua, top), and Kt is bound to 
the other four sites. (b) The tetrameric channel, as viewed through the pore. (c) Substitution of D for Y in the 
selectivity filter eliminates binding sites 1 and 2, leaving a pore vestibule that binds a K* with low affinity. Sites 
3 and 4 are preserved, but bind Na* and K+ equally well. The bottom site contains a fully hydrated K+, in a 
manner similar to the KcsA channel (Figure 9.41). 


4, bind Nat and K* equally well. In addition, the D for Y substitution creates a Ca?*- 
binding site at the extracellular entrance to the selectivity filter (see Figure 9.45c). 
It appears likely that variations of the selectivity filter sequence can “tune” it to ac- 
commodate and select for a variety of transported cations (Figure 9.45). 


CorA Is a Pentameric Mg?* Channel 


The transport of Mg?* in bacteria and archaea is accomplished primarily by the 
CorA family of membrane channels. Its pentameric structure (Figure 9.46) contrasts 
with the tetrameric K* and NaK channels in several ways. With a large N-terminal 
cytosolic domain and C-terminal transmembrane domain, it resembles a funnel or 
cone. One of the two transmembrane a-helices extends 100 A into the cytosol and is 
the longest continuous a-helix in any known protein. Remarkable features of the 
structure include a ring of 20 lysine residues around the outside of the structure near 
the membrane-cytosol interface and a cluster of 50 aspartate and glutamate residues 
(on so-called willow helices) adjacent to the lysines and extending into the cytosol. 
The available structures of CorA are closed-pore structures. A ring of five Asn”! 
residues blocks the opening to the pore on the periplasmic face, a ring of Met”! 
residues narrows the pore to 3.3 A at the center of the membrane, and a ring of 
Leu?” residues reduce the pore diameter to 2.5 A at the cytosolic face of the mem- 
brane. Gating of the pore must overcome these obstacles, as well as the repulsive ring 
of positive Lys side chains. It is tempting to imagine that the long a-helix and the 
negatively charged willow helices could act as a lever to pry apart the repulsive ring 
of lysines and open the Mg™ pore. 


Chloride, Water, Glycerol, and Ammonia Flow Through 
Single-Subunit Pores 


Membrane channels can also be formed within a single subunit of a protein. The 
CIC channels (ubiquitous in cells from bacteria to animals) are homodimeric, 
each subunit having two similar halves, with opposite orientation in the mem- 
brane. The CIC pore is hourglass-shaped, with a 15-A-long selectivity filter in the 
middle (Figure 9.47a). The filter contains 3 Cl--binding sites, with coordination 
from Tyr and Ser hydroxyls and several peptide backbone NH groups. The C1- 
binding site nearest the extracellular solution can be occupied either by a Cl” ion 
or by a glutamate carboxyl group. With the glutamate carboxyl in place, the pore 
is closed, but an increase in Cl” concentration can displace the Glu side chain, 
with Cl~ binding to this position and opening the pore. Thus, this Cl~ channel is 
chloride-gated. 


9.8 How Does Energy Input Drive Active Transport Processes? 277 


(b) (c) 





Channel proteins often solve chemical and thermodynamic problems in innova- 
tive ways. Ion selectivity, for example, requires that ions be dehydrated in the chan- 
nel, and dehydration is energetically expensive. Binding sites have to compensate 
for the energetic cost of dehydration by providing favorable compensatory interac- 
tions between the ion and the binding amino acid residues. The ammonia transport 
channel solves a different problem. Ammonia is a gas, but the protonated ammo- 
nium ion, NH,4", is the species that diffuses to the channel opening. The transport 
channel in this case is a hydrophobic pore 20 A in length (Figure 9.47b). The 
hydrophobic character of the channel lowers the pK, of ammonium from its nor- 
mal 9.25 to less than 6, facilitating the transport of NH; but not the monovalent 
cation, NH4*. In the narrow hydrophobic channel, His'® and His*!* line the pore 
and stabilize three NH; molecules through hydrogen bonding. On either side of 
the narrow pore, broad vestibules contain NH; in equilibrium with NH4”. 

Another example of adaptation to the transported species is the tetrameric glyc- 
erol channel GIpF from £. coli, with a transport pore in each monomer. Six trans- 
membrane helices and two halfmembrane-spanning helices form a right-handed 
helical bundle around each channel (Figure 9.47c). Glycerol molecules taken up by 
an E. coli cell first enter a 15-A-wide vestibule in the transport protein, becoming 
progressively dehydrated before entry into a 28-A amphipathic channel and selec- 
tivity filter. The channel accommodates three glycerol molecules, lined up in a sin- 
gle file, with their alkyl backbones wedged against a hydrophobic corner and their 
hydroxyl groups forming hydrogen bonds with the side chains of channel residues. 

The aquaporin water channels are closely related to the GlpF glycerol channel, 
with tetrameric structures and similar right-handed helical bundles forming trans- 
port channels. Selection for water or glycerol in these proteins is based on subtle 
differences in the selectivity filters within the transport channels. For example, the 
Phe and Trp residues that comprise the hydrophobic corner surrounding the alkyl 
moiety of the middle glycerol site in GlpF are replaced by a His residue in the cor- 
responding water-binding site in aquaporin Apql. 
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Passive and facilitated diffusion systems are relatively simple, in the sense that the 
transported species flow downhill energetically, that is, from high concentration to 
low concentration. However, other transport processes in biological systems must be 
driven in an energetic sense. In these cases, the transported species move from low 
concentration to high concentration, and thus the transport requires energy input. 
As such, it is considered active transport. The most common energy input is ATP 
hydrolysis, with hydrolysis being tightly coupled to the transport event. Other energy 
sources also drive active transport processes, including light energy and the energy 
stored in ion gradients. The original ion gradient is said to arise from a primary active 
transport process, and the transport that depends on the ion gradient for its energy 
input is referred to as a secondary active transport process (see later discussion of 


FIGURE 9.47 Structures of channels for (a) chloride, 
(b) ammonia, and (c) glycerol. All structures are axial 
views. (a) The CIC chloride channel from E. coli (pdb id 
= 10TS). (b) The AmtB ammonia channel from E. coli 
with four bound NH,* (pdb id = 1U7G). (c) The GlpF 
glycerol channel from E. coli, with bound glycerol (pdb 
id = 1FX8). 
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the E. coli proton—drug exchanger). When transport results in a net movement of 
electric charge across the membrane, it is referred to as electrogenic transport. 
If no net movement of charge occurs during transport, the process is electrically 
neutral. 


All Active Transport Systems Are Energy-Coupling Devices 


Hydrolysis of ATP is essentially a chemical process, whereas movement of species 
across a membrane is a mechanical process (that is, movement). An active transport 
process that depends on ATP hydrolysis thus couples chemical free energy to me- 
chanical (translational) free energy. The bacteriorhodopsin protein in Halobacterium 
halobium couples light energy and mechanical energy. Oxidative phosphorylation 
(see Chapter 20) involves coupling between electron transport, proton transloca- 
tion, and the capture of chemical energy in the form of ATP synthesis. Similarly, the 
overall process of photosynthesis (see Chapter 21) amounts to a coupling between 
captured light energy, proton translocation, and chemical energy stored in ATP. 


Many Active Transport Processes are Driven by ATP 


Monovalent Cation Transport: Na*,Kt-ATPase All animal cells actively extrude 
Na‘ ions and accumulate K* ions. These two transport processes are driven by 
Na‘*,K*-ATPase, also known as the sodium pump, an integral protein of the plasma 
membrane. Most animal cells maintain cytosolic concentrations of Na* and K* of 
10 mM and 100 mM, respectively. The extracellular milieu typically contains about 
100 to 140 mM Na* and 5 to 10 mM K+*. Potassium is required within the cell to ac- 
tivate a variety of processes, whereas high intracellular sodium concentrations are 
inhibitory. The transmembrane gradients of Nat and K* and the attendant gradi- 
ents of Cl” and other ions provide the means by which neurons communicate. They 
also serve to regulate cellular volume and shape. Animal cells also depend upon 
these Na* and K* gradients to drive transport processes involving amino acids, sug- 
ars, nucleotides, and other substances. In fact, maintenance of these Na* and K+ 
gradients consumes large amounts of energy in animal cells—20% to 40% of total 
metabolic energy in many cases and up to 70% in neural tissue. 

The Na*- and K*-dependent ATPase comprises three subunits: an a-subunit of 
1016 residues (120 kD), a 35-kD B-subunit, and a 6.5-kD y-subunit. The sodium 
pump actively pumps three Na* ions out of the cell and two K* ions into the cell 
per ATP hydrolyzed: 


ATP*- + HO + 3 Nat (inside) + 2 K* (outside) —> ADP- + HPO, + 
3 Na* (outside) + 2 K*(inside) (9.3) 


ATP hydrolysis occurs on the cytoplasmic side of the membrane (Figure 9.48), and 
the net movement of one positive charge outward per cycle makes the sodium 
pump electrogenic in nature. 

The a-subunit of Nat,K*-ATPase consists of ten transmembrane a-helices, with 
three cytoplasmic domains, denoted A, P, and N. A large cytoplasmic loop between 
transmembrane helices 4 and 5 forms the P (phosphorylation) and N (nucleotide- 
binding) domains. The enzyme is covalently phosphorylated at Asp*® during ATP 
hydrolysis. The crystal structure of the enzyme reveals two rubidium ions bound to 
putative K*-binding sites in the center of the protein (Figure 9.48). 

A minimal mechanism for Na*,K*-ATPase postulates that the enzyme cycles be- 
tween two principal conformations, denoted E; and Es (Figure 9.49). E; has a 
high affinity for Nat and ATP and is rapidly phosphorylated in the presence of 
Me?* to form E-P, a state that contains three occluded Na* ions (occluded in the sense 
that they are tightly bound and not easily dissociated from the enzyme in this con- 
formation). A conformation change yields E»-P, a form of the enzyme with relatively 
low affinity for Na* but a high affinity for K*. This state presumably releases 3 Na* 
ions and binds 2 K* ions on the outside of the cell. Dephosphorylation leaves EKo, 
a form of the enzyme with two occluded K* ions. A conformation change, which ap- 
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CENGAGENOW” ANIMATED FIGURE 9.48 (a) A schematic diagram of the Na*,Kt-ATPase of the mammalian 
plasma membrane. ATP hydrolysis occurs on the cytoplasmic side of the membrane, Na* ions are transported 
out of the cell, and K* ions are transported in. The transport stoichiometry is 3 Na* out and 2 K* in per ATP 
hydrolyzed. Ouabain and other cardiac glycosides inhibit Na*,K*-ATPase by binding on the extracellular 
surface of the pump protein. (b) Structure of the Na*,K*-ATPase, showing the a-subunit, residues 28-73 of the 
B-subunit (gray) and the transmembrane helix (residues 23-51, yellow) of the y-subunit (pdb id = 3B8E). See 
this figure animated at www.cengage.com/login. 


pears to be accelerated by the binding of ATP (with a relatively low affinity), releases 
the bound K* inside the cell and returns the enzyme to the E; state. Enzyme forms 
with occluded cations represent states of the enzyme with cations bound in the 
transport channel. The alternation between high and low affinities for Na*, K*, and 
ATP serves to tightly couple the hydrolysis of ATP and ion binding and transport. 


Na*,K*t-ATPase Is Inhibited by Cardiotonic Steroids Certain plant and animal 
steroids such as ouabain (Figure 9.50) specifically inhibit Na*,K*-ATPase and ion 
transport. These substances are traditionally referred to as cardiac glycosides or 
cardiotonic steroids, both names derived from the potent effects of these mole- 
cules on the heart. These molecules all possess a cis-configuration of the C-D ring 
junction, an unsaturated lactone ring (five- or six-membered) in the 6-configura- 
tion at C-17, and a B-OH at C-14. There may be one or more sugar residues at 
C-3. The sugars are not required for inhibition, but do contribute to water solu- 
bility of the molecule. Cardiotonic steroids bind exclusively to the extracellular 
surface of Na*,K*-ATPase when it is in the EsP state, forming a very stable 
E,-P(cardiotonic steroid) complex. 


2 K* 3 Na* ADP 


A ` \ A CENGAGENOW” ANIMATED FIGURE 9.49 A mechanism 
ED ATP K, < E) ATP = ® ATP Nas, -r Nag for Na*,K*-ATPase. The model assumes two principal 


conformations, E; and Ez. Binding of Na* ions to E; is fol- 
Nat lowed by phosphorylation and release of ADP. Na* ions 
ATP are transported and released, and K* ions are bound 








before dephosphorylation of the enzyme. Transport and 
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® HBO 2 Na* animated at www.cengage.com/login. 
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FIGURE 9.50 The structures of several cardiotonic 


steroids. The lactone rings are yellow. 


Digitoxigenin 





Medical researchers studying high blood pressure have consistently found that 
people with hypertension have high blood levels of an endogenous Na*,K*-ATPase 
inhibitor. In such patients, inhibition of the sodium pump in the cells lining the 
blood vessel wall results in accumulation of sodium and calcium in these cells and 
the narrowing of the vessels to create hypertension. An 8-year study aimed at the iso- 
lation and identification of the agent responsible for these effects by researchers at 
the University of Maryland Medical School and the Upjohn Laboratories in Michi- 
gan yielded a surprising result. Mass spectrometric analysis of compounds isolated 
from many hundreds of gallons of blood plasma has revealed that the hypertensive 
agent is ouabain itself or a closely related molecule! 


Calcium Transport: Ca2*-ATPase Calcium, an ion acting as a cellular signal in vir- 
tually all cells (see Chapter 32), plays a special role in muscles. It is the signal that 
stimulates muscles to contract (see Chapter 16). In the resting state, the levels of 
Ca?* near the muscle fibers are very low (approximately 0.1 uM), and nearly all of 
the calcium ion in muscles is sequestered inside a complex network of vesicles 
called the sarcoplasmic reticulum, or SR (see Figure 16.1). Nerve impulses induce 
the SR membrane to quickly release large amounts of Ca”*, with cytosolic levels ris- 
ing to approximately 10 uM. At these levels, Ca?* stimulates contraction. Relaxation 
of the muscle requires that cytosolic Ca?* levels be reduced to their resting values. 
This is accomplished by an ATP-driven Ca** transport protein known as the Ca?*- 
ATPase, which bears many similarities to the Na*,K*-ATPase. It has an a-subunit of 
the same approximate size, it forms a covalent E-P intermediate during ATP hy- 
drolysis, and its mechanism of ATP hydrolysis and ion transport is similar in many 
ways to that of the sodium pump. 

The structure of the Ca?*-ATPase includes a transmembrane (M) domain con- 
sisting of ten a-helical segments and a large cytoplasmic domain that itself consists 
of a nucleotide-binding (N) domain, a phosphorylation (P) domain, and an actua- 
tor (A) domain (Figure 9.51). 

The calcium transport cycle begins with binding of two Ca?* ions. Subsequent 
ATP binding causes a 90° rotation of N and a 30° rotation of A, thus joining all three 
cytoplasmic domains (N, A, and P), and pulling a transmembrane helix partly out of 
the membrane. Phosphorylation of Asp**!, dissociation of ADP, and conversion of 
the E-P state to ExP induce a 110° rotation of A and a rearrangement of the trans- 
membrane domain, which acts like a piston to release Ca?* inside the SR. A TGES 
sequence in A (residues 181 to 184) then guides nucleophilic attack of water on 
EP, releasing phosphate and restoring the original structures of both the trans- 
membrane and the cytoplasmic domains of the enzyme. 


The Gastric H*,K*-ATPase Production of protons is a fundamental activity of cel- 
lular metabolism, and proton production plays a special role in the stomach. The 
highly acidic environment of the stomach is essential for the digestion of food in all 
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CENGAGENOW” ACTIVE FIGURE 9.51 The transport 
cycle of the sarcoplasmic reticulum Ca?*-ATPase 
involves at least five different conformations of the pro- 
tein. The states shown here are E;-2Ca?* (pdb id = 
3B9B); E;-ATP (pdb id = 1SU4); E,-P-ADP (pdb id = 
115C); E2+P; (pdb id = 2ZBD); and E; (pdb id = 2EAR). 
Blue-shaded states in the reaction sequence correspond 


animals. The pH of the stomach fluid is normally 0.8 to 1. The pH of the parietal 
cells of the gastric mucosa in mammals is approximately 7.4. This represents a pH 
gradient across the mucosal cell membrane of 6.6, the largest known transmembrane to adjacent structures. Test yourself on the concepts 
gradient in eukaryotic cells. This enormous gradient must be maintained constantly in this figure at www.cengage.com/login. 

so that food can be digested in the stomach without damage to the cells and organs 

adjacent to the stomach. The gradient of H* is maintained by an H*,Kt-ATPase, 
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A DEEPER LOOK 


Cardiac Glycosides: Potent Drugs from Ancient Times 


The cardiac glycosides have a long and colorful history. Many 
species of plants producing these agents grow in tropical regions 
and have been used by natives in South America and Africa to pre- 
pare poisoned arrows used in fighting and hunting. Zulus in South 
Africa, for example, have used spears tipped with cardiac glycoside 
poisons. The sea onion, found commonly in southern Europe and 
northern Africa, was used by the Romans and the Egyptians as a 
cardiac stimulant, diuretic, and expectorant. The Chinese have 
long used a medicine made from the skins of certain toads for sim- 
ilar purposes. Cardiac glycosides are also found in several species 
of domestic plants, including the foxglove, lily of the valley, olean- 
der (figure part a), and milkweed plants. Monarch butterflies 
(figure part b) acquire these compounds by feeding on milkweed 
and then storing the cardiac glycosides in their exoskeletons. Car- 
diac glycosides deter predation of monarch butterflies by birds, 
which learn by experience not to feed on monarchs. Viceroy but- 
terflies (figure part c) mimic monarchs in overall appearance. Al- 


though viceroys contain no cardiac glycosides and are edible, they 
are avoided by birds that mistake them for monarchs. 

In 1785, the physician and botanist William Withering de- 
scribed the medicinal uses for agents derived from the foxglove 
plant. In modern times, digitalis (a preparation of dried leaves 
prepared from the foxglove, Digitalis purpurea) and other purified 
cardiotonic steroids have been used to increase the contractile 
force of heart muscle, to slow the rate of beating, and to restore 
normal function in hearts undergoing fibrillation (a condition in 
which heart valves do not open and close rhythmically but rather 
remain partially open, fluttering in an irregular and ineffective 
way). Inhibition of the cardiac sodium pump increases the intra- 
cellular Na* concentration, leading to stimulation of the Na*-Ca?* 
exchanger, which extrudes sodium in exchange for inward move- 
ment of calcium. Increased intracellular Ca®* stimulates muscle 
contraction. Careful use of digitalis drugs has substantial thera- 
peutic benefit for patients with heart problems. 
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(b) Monarch butterfly 


(a) Cardiac glycoside inhibitors of Na*,K*-ATPase are produced by many plants, including foxglove, lily of 
the valley, milkweed, and oleander (shown here). (b) The monarch butterfly, which concentrates cardiac 
glycosides in its exoskeleton, is shunned by predatory birds. (c) Predators also avoid the viceroy, even though 
it contains no cardiac glycosides, because it is similar in appearance to the monarch. 
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(c) Viceroy butterfly 








Gastric 
mucosal 
cell 





D 


` 
Stomach \ 

` 
out 





CENGAGENOW” ACTIVE FIGURE 9.52 The H+,K+-ATPase 
of gastric mucosal cells mediates proton transport into 
the stomach. Potassium ions are recycled by means of 
an associated Kt/CI~ cotransport system. The action of 
these two pumps results in net transport of H* and Cl~ 
into the stomach. Test yourself on the concepts in this 
figure at www.cengage.com/login. 


which uses the energy of hydrolysis of ATP to pump H* out of the mucosal cells and 
into the stomach interior in exchange for K* ions. This transport is electrically neu- 
tral, and the K* that is transported into the mucosal cell is subsequently pumped 
back out of the cell together with Cl” in a second electroneutral process (Figure 
9.52). Thus, the net transport effected by these two systems is the movement of HCl 
into the interior of the stomach. (Only a small amount of K* is needed, because it is 
recycled.) The H*,K*-ATPase bears many similarities to the plasma membrane 
Nat ,K*t-ATPase and the SR Ca?*-ATPase described earlier. It has a similar molecular 
weight, it forms an E-P intermediate, and many parts of its peptide sequence are ho- 
mologous with the Na*,K*-ATPase and Ca*t-ATPase. 


Bone Remodeling by Osteoclast Proton Pumps Other proton-translocating 
ATPases exist in eukaryotic and prokaryotic systems. Vacuolar ATPases (V-type 
ATPases) are found in vacuoles, lysosomes, endosomes, Golgi, chromaffin granules, 
and coated vesicles. Various H*-transporting ATPases occur in yeast and bacteria as 
well. H*-transporting ATPases found in osteoclasts (multinucleate cells that break 
down bone during normal bone remodeling) provide a source of circulating cal- 
cium for soft tissues such as nerves and muscles. About 5% of bone mass in the hu- 
man body undergoes remodeling at any given time. Once growth is complete, the 
body balances formation of new bone tissue by cells called osteoblasts with resorp- 
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tion of existing bone matrix by osteoclasts. Osteoclasts possess proton pumps— 
which are in fact V-type ATPases—on the portion of the plasma membrane that at- 
taches to the bone. This region of the osteoclast membrane is called the ruffled 
border. The osteoclast attaches to the bone in the manner of a cup turned upside 
down on a saucer (Figure 9.53), leaving an extracellular space between the bone 
surface and the cell. The H*-ATPases in the ruffled border pump protons into this 
space, creating an acidic solution that dissolves the bone mineral matrix. Bone min- 
eral consists mainly of poorly crystalline hydroxyapatite [Cao (PO4)6(OH)ə] with 
some carbonate (HCO;_) replacing OH™ or PO,’ in the crystal lattice. Transport 
of protons out of the osteoclasts lowers the pH of the extracellular space near the 
bone to about 4, dissolving the hydroxyapatite. 


ABC Transporters Use ATP to Drive Import and Export Functions 
and Provide Multidrug Resistance 


The word cell is from the Latin cella, meaning a “small room.” Cells, just like hu- 
mans, must keep their rooms neat and tidy, and they do this with special membrane 
transporters known as multidrug resistance (MDR) efflux pumps, often referred to 
as “molecular vacuum cleaners.” MDR pumps export cellular waste molecules and 
toxins, as well as drugs that find their way into cells in various ways. Bacteria also 
have influx pumps, which bring essential nutrients (for example vitamin By.) into 
the cell (Figure 9.54). At least five families of influx and efflux pumps are known, 
among them the ABC transporters. In eukaryotes, ABC transporters are problem- 
atic because they export potentially therapeutic drugs (Figure 9.55) from cancer 
cells, so chemotherapy regimens must be changed often to avoid rejection of the 
beneficial drugs. 

All ABC transporters consist of two transmembrane domains (TMDs), which 
form the transport pore, and two cytosolic nucleotide-binding domains (NBDs) that 
bind and hydrolyze ATP. The TMDs and NBDs are separate subunits (thus com- 
posing a tetramer) in bacterial ABC importers (Figure 9.56). Bacterial exporters, 
on the other hand, are homodimers, with each monomer made up of an N-termi- 
nal TMD and a C-terminal NBD. Eukaryotic ABC exporters are monomeric, with all 
four necessary domains in a single polypeptide chain. 

The NBDs of ABC transporters from nearly all sources are similar in size, se- 
quence, and structure. The TMDs, on the other hand, vary considerably in se- 
quence, architecture, and number of transmembrane helices. ABC exporters con- 
tain a conserved core of 12 transmembrane helices, whereas ABC importers can 
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CENGAGENOW”™ ANIMATED FIGURE 9.53 Proton pumps 
cluster on the ruffled border of osteoclast cells and 
function to pump protons into the space between the 
cell membrane and the bone surface. High proton con- 
centration in this space dissolves the mineral matrix of 
the bone. See this figure animated at www.cengage 
.com/login. 


FIGURE 9.54 Influx pumps in the inner membrane of 
Gram-negative bacteria bring nutrients into the cell, 
whereas efflux pumps export cellular waste products 
and toxins. (Adapted from Garmory, H. S. and Titball, R.W. 2004. 
ATP-binding cassette transporters are targets for the develop- 
ment of antibacterial vaccines and therapies. Infection and 
Immunity 72:6757-6763.) 
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FIGURE 9.55 Some of the cytotoxic cancer drugs that 
are transported by the MDR ATPase. 
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FIGURE 9.56 Several ABC transporters are shown in dif- 
ferent stages of their transport cycles. Left to right: 

pdb id = 1L7V, pdb id = 2019, pdb id = 2NQ2. MBP is a 
multidrug binding protein, which binds molecules to be 
transported and delivers them to the transport channel. 
It is shown bound to the transport channel in the mid- ATP 
dle structures. 
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have between 10 and 20 transmembrane helices. A variety of studies show that 
human MDR ATPases are similar to the Sav1866, an exporting ABC transporter 
from S. aureus, and Sav1866 is considered to be a good model for the architecture 
of all ABC exporters. 

The structures of several ABC transporters, in different stages of the transport 
cycle, provide a picture of how ATP binding and hydrolysis by the NBDs might be 
coupled to import and export of molecules (Figure 9.56). The TMDs can cycle from 
inward-facing to outward-facing conformations and back again, whereas the NBDs 
alternate between open and closed states. In all ABC transporters, a short “coupling 
helix” lies at the interface between each NBD and its corresponding TMD. Binding 
of ATP induces “closing,” or joining of the NBD domains, bringing the coupling he- 
lices 10 to 15 A closer to each other than in the ATP-free state. The merger of the 
coupling helices in turn triggers a flip-flop of the TMDs from the inward-facing to 
the outward-facing conformation. In this state, ABC exporters release bound drugs 
to the extracellular environment, whereas ABC importers accept substrate mole- 
cules from their associated substrate-binding proteins. Following ATP hydrolysis, re- 
lease of ADP and inorganic phosphate allows the TMD to revert to its inward-facing 
conformation, where importers can release their substrates into the cytosol and ex- 
porters can bind new substrates to be exported. 


9:9» How Are Certain Transport Processes Driven 
by Light Energy? 


As noted previously, certain biological transport processes are driven by light 
energy rather than by ATP. Two well-characterized systems are bacteriorho- 
dopsin, the light-driven H*-pump, and halorhodopsin, the light-driven C17 
pump, of Halobacterium halobium, an archaeon that thrives in high-salt media. 
H. halobium grows optimally at an NaCl concentration of 4.3 M. It was extensively 
characterized by Walther Stoeckenius, who found it growing prolifically in the 
salt pools near San Francisco Bay, where salt is commercially extracted from sea- 
water. H. halobium carries out normal respiration if oxygen and metabolic energy 
sources are plentiful. However, when these substrates are lacking, H. halobium 
survives by using bacteriorhodopsin to capture light energy. In oxygen- and nu- 
trient-deficient conditions, purple patches appear on the surface of H. halobium. 
These purple patches of membrane are 75% protein, the only protein being 
bacteriorhodopsin (bR). The purple color arises from a retinal molecule that is 
covalently bound in a Schiff base linkage with an e-NHp» group of Lys?!® on each 
bacteriorhodopsin protein (Figure 9.57). Bacteriorhodopsin is a 26-kD trans- 
membrane protein that packs so densely in the membrane that it naturally forms 
a two-dimensional crystal in the plane of the membrane. The retinal moiety lies 
parallel to the membrane plane, about 1 nm below the membrane’s outer sur- 
face (Figure 9.13). 


Bacteriorhodopsin Uses Light Energy to Drive Proton Transport 


Light energy drives transport of protons (H*) through bacteriorhodopsin, provid- 
ing energy for the bacterium in the form of a transmembrane proton gradient. Pro- 
tons hop from site to site across bacteriorhodopsin, just as a person crossing a creek 
would jump from one stepping stone to another. The stepping stones in rhodopsin 
are the carboxyl groups of Asp% and Asp” and the Schiff base nitrogen of the reti- 
nal chromophore (Figure 9.58). The aspartates are able to serve as stepping stones 
because they lie in a hydrophobic environment that makes their side-chain pK, val- 
ues very high (more than 11). Light absorption converts retinal from all-trans to the 
13-cis configuration, triggering conformation changes that induce pK, changes and 
thus facilitate H* transfers (between Asp®, the Schiff base, and Asp”) and net H* 
transport across the membrane. 
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FIGURE 9.57 The Schiff base linkage between the retinal 
chromophore and Lys?!°. 
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FIGURE 9.58 The mechanism of proton transport by 
bacteriorhodopsin. Asp® and Asp% on the third trans- 
membrane segment (C) and the Schiff base of bound 
retinal serve as stepping stones for protons driven 
across the membrane by light-induced conformation 
changes. The hydrophobic environments of Asp® and 
Asp” raise the pKa values of their side-chain carboxyl 
groups, making it possible for these carboxyls to accept 
protons as they are transported across the membrane. 
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Outer membrane 





Inner membrane 


FIGURE 9.59 A tripartite (three-part) complex of pro- 
teins comprises the large structure in E.coli that exports 
waste and toxin molecules. The transport pump is AcrB, 
embedded in the bacterial inner membrane. The rest of 
the channel is composed of TolC, embedded in the bac- 
terial outer membrane, and a ring of AcrA subunits, 
which links AcrB and ToIC. (Adapted from Lomovskaya, O, 
Zgurskaya, H. l., et al., 2007. Waltzing transporters and ‘the dance 
macabre’ between humans and bacteria. Nature Reviews Drug 
Discovery 6:56-65.) 
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9:10: How Is Secondary Active Transport Driven 
by lon Gradients? 


Na* and H* Drive Secondary Active Transport 


The gradients of H*, Na*, and other cations and anions established by ATPases and 
other energy sources can be used for secondary active transport of various sub- 
strates. The best-understood systems use Nat or H* gradients to transport amino 
acids and sugars in certain cells. Many of these systems operate as symports, with the 
ion and the transported amino acid or sugar moving in the same direction (that is, 
into the cell). In antiport processes, the ion and the other transported species move 
in opposite directions. (For example, the anion transporter of erythrocytes is an an- 
tiport.) Proton symport proteins are used by E. coli and other bacteria to accumu- 
late lactose, arabinose, ribose, and a variety of amino acids. E. coli also possesses 
Na*-symport systems for melibiose, as well as for glutamate and other amino acids. 


AcrB Is a Secondary Active Transport System 


The ABC transporters described in Section 9.8 are just one of five different families 
of multidrug resistance transporters. AcrB, the major MDR transporter in E. coli, is 
responsible for pumping a variety of molecules including drugs such as erythro- 
mycin, tetracycline, and the -lactams (for example, penicillin). AcrB is part of a 
large tripartite complex that bridges the £E. coli inner and outer membranes and 
spans the entire periplasmic space (Figure 9.59). AcrB works with its partners, AcrA 
and TolC, to transport drugs and other toxins from the cytoplasm across the entire 
cell envelope and into the extracellular medium. 

AcrB is a secondary active transport system and an H*-drug antiporter. As pro- 
tons flow spontaneously inward through AcrB in the £. coli inner membrane, drug 
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FIGURE 9.60 In the AcrB trimer, the three identical subunits adopt three different conformations. The “loose” 

L state (blue), the “tight”T state (yellow), and the “open” O (orange) state are indicated. Possible transport paths 
of drugs through the tunnels are shown in green. Tunnel 1 is lined with hydrophobic residues and is the likely 
point of entrance for drugs in the membrane bilayer. Tunnel 2 may serve either as an entrance port for water- 
soluble drugs or as an exit channel for nonsubstrates. Tunnel 3 is the exit pathway. Tunnels 1 and 2 converge 
at the hydrophobic substrate binding pocket, where minocyclin (an antibiotic similar to tetracycline) is bound 
in a hydrophobic pocket defined by phenylalanines 136, 178,610, 615,617, and 628; valines 139 and 612; 
isoleucines 277 and 626; and tyrosine 327. (Inset—all shown in spacefill. Minocyclin is shown in stick and wire- 
frame.) Panels A and B represent one step in a L-T-O (or T-L-O) transport cycle. (Image kindly provided by Klass 
Martinus Pos.) 
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FIGURE 9.61 A model for drug transport by AcrB involves three possible conformations—loose (L, blue), tight 
(T, green), and open (O, pink)—for each of the three identical monomer subunits of the complex. The lateral 
grooves in L and T indicate low affinity and high affinity binding of drugs, respectively. The circle in the O state 
indicates that there is no drug binding in this state. Drugs to be transported (such as acridine, shown here) 
bind first to the L state. A conformational change to the T state moves the drug deeper into the tunnel, and a 
second conformation change opens the tunnel to the opposite side of the membrane, followed by release of 
the drug molecule. Binding, transport, and release of H* drives the drug transport cycle. (Adapted from Seeger, M., 
Schiefner, A., et al., 2006. Structural asymmetry of AcrB trimer suggests a peristaltic pump mechanism. Science 313:1295-1298.) 





molecules are driven outward. AcrB is a homotrimer of large, 1100-residue sub- 
units. Remarkably, the three identical subunits adopt slightly different conforma- 
tions, denoted loose (L), tight (T), and open (O). Transported drug molecules en- 
ter AcrB through a tunnel that starts in the periplasmic space, about 15 A above the 
inner membrane, and ends at the trimer center (Figure 9.60). The three confor- 
mations of the AcrB monomers are three consecutive states of a transport cycle. As 
each monomer cycles through the L, T, and O states, drug molecules enter the tun- 
nel, are bound, and then are exported (Figure 9.61). Poetically, this three-step ro- 
tation has been likened to a Viennese waltz, and AcrB has been dubbed a “waltzing 
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pump” by Olga Lomonskaya and her co-workers. 


SUMMARY 


Membranes constitute the boundaries of cells and intracellular organ- 
elles, and they provide an environment where many important biologi- 
cal reactions and processes occur. Membranes have proteins that medi- 
ate and regulate the transport of metabolites, macromolecules, and ions. 


9.1 What Are the Chemical and Physical Properties of Membranes? 
Amphipathic lipids spontaneously form a variety of structures when 
added to aqueous solution, including micelles and lipid bilayers. The 
fluid mosaic model for membrane structure suggests that membranes 
are dynamic structures composed of proteins and phospholipids. In this 
model, the phospholipid bilayer is a fluid matrix, in essence, a two- 
dimensional solvent for proteins. 


9.2 What Are the Structure and Chemistry of Membrane Proteins? 
Peripheral proteins interact with the membrane mainly through elec- 
trostatic and hydrogen-bonding interactions with integral proteins. In- 
tegral proteins are those that are strongly associated with the lipid bi- 
layer, with a portion of the protein embedded in, or extending all the 
way across, the lipid bilayer. Another class of proteins not anticipated by 
Singer and Nicolson, the lipid-anchored proteins, associate with mem- 
branes by means of a variety of covalently linked lipid anchors. 


9.3 How Are Biological Membranes Organized? Biological mem- 
branes are asymmetric structures, and the lipids and proteins of mem- 
branes exhibit both lateral and transverse asymmetries. The two mono- 
layers of the lipid bilayer have different lipid compositions and different 
complements of proteins. Loss of transverse lipid asymmetry has dra- 
matic (and often severe) consequences for cells and organisms. The 
membrane composition is also different from place to place across the 


plane of the membrane. Clustering of lipids and proteins in specific 
ways serves the functional needs of the cell. 


9.4 What Are the Dynamic Processes That Modulate Membrane Func- 
tion? Motions of lipids and proteins in membranes underlie many cell 
functions. Lipid bilayers typically undergo gel-to-liquid crystalline phase 
transitions, with the transition temperature being dependent upon bi- 
layer composition. Lipids and proteins undergo a variety of movements in 
membranes, including bond vibrations, rotations, and lateral and trans- 
verse motion, with a range of characteristic times. These motions modu- 
late a variety of membrane processes, including lipid phase transitions, 
raft formation, membrane curvature, membrane remodeling, caveolae 
formation, and membrane fusion events that regulate vesicle trafficking. 


9.5 How Does Transport Occur Across Biological Membranes? In 
most biological transport processes, the molecule or ion transported is 
water soluble, yet moves across the hydrophobic, impermeable lipid 
membrane at a rate high enough to serve the metabolic and physiolog- 
ical needs of the cell. Most of these processes occur with the assistance 
of specific transport protein. The transported species either diffuses 
through a channel-forming protein or is carried by a carrier protein. 
Transport proteins are all classed as integral membrane proteins. From 
a thermodynamic and kinetic perspective, there are only three types of 
membrane transport processes: passive diffusion, facilitated diffusion, and 
active transport. 


9.6 What Is Passive Diffusion? In passive diffusion, the transported 
species moves across the membrane in the thermodynamically favored 
direction without the help of any specific transport system/molecule. 
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For an uncharged molecule, passive diffusion is an entropic process, in 
which movement of molecules across the membrane proceeds until the 
concentration of the substance on both sides of the membrane is the 
same. The passive transport of charged species depends on their elec- 
trochemical potentials. 


9.7 How Does Facilitated Diffusion Occur? Certain metabolites and 
ions move across biological membrane more readily than can be ex- 
plained by passive diffusion alone. In all such cases, a protein that binds 
the transported species is said to facilitate its transport. Facilitated dif- 
fusion rates display saturation behavior similar to that observed with 
substrate binding by enzymes. 


9.8 How Does Energy Input Drive Active Transport Processes? Active 
transport involves the movement of a given species against its thermody- 
namic potential. Such systems require energy input and are referred to as 
active transport systems. Active transport may be driven by the energy of 
ATP hydrolysis, by light energy, or by the potential stored in ion gradients. 
The original ion gradient arises from a primary active transport process, 
and the transport that depends on the ion gradient for its energy input is 
referred to as a secondary active transport process. When transport results 


in a net movement of electric charge across the membrane, it is referred 
to as an electrogenic transport process. Ifno net movement of charge oc- 
curs during transport, the process is electrically neutral. The Na*,K*- 
ATPase of animal plasma membranes, the Ca**-ATPase of muscle sar- 
coplasmic reticulum, the gastric ATPase, the osteoclast proton pump, and 
the multidrug transporter all use the free energy of hydrolysis of ATP to 
drive transport processes. 


9.9 How Are Certain Transport Processes Driven by Light Energy? 
Light energy drives a series of conformation changes in the transmem- 
brane protein bacteriorhodopsin that drive proton transport. The 
transport involves the cis—trans isomerization of retinal in Schiff base 
linkage to the protein via a lysine residue. 


9.10 How Is Secondary Active Transport Driven by lon Gradients? The 
gradients of H*, Na*, and other cations and anions established by 
ATPases and other energy sources can be used for secondary active trans- 
port of various substrates. Many of these systems operate as symports, with 
the ion and the transported amino acid or sugar moving in the same di- 
rection (that is, into the cell). In antiport processes, the ion and the other 
transported species move in opposite directions. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. In problem 1 (b) in Chapter 8 (page 239), you were asked to draw 
all the possible phosphatidylserine isomers that can be formed from 
palmitic and linolenic acids. Which of the PS isomers are not likely 
to be found in biological membranes? 


2. The purple patches of the Halobacterium halobium membrane, which 
contain the protein bacteriorhodopsin, are approximately 75% pro- 
tein and 25% lipid. If the protein molecular weight is 26,000 and an 
average phospholipid has a molecular weight of 800, calculate the 
phospholipid-to-protein mole ratio. 


3. Sucrose gradients for separation of membrane proteins must be 
able to separate proteins and protein—lipid complexes having a 
wide range of densities, typically 1.00 to 1.35 g/mL. 

a. Consult reference books (such as the CRC Handbook of Biochem- 
istry) and plot the density of sucrose solutions versus percent 
sucrose by weight (g sucrose per 100 g solution), and versus per- 
cent by volume (g sucrose per 100 mL solution). Why is one plot 
linear and the other plot curved? 

b. What would be a suitable range of sucrose concentrations for sep- 
aration of three membrane-derived protein—lipid complexes with 
densities of 1.03, 1.07, and 1.08 g/mL? 


4. Phospholipid lateral motion in membranes is characterized by a 
diffusion coefficient of about 1 X 1078 cm?/sec. The distance trav- 
eled in two dimensions (across the membrane) in a given time 
is r= (4D#)!/", where ris the distance traveled in centimeters, D is 
the diffusion coefficient, and tis the time during which diffusion 
occurs. Calculate the distance traveled by a phospholipid across a 
bilayer in 10 msec (milliseconds). 


5. Protein lateral motion is much slower than that of lipids because 
proteins are larger than lipids. Also, some membrane proteins can 
diffuse freely through the membrane, whereas others are bound or 
anchored to other protein structures in the membrane. The diffu- 
sion constant for the membrane protein fibronectin is approxi- 
mately 0.7 X 10°! cm?/sec, whereas that for rhodopsin is about 
3 X 107° cm?/sec. 

a. Calculate the distance traversed by each of these proteins in 
10 msec. 

b. What could you surmise about the interactions of these proteins 
with other membrane components? 


6. Discuss the effects on the lipid phase transition of pure dimyristoyl 
phosphatidylcholine vesicles of added (a) divalent cations, (b) cho- 


lesterol, (c) distearoyl phosphatidylserine, (d) dioleoyl phospha- 
tidylcholine, and (e) integral membrane proteins. 
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Calculate the free energy difference at 25°C due to a galactose gra- 
dient across a membrane, if the concentration on side 1 is 2 mM 
and the concentration on side 2 is 10 mM. 


= 


Consider a phospholipid vesicle containing 10 mM Na* ions. The 
vesicle is bathed in a solution that contains 52 mM Na* ions, and 
the electrical potential difference across the vesicle membrane 
Aw = outside — Winside = —30 mV. What is the electrochemical po- 
tential at 25°C for Na* ions? 

9. Transport of histidine across a cell membrane was measured at sev- 
eral histidine concentrations: 


[Histidine], uM 





Transport, wzmol/min 


2.5 42.5 
7 119 
16 272 
31 527 
72 1220 
Does this transport operate by passive diffusion or by facilitated 


diffusion? 
10. (Integrates with Chapter 3.) Fructose is present outside a cell at 
1 uM concentration. An active transport system in the plasma 
membrane transports fructose into this cell, using the free energy 
of ATP hydrolysis to drive fructose uptake. What is the highest 
intracellular concentration of fructose that this transport system 
can generate? Assume that one fructose is transported per ATP 
hydrolyzed; that ATP is hydrolyzed on the intracellular surface of 
the membrane; and that the concentrations of ATP, ADP, and P; 
are 3 mM, 1 mM, and 0.5 mM, respectively. T = 298 K. (Hint: Re- 
fer to Chapter 3 to recall the effects of concentration on free 
energy of ATP hydrolysis.) 


11. In this chapter, we have examined coupled transport systems that 
rely on ATP hydrolysis, on primary gradients of Na* or H*, and on 
phosphotransferase systems. Suppose you have just discovered an 
unusual strain of bacteria that transports rhamnose across its 
plasma membrane. Suggest experiments that would test whether it 


was linked to any of these other transport systems. 


12. Which of the following peptides would be the most likely to acquire 
an N-terminal myristoyl lipid anchor? 
a. VLIHGLEQN 
b. THISISIT 


c. RIGHTHERE 
d. MEMEME 
e. GETREAL 


13. Which of the following peptides would be the most likely to acquire 

a prenyl anchor? 

a. RIGHTCALL 

b. PICKME 

c. ICANTICANT 

d. AINTMEPICKA 
e. None of the above 

14. What would the hydropathy plot of a soluble protein look like, com- 
pared to those in Figure 9.14? Find out by creating a hydropathy 
plot at www.expasy.ch. In the search box at the top of the page, type 
in “bovine pancreatic ribonuclease” and click “Go.” The search en- 
gine should yield UniProtKB/Swiss-Prot entry P61823. Scroll to the 
bottom of the page and click “ProtScale” under Sequence Analysis 
Tools. On the next page, select the radio button for “Hphob. / Kyte 
and Doolittle,” then scroll to the bottom of the page, and click 
“Submit.” On the next page, scroll to the bottom of the page and 
click “Submit” again. At the bottom of the next page, after a few sec- 
onds, you should see a hydropathy plot. How does the plot for ri- 
bonuclease compare to those in Figure 9.14? You should see a large 
positive peak at the left side of the plot. This is the signal sequence 
portion of the polypeptide. You can read about signal sequences on 
page 994. 

15. Proline residues are almost never found in short a-helices; nearly all 
transmembrane a-helices that contain proline are long ones (about 
20 residues). Suggest a reason for this observation. 

16. As described in this chapter, proline introduces kinks in transmem- 
brane a-helices. What are the molecular details of the kink, and why 
does it form? A good reference for this question is von Heijne, G., 
1991. Proline kinks in transmembrane a-helices. Journal of Molecu- 
lar Biology 218:499-503. Another is Barlow, D. J., and Thornton, 
J. M., 1988. Helix geometry in proteins. Jowrnal of Molecular Biology 
201:601-619. 

17. Compare the porin proteins, which have transmembrane pores 
constructed from £-barrels, with the Wza protein, which has a trans- 
membrane pore constructed from a ring of a-helices. How many 
amino acids are required to form the 6-barrel of a porin? How many 
would be required to form the same-sized pore from a-helices? 
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18. The hop-diffusion model of Akihiro Kusumi suggests that lipid mol- 
ecules in natural membranes diffuse within “fenced” areas before 
hopping the molecular fence to an adjacent area. Study Figure 9.29 
and estimate the number of phospholipid molecules that would be 
found in a typical fenced area of local diffusion. For the purpose of 
calculations, you can assume that the surface area of a typical phos- 
pholipid is about 60 A?. 

19. What are the energetic consequences of snorkeling for a charged 
amino acid? Consider the lysine residue shown in Figure 9.16. If the 
lysine side chain was reoriented to extend into the center of the 
membrane, how far from the center would the positive charge of 
the lysine be? The total height of the peak for the lysine plot in Fig- 
ure 9.15 is about 4k7; where kis Boltzmann’s constant. If the lysine 
side chain in Figure 9.16 was reoriented to face the membrane cen- 
ter, how much would its energy increase? How does this value com- 
pare with the classical value for the average translational kinetic en- 
ergy of a molecule in an ideal gas (3/2k7)? 

20. As described in the text, the pK, values of Asp% and Asp® of bacte- 
riorhodopsin are shifted to high values (more than 11) because of 
the hydrophobic environment surrounding these residues. Why is 
this so? What would you expect the dissociation behavior of aspar- 
tate carboxyl groups to be in a hydrophobic environment? 

21. Extending the discussion from problem 20, how would a hydro- 
phobic environment affect the dissociation behavior of the side 
chains of lysine and arginine residues in a protein? Why? 

22. In the description of the mechanism of proton transport by bacte- 
riorhodopsin, we find that light-driven conformation changes pro- 
mote transmembrane proton transport. Suggest at least one reason 
for this behavior. In molecular terms, how could a conformation 
change facilitate proton transport? 


Preparing for the MCAT Exam 
23. Singer and Nicolson’s fluid mosaic model of membrane structure 
presumed all of the following statements to be true EXCEPT: 
a. The phospholipid bilayer is a fluid matrix. 
b. Proteins can be anchored to the membrane by covalently linked 
lipid chains. 
c. Proteins can move laterally across a membrane. 
d. Membranes should be about 5 nm thick. 
e. Transverse motion of lipid molecules can occur occasionally. 
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Nucleotides 
and Nucleic Acids 





ESSENTIAL QUESTIONS 


Nucleotides and nucleic acids are compounds containing nitrogen bases (aromatic 
cyclic structures possessing nitrogen atoms) as part of their structure. Nucleotides 

are essential to cellular metabolism, and nucleic acids are the molecules of genetic 
information storage and expression. 


What are the structures of the nucleotides? How are nucleotides joined to- 
gether to form nucleic acids? How is information stored in nucleic acids? What 
are the biological functions of nucleotides and nucleic acids? 


Nucleotides are biological molecules that possess a heterocyclic nitrogenous 
base, a five-carbon sugar (pentose), and phosphate as principal components of 
their structure. The biochemical roles of nucleotides are numerous; they partic- 
ipate as essential intermediates in virtually all aspects of cellular metabolism. 
Serving an even more central biological purpose are the nucleic acids, the ele- 
ments of heredity and the agents of genetic information transfer. Just as proteins 
are linear polymers of amino acids, nucleic acids are linear polymers of nu- 
cleotides. Like the letters in this sentence, the orderly sequence of nucleotide 
residues in a nucleic acid can encode information. The two basic kinds of nucleic 
acids are deoxyribonucleic acid (DNA) and ribonucleic acid (RNA). The five- 
carbon sugar in DNA is 2-deoxyribose; in RNA, it is ribose. (See Chapter 7 for a 
detailed discussion of sugars and other carbohydrates.) DNA is the repository of 
genetic information in cells, whereas RNA serves in the expression of this infor- 
mation through the processes of transcription and translation (Figure 10.1). An 
interesting exception to this rule is that some viruses have their genetic infor- 
mation stored as RNA. 

This chapter describes the chemistry of nucleotides and the major classes of nu- 
cleic acids. Chapter 11 presents methods for determination of nucleic acid primary 
structure (nucleic acid sequencing) and describes the higher orders of nucleic acid 
structure. Chapter 12 introduces the molecular biology of recombinant DNA: the con- 
struction and uses of novel DNA molecules assembled by combining segments from 
different DNA molecules. 


tOst What Are the Structure and Chemistry 
of Nitrogenous Bases? 


The bases of nucleotides and nucleic acids are derivatives of either pyrimidine or 
purine. Pyrimidines are six-membered heterocyclic aromatic rings containing two 
nitrogen atoms (Figure 10.2a). The atoms are numbered in a clockwise fashion, as 
shown in Figure 10.2. The purine ring system consists of two rings of atoms: one re- 
sembling the pyrimidine ring and another resembling the imidazole ring (Figure 
10.2b). The nine atoms in this fused ring system are numbered according to the 
convention shown. 

The pyrimidine ring system is planar, whereas the purine system deviates some- 
what from planarity in having a slight pucker between its imidazole and pyrimidine 
portions. Both are relatively insoluble in water, as might be expected from their pro- 
nounced aromatic character. 
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Francis Crick (right) and James Watson (left) point out 
features of their model for the structure of DNA. 





We have discovered the secret of life! 


Proclamation by Francis H. C. Crick to patrons 
of the Eagle, a pub in Cambridge, 
England (1953) 
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FIGURE 10.1 The fundamental process of information 


transfer in cells. 
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i. 
N 
1 
The pyrimidine ring The purine ring system 


FIGURE 10.2 (a) The pyrimidine ring system; by conven- 
tion, atoms are numbered as indicated. (b) The purine 


ring system, atoms numbered as shown. 





(1) Replication 
DNA replication yields two 
DNA molecules identical to the 
original one, ensuring transmission 
of genetic information to daughter 
cells with exceptional fidelity. 


e Transcription 
The sequence of bases in DNA is 
recorded as a sequence of 
complementary bases in a single- 
stranded mRNA molecule. 





mRNA © 









Translation (3) Translation 
Three-base codons on the mRNA 
corresponding to specific amino acids 
direct the sequence of building a 
protein. These codons are recognized 
by tRNAs (transfer RNAs) carrying the 
appropriate amino acids. Ribosomes 

Attached are the “machinery” for protein 


amino acid synthesis. 


mRNA 


Growing 
peptide chain 


Protein &\p 


Three Pyrimidines and Two Purines Are Commonly Found in Cells 


The common naturally occurring pyrimidines are cytosine, uracil, and thymine 
(5-methyluracil) (Figure 10.3). Cytosine and thymine are the pyrimidines typically 
found in DNA, whereas cytosine and uracil are common in RNA. Note that the 
5-methyl group of thymine is the only thing that distinguishes it from uracil. Vari- 
ous pyrimidine derivatives, such as dihydrouracil, are present as minor constituents 
in certain RNA molecules. 

Adenine (6-amino purine) and guanine (2-amino-6-oxy purine), the two common 
purines, are found in both DNA and RNA (Figure 10.4). Other naturally occurring 
purine derivatives include hypoxanthine, xanthine, and uric acid (Figure 10.5). 
Hypoxanthine and xanthine are found only rarely as constituents of nucleic acids. 
Uric acid, the most oxidized state for a purine derivative, is never found in nucleic 


acids. 
NH» Oo (0) 
O O O 
| | | 
H H H 


Cytosine Uracil Thymine 
(2-oxy-4-amino (2-oxy-4-oxy (2-oxy-4-oxy 
pyrimidine) pyrimidine) 5-methyl pyrimidine) 


FIGURE 10.3 The common pyrimidine bases—cytosine, uracil, and thymine—in the tautomeric forms predom- 
inant at pH 7. 
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NH, O 
H J 
| mo | 
H H 
Adenine Guanine 
(6-amino purine) (2-amino-6-oxy purine) 


FIGURE 10.4 The common purine bases—adenine and guanine—in the tautomeric forms predominant at pH 7. 


The Properties of Pyrimidines and Purines Can Be Traced to Their 
Electron-Rich Nature 


The aromaticity of the pyrimidine and purine ring systems and the electron-rich 
nature of their carbonyl and ring nitrogen substituents endow them with the ca- 
pacity to undergo keto—enol tautomeric shifts. That is, pyrimidines and purines ex- 
ist as tautomeric pairs, as shown in Figure 10.6 for uracil and Figure 10.7 for gua- 
nine. The keto tautomers of uracil, thymine, and guanine vastly predominate at 
neutral pH. In other words, pK, values for ring nitrogen atoms 1 and 3 in uracil 
(Figure 10.6) are greater than 8 (the pK, value for N-3 is 9.5). In contrast, the enol 
form of cytosine predominates at pH 7 and the pK, value for N-3 in this pyrimidine 
is 4.5. Similarly, for guanine (Figure 10.7), the pK, value is 9.4 for N-1 and less than 
5 for N-3. These pK, values specify whether protons are associated with the various 
ring nitrogens at neutral pH. As such, they are important in determining whether 
these nitrogens serve as H-bond donors or acceptors. Hydrogen bonding between 
purine and pyrimidine bases is fundamental to the biological functions of nucleic 
acids, as in the formation of the double-helix structure of DNA (see Section 10.5). 
The important functional groups participating in H-bond formation are the amino 
groups of cytosine, adenine, and guanine; the ring nitrogens at position 3 of pyrim- 
idines and position 1 of purines; and the strongly electronegative oxygen atoms at- 
tached at position 4 of uracil and thymine, position 2 of cytosine, and position 6 of 
guanine (see Figure 10.17). 

Another property of pyrimidines and purines is their strong absorbance of ul- 
traviolet (UV) light, which is also a consequence of the aromaticity of their hete- 
rocyclic ring structures. Figure 10.8 shows characteristic absorption spectra of 





FIGURE 10.6 The keto—enol tautomerization of uracil. 





FIGURE 10.7 The tautomerization of the purine guanine. 





FIGURE 10.5 Other naturally occurring purine 
derivatives—hypoxanthine, xanthine, and uric acid. 
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FIGURE 10.8 The UV absorption spectra of the common 
ribonucleotides. 
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several of the common bases of nucleic acids—adenine, uracil, cytosine, and 
guanine—in their nucleotide forms: AMP, UMP, CMP, and GMP (see Section 10.3). 
This property is particularly useful in quantitative and qualitative analysis of nu- 
cleotides and nucleic acids. 


10:27 What Are Nucleosides? 


Nucleosides are compounds formed when a base is linked to a sugar. The sugars of nu- 
cleosides are pentoses (five-carbon sugars, see Chapter 7). Ribonucleosides contain 
the pentose D-ribose, whereas 2-deoxy-D-ribose is found in deoxyribonucleosides. 
In both instances, the pentose is in the five-membered ring form furanose: 
D-ribofuranose for ribonucleosides and 2-deoxy-D-ribofuranose for deoxyribonucleo- 
sides (Figure 10.9). In nucleosides, these ribofuranose atoms are numbered as 1’, 2’, 
3', and so on to distinguish them from the ring atoms of the nitrogenous bases. The 
base is linked to the sugar via a glycosidic bond. Glycosidic bonds in nucleosides (and 
nucleotides, see following discussion) are always of the 6-configuration. Nucleosides 
are named by adding the ending -idine to the root name of a pyrimidine or -osine to the 
root name of a purine. The common nucleosides are thus cytidine, uridine, thymidine, 





HUMAN BIOCHEMISTRY 





Adenosine: A Nucleoside with Physiological Activity 


For the most part, nucleosides have no biological role other than 
to serve as component parts of nucleotides. Adenosine is a rare 
exception. In mammals, adenosine functions as an autacoid, or 
“local hormone,” and as a neuromodulator. This nucleoside cir- 
culates in the bloodstream, acting locally on specific cells to 
influence such diverse physiological phenomena as blood vessel 
dilation, smooth muscle contraction, neuronal discharge, neuro- 
transmitter release, and metabolism of fat. For example, when 
muscles work hard, they release adenosine, causing the sur- 
rounding blood vessels to dilate, which in turn increases the flow 
of blood and its delivery of Os and nutrients to the muscles. In a 
different autacoid role, adenosine acts in regulating heartbeat. 
The natural rhythm of the heart is controlled by a pacemaker, the 
sinoatrial node, which cyclically sends a wave of electrical excita- 
tion to the heart muscles. By blocking the flow of electrical cur- 
rent, adenosine slows the heart rate. Supraventricular tachycardia is 
a heart condition characterized by a rapid heartbeat. Intravenous 
injection of adenosine causes a momentary interruption of the 
rapid cycle of contraction and restores a normal heart rate. 
Adenosine is licensed and marketed as Adenocard to treat 
supraventricular tachycardia. 

In addition, adenosine is implicated in sleep regulation. Dur- 
ing periods of extended wakefulness, extracellular adenosine lev- 


els rise as a result of metabolic activity in the brain, and this in- 
crease promotes sleepiness. During sleep, adenosine levels fall. 
Caffeine promotes wakefulness by blocking the interaction of ex- 
tracellular adenosine with its neuronal receptors.* 


*Porrka-Heiskanen, T., et al., 1997. Adenosine: A mediator of the sleep- 
inducing effects of prolonged wakefulness. Science 276:1265-1268; and 
Vaugeois, J-M., 2002. Signal transduction: Positive feedback from coffee. 
Nature 418:734-736. 


O m 
HC 
nN N 
A? 
O N N 


CH3 
Caffeine 


A Caffeine is an alkaloid, a term used to define naturally occurring 
nitrogenous molecules that have pharmacological effects. Alkaloids are 
classified according to their metabolic precursors, so caffeine is a 
purine alkaloid. 
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adenosine, and guanosine (Figure 10.10). Nucleosides are more water soluble than the 
free bases, because of the hydrophilicity of the pentose. 


10:3» What Are the Structure and Chemistry of Nucleotides? 


A nucleotide results when phosphoric acid is esterified to a sugar —OH group of 
a nucleoside. The nucleoside ribose ring has three —OH groups available for 
esterification, at C-2', C-3', and C-5’ (although 2’-deoxyribose has only two). The 
vast majority of monomeric nucleotides in the cell are ribonucleotides having 
5'-phosphate groups. Figure 10.11 shows the structures of the common four ribo- 
nucleotides, whose formal names are adenosine 5'-monophosphate, guanosine 
5’-monophosphate, cytidine 5'-monophosphate, and uridine 5'-monophosphate. 
These compounds are more often referred to by their abbreviations: 5’-AMP, 
5’-GMP, 5’-CMP, and 5’-UMP, or even more simply as AMP, GMP, CMP, 
and UMP. Because the pK, value for the first dissociation of a proton from the 
phosphoric acid moiety is 1.0 or less, the nucleotides have acidic properties. This 
acidity is implicit in the other names by which these substances are known— 
adenylic acid, guanylic acid, cytidylic acid, and uridylic acid. The pK, value for 
the second dissociation, pK, is about 6.0, so at neutral pH or above, the net 
charge on a nucleoside monophosphate is —2. Nucleic acids, which are polymers 
of nucleoside monophosphates, derive their name from the acidity of these phos- 
phate groups. 


FIGURE 10.9 Furanose structures—ribose and 
deoxyribose. 


FIGURE 10.10 The common ribonucleosides—cytidine, 
uridine, adenosine, and guanosine—and the less- 
common inosine. (Purine nucleosides and nucleotides 
usually adopt the anti conformation, where the purine 
ring is not above the ribose, as it would be in the syn 
conformation. Pyrimidines are always anti, never syn, 
because the 2-O atom of pyrimidines sterically hinders 
the ring from a position above the ribose.) 
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(or CMP or cytidylic acid) (or UMP or uridylic acid) 3'-AMP 
FIGURE 10.11 Structures of the four common ribonucleotides—AMP GMP CMP and UMP Also shown is the 
nucleotide 3’-AMP. 
Cyclic Nucleotides Are Cyclic Phosphodiesters 
Nucleoside monophosphates in which the phosphoric acid is esterified to two of the 
available ribose hydroxyl groups (Figure 10.12) are found in all cells. Forming two 
such ester linkages with one phosphate results in a cyclic phosphodiester structure. 
3’,5’-cyclic AMP, often abbreviated cAMP, and its guanine analog 3’,5’-cyclic GMP, 
or cGMP, are important regulators of cellular metabolism (see Parts 3 and 4). 
NH, Nucleoside Diphosphates and Triphosphates Are Nucleotides 
SN with Two or Three Phosphate Groups 
P Additional phosphate groups can be linked to the phosphoryl group of a 
NI nucleotide through the formation of phosphoric anhydride linkages, as shown 
in Figure 10.13. Addition of a second phosphate to AMP creates adenosine 
5'-diphosphate, or ADP, and adding a third yields adenosine 5’-triphosphate, or 
ATP. The respective phosphate groups are designated by the Greek letters a, 
B, and y, starting with the a-phosphate as the one linked directly to the pentose. 
The abbreviations GTP, CTP, and UTP represent the other corresponding nucle- 
oside 5'-triphosphates. Like the nucleoside 5’-monophosphates, the nucleoside 
o 5'-diphosphates and 5'-triphosphates all occur in the free state in the cell, as do 
A their deoxyribonucleoside phosphate counterparts, represented as dAMP, dADP, 
N and dATP; dGMP, dGDP, and dGTP; dCMP, dCDP, and dCTP; dUMP, dUDP, and 
dUTP; and dTMP, dTDP, and dTTP. 
N NH, 





3',5'-Cyclic GMP 


FIGURE 10.12 The cyclic nucleotides cAMP and cGMP. 


NDPs and NTPs Are Polyprotic Acids 


Nucleoside 5’-diphosphates (NDPs) and nucleoside 5'-triphosphates (NTPs) are 
relatively strong polyprotic acids in that they dissociate three and four protons, re- 
spectively, from their phosphoric acid groups. The resulting phosphate anions on 
NDPs and NTPs form stable complexes with divalent cations such as Mg** and Ca?*. 
Because Mg?" is present at high concentrations (as much as 40 mM) intracellularly, 
NDPs and NTPs occur primarily as Mg?* complexes in the cell. 





10.4 What Are Nucleic Acids? 
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FIGURE 10.13 Formation of ADP and ATP by the successive addition of phosphate groups via phosphoric 
anhydride linkages. Note that the reaction is a dehydration synthesis reaction. 


Nucleoside 5'-Triphosphates Are Carriers of Chemical Energy 


Nucleoside 5’-triphosphates are indispensable agents in metabolism because the 
phosphoric anhydride bonds they possess are a prime source of chemical energy to 
do biological work. 

Virtually all of the biochemical reactions of nucleotides involve either phosphate 
or pyrophosphate group transfer: the release of a phosphoryl group from an NTP to 
give an NDP, the release of a pyrophosphoryl group to give an NMP unit, or the ac- 
ceptance of a phosphoryl group by an NMP or an NDP to give an NDP or an NTP 
(Figure 10.14). The pentose and the base are not directly involved in this chemistry. 
A “division of labor” directs ATP to serve as the primary nucleotide in central path- 
ways of energy metabolism, whereas GTP is used to drive protein synthesis. Thus, 
the various nucleotides are channeled in appropriate metabolic directions through 
specific recognition of the base of the nucleotide. The bases of nucleotides never 
participate directly in the covalent bond chemistry that goes on. 


10:4: What Are Nucleic Acids? 


Nucleic acids are polynucleotides: linear polymers of nucleotides linked 3’ to 5’ by 
phosphodiester bridges (Figure 10.15). They are formed as 5'-nucleoside mono- 
phosphates are successively added to the 3'-OH group of the preceding nucleotide, 
a process that gives the polymer a directional sense. Polymers of ribonucleotides are 
named ribonucleic acid, or RNA. Deoxyribonucleotide polymers are called deoxy- 
ribonucleic acid, or DNA. Because C-1' and C-4' in deoxyribonucleotides are in- 
volved in furanose ring formation and because there is no 2’-OH, only the 3’- and 
5'-hydroxyl groups are available for internucleotide phosphodiester bonds. In the 
case of DNA, a polynucleotide chain may contain hundreds of millions of nu- 
cleotide units. The convention in all notations of nucleic acid structure is to read the polynu- 
cleotide chain from the 5’-end of the polymer to the 3’-end. Note that this reading direction 
actually passes through each phosphodiester from 3’ to 5’ (Figure 10.15). A repeti- 
tious uniformity exists in the covalent backbone of polynucleotides. 
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FIGURE 10.14 Phosphoryl, pyrophos- 
phoryl, and nucleotidyl group transfer, 
the major biochemical reactions of NTP 
nucleotides. 
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TO Adenine 
FIGURE 10.15 3’, 5’-phosphodiester bridges link nucleo- 3 a 3 
tides together to form polynucleotide chains. The 
5’-ends of the chains are at the top; the 3’-ends are at | 
the bottom. 


etc. 
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The Base Sequence of a Nucleic Acid Is Its Distinctive Characteristic 


The only significant variation that commonly occurs in the chemical structure of nu- 
cleic acids is the nature of the base at each nucleotide position. These bases are not 
part of the sugar—phosphate backbone but instead serve as distinctive side chains, 
much like the R groups of amino acids along a polypeptide backbone. They give the 
polymer its unique identity. A simple notation for nucleic acid structures is merely to 
list the order of bases in the polynucleotide using single capital letters—A, G, C, and 
U (or T). Occasionally, a lowercase “p” is written between each successive base to indi- 
cate the phosphodiester bridge, as in GpApCpGpUpA. 

To distinguish between RNA and DNA sequences, DNA sequences may be pre- 
ceded by a lowercase “d” to denote deoxy, as in d-GACGTA. From a simple string of 
letters such as this, any biochemistry student should be able to draw the unique 
chemical structure of, for example, a pentanucleotide, even though it may contain 
more than 200 atoms. 


10:5 What Are the Different Classes of Nucleic Acids? 


The two major classes of nucleic acids are DNA and RNA. DNA has only one bio- 
logical role, but it is the more central one. The information to make all the func- 
tional macromolecules of the cell (even DNA itself) is preserved in DNA and ac- 
cessed through transcription of the information into RNA copies. Coincident with 
DNA’s singular purpose, simple life forms such as viruses or bacteria usually contain 
only a single DNA molecule (or “chromosome”). Such DNA molecules must be 
quite large in order to embrace enough information for making the macromole- 
cules necessary to maintain a living cell. The Escherichia coli chromosome has a mo- 
lecular mass of 2.9 X 10° D and contains more than 9 million nucleotides. Eukary- 
otic cells have many chromosomes, and DNA is found principally in two copies in 
the diploid chromosomes of the nucleus. DNA is also found in mitochondria and 
in chloroplasts, where it encodes some of the proteins and RNAs unique to these 
organelles. 

In contrast, RNA occurs in multiple copies and various forms. Cells typically con- 
tain about eight times as much RNA as DNA. RNA has a number of important bio- 
logical functions, its central one being information transfer from DNA to protein. 
RNA molecules playing this role are categorized into several major types: messen- 
ger RNA, ribosomal RNA, transfer RNA, and small nuclear RNA. Another type, 
small RNAs (RNA 21 to 28 nucleotides in length), consists of important players in 
gene regulation. Beyond various roles in information transfer, RNA participates in 
a number of metabolic functions, including the processing and modification of 
tRNA, rRNA, and mRNA and several maintenance or “housekeeping” functions, 
such as preservation of telomeres. 

With these basic definitions in mind, let’s now briefly consider the chemical and 
structural nature of DNA and the various RNAs. Chapter 11 elaborates on methods 
to determine the primary structure of nucleic acids by sequencing methods and dis- 
cusses the secondary and tertiary structures of DNA and RNA. Part IV, Information 
Transfer, includes a detailed treatment of the dynamic role of nucleic acids in the 
molecular biology of the cell. 


The Fundamental Structure of DNA Is a Double Helix 


The DNA isolated from different cells and viruses characteristically consists of two 
polynucleotide strands wound together to form a long, slender, helical molecule, 
the DNA double helix. The strands run in opposite directions; that is, they are 
antiparallel. The two strands are held together in the double helical structure 
through interchain hydrogen bonds (Figure 10.16). These H bonds pair the bases of 
nucleotides in one chain to complementary bases in the other, a phenomenon 
called base pairing. 


Telomeres are specialized sequences at the ends 
of chromosomes. 
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Segment of unwound double helix 
illustrating the antiparallel orientation 
of the complementary strands 


FIGURE 10.16 The antiparallel nature of the DNA double $ 
helix. 3' 5 


Erwin Chargaff’s Analysis of the Base Composition of Different DNAs Provided a 
Key Clue to DNA Structure A clue to the chemical basis of base pairing in DNA came 
from the analysis of the base composition of various DNAs by Erwin Chargaff in the 
late 1940s. His data showed that the four bases commonly found in DNA (A, C, G, and 
T) do not occur in equimolar amounts and that the relative amounts of each vary from 
species to species (Table 10.1). Nevertheless, Chargaff noted that certain pairs of bases, 
namely, adenine and thymine, and guanine and cytosine, are always found in a 1:1 ra- 
tio and that the number of pyrimidine residues always equals the number of purine 
residues. These findings are known as Chargaff’s rules: [A] = [T]; [C] = [G]; [pyrim- 
idines] = [purines]. 








TABLE 10.1 ( 
Adenine Thymine Adenine Guanine Purines 

to to to to to 
Source Guanine Cytosine Thymine Cytosine Pyrimidines 
Ox 1.29 1.43 1.04 1.00 1.1 
Human 1.56 1.75 1.00 1.00 1.0 
Hen 1.45 1.29 1.06 0.91 0.99 
Salmon 1.43 1.43 1.02 1.02 1.02 
Wheat 1.22 1.18 1.00 0.97 0.99 
Yeast 1.67 1.92 1.03 1.20 1.0 
Haemophilus influenzae 1.74 1.54 1.07 0.91 1.0 
E. coli K-12 1.05 0.95 1.09 0.99 1.0 
Avian tubercle bacillus 0.4 0.4 1.09 1.08 1.1 
Serratia marcescens 0.7 0.7 0.95 0.86 0.9 
Bacillus schatz 0.7 0.6 1.12 0.89 1:0 











Source: After Chargaff, E., 1951. Structure and function of nucleic acids as cell constituents. Federation Proceedings 
10:654-659. 
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Watson and Crick’s Postulate of the DNA Double Helix Became the Icon of DNA 
Structure James Watson and Francis Crick, working in the Cavendish Laboratory 
at Cambridge University in 1953, took advantage of Chargaff’s results and the data 
obtained by Rosalind Franklin and Maurice Wilkins in X-ray diffraction studies on 
the structure of DNA to conclude that DNA was a complementary double helix. Two 
strands of deoxyribonucleic acid (sometimes referred to as the Watson strand and 
the Crick strand) are held together by the bonding interactions between unique base 
pairs, always consisting of a purine in one strand and a pyrimidine in the other. Base 
pairing is very specific: If the purine is adenine, the pyrimidine must be thymine. 
Similarly, guanine pairs only with cytosine (Figure 10.17). Thus, if an A occurs in 
one strand of the helix, T must occupy the complementary position in the oppos- 
ing strand. Likewise, a G in one dictates a C in the other. Because of this exclusive 
pairing of A only with T and G only with C, these pairs are taken as the standard or 
accepted law, and the A:T and G:C base pairs are often referred to as canonical. As 
Watson recognized from testing various combinations of bases using structurally ac- 
curate models, the A:T pair and the G:C pair form spatially equivalent units (Fig- 
ure 10.17). The backbone-to-backbone distance of an A:T pair is 1.11 nm, virtually 
identical to the 1.08 nm chain separation in G:C base pairs. 

Base pairing in the DNA molecule not only conforms to Chargaff results and Wat- 
son and Crick’s rules but also has a profound property relating to heredity: The sequence 
of bases in one strand has a complementary relationship to the sequence of bases in the other strand. 
That is, the information contained in the sequence of one strand is conserved in the 
sequence of the other. Therefore, separation of the two strands and faithful replication 
of each, through a process in which base pairing specifies the nucleotide sequence in 
the newly synthesized strand, leads to two progeny molecules identical in every respect 
to the parental double helix (Figure 10.18). Elucidation of the double helical structure 
of DNA represented one of the most significant events in the history of science. This 
discovery more than any other marked the beginning of molecular biology. Indeed, 
upon solving the structure of DNA, Crick proclaimed in The Eagle, a pub just across 
from the Cavendish lab, “We have discovered the secret of life!” 


The Information in DNA Is Encoded in Digital Form In this digital age, we are 

accustomed to electronic information encoded in the form of extremely long arrays of 

just two digits: ones (1s) and zeros (0s). DNA uses four digits to encode biological in- 

formation: A, C, G, and T. A significant feature of the DNA double helix is that virtu- 

ally any base sequence (encoded information) is possible: Other than the base-pairing 

rules, no structural constraints operate to limit the potential sequence of bases in DNA. 
DNA contains two kinds of information: 


1. The base sequences of genes that encode the amino acid sequences of proteins 
and the nucleotide sequences of functional RNA molecules such as rRNA and 
tRNA (see following discussion) 

2. The gene regulatory networks that control the expression of protein-encoding 
(and functional RNA-encoding) genes (see Chapter 29) 


FIGURE 10.17 The Watson-Crick base pairs A:T and G:C. 


Old Old 





Old New New Old 
Emerging progeny DNA 


FIGURE 10.18 Replication of DNA gives identical progeny 
molecules because base pairing is the mechanism that 
determines the nucleotide sequence of each newly syn- 
thesized strand. 
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FIGURE 10.19 If the cell walls of bacteria such as 
Escherichia coli are partially digested and the cells are 
then osmotically shocked by dilution with water, the 
contents of the cells are extruded to the exterior. In 
electron micrographs, the most obvious extruded com- 
ponent is the bacterial chromosome, shown here sur- 
rounding the cell. 


DNA Is in the Form of Enormously Long, Threadlike Molecules Because of the 
double helical nature of DNA molecules, their size can be represented in terms of 
the numbers of paired nucleotides (or base pairs) they contain. For example, the 
E. coli chromosome consists of 4.64 X 10° base pairs (abbreviated bp) or 4.64 X 10° 
kilobase pairs (kbp). DNA is a threadlike molecule. The diameter of the DNA dou- 
ble helix is only 2 nm, but the length of the DNA molecule forming the 
E. coli chromosome is over 1.6 X 10° nm (1.6 mm). The long dimension of an E. coli 
cell is only 2000 nm (0.002 mm), so its chromosome must be highly folded. Because 
of their long, threadlike nature, DNA molecules are easily sheared into shorter 
fragments during isolation procedures, and it is difficult to obtain intact chromo- 
somes even from the simple cells of prokaryotes. 


DNA in Cells Occurs in the Form of Chromosomes DNA occurs in various forms 
in different cells. The single chromosome of prokaryotic cells (Figure 10.19) is typ- 
ically a circular DNA molecule. Proteins are associated with prokaryotic DNA, but 
unlike eukaryotic chromosomes, prokaryotic chromosomes are not uniformly orga- 
nized into ordered nucleoprotein arrays. The DNA molecules of eukaryotic cells, 
each of which defines a chromosome, are linear and richly adorned with proteins. 
A class of arginine- and lysine-rich basic proteins called histones interact ionically 
with the anionic phosphate groups in the DNA backbone to form nucleosomes, 
structures in which the DNA double helix is wound around a protein “core” com- 
posed of pairs of four different histone polypeptides (see Section 11.5 in Chapter 
11). Chromosomes also contain a varying mixture of other proteins, so-called non- 
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Do the Properties of DNA Invite Practical Applications? 


The molecular recognition between one DNA strand and its com- 
plementary partner not only leads to formation of a double- 
stranded DNA, but it also creates a molecule with mechanical 
properties distinctly different from single-stranded DNA. DNA 
double helices are relatively rigid rods. Single-stranded DNA mol- 
ecules are flexible strands. These features are of interest to nano- 
technology, the new branch of applied science that aims to ma- 
nipulate matter at the molecular, or nanometer, level for practical 
purposes. Nanotechnology seeks to create nanodevices, nanoscale 
devices with simple machinelike qualities. 

DNA chains have been used to construct nanomachines capa- 
ble of simple movements such as rotation or pincerlike motions, 
and more elaborate DNA-based devices can even act as motors 
that walk along DNA tracks. To illustrate the principles, consider 
the DNA “tweezers” composed of three DNA strands (Q [red 
strand], S1, and S2) with regions of partial sequence complemen- 
tarity. The 40-nucleotide Q strand is hybridized with two differ- 
ent 42-nucleotide-long DNA strands, S1 (blue/purple) and S2 
(green/purple). Terminal segments of S1 and S2 are designed to 
be complementary to 18-nucleotide stretches at the opposite ends 
of Q. Base pairing between Q, Sl, and S2 forms a V-shaped 
supramolecular structure, the tweezers, in an open conformation 
(1). Both S1 and S2 have 24-nucleotide-long ends that remain un- 
paired. The DNA tweezers can be driven into a closed confor- 
mation by the “fuel,” a 56-nucleotide DNA strand (F) that has 
24 bases complementary to the unpaired region of S1 and 24 bases 
complementary to the unpaired region of S2. Hybridization of the 
“fuel” to the unpaired segments of S1 and S2 (blue and green, re- 
spectively) closes the “tweezers” (2). The F strand has eight un- 
paired nucleotides remaining at its end; this region is called the 
“toehold.” The “toehold” serves as the hybridization site for a fifth 
DNA strand, the R or “removal” strand. R is complementary to F 
along its length. Hybridization of the end of R to the complemen- 
tary eight-base “toehold” region of F results in the unzipping of F 


from S2 and then SI as it zips up with R. Removal of F returns the 
DNA tweezers to the open conformation (1). The F:R duplex is 
the “waste” generated by the operation of the DNA nanomachine. 
Thus, this DNA tweezers nanomachine consumes “fuel” and gen- 
erates “waste,” as many common machines do. 


a) 





Open 
F : R duplex 
(“waste”) s1 
R A-s-Q 
F F (“fuel”) 
N S2 NU 
“toehold” 
F 
R 
a 
Closed F 


(2) 


A DNA tweezers—a simple DNA nanomachine. (Adapted from Yurke, B., 
Turberfield, A. J., Mills, A. P., Jr., Simmel, F. C., and Neumann, J. L. 2000. A DNA-fuelled 
molecular machine made of DNA. Nature 406:605-608, as discussed in an article in the 
whimsically named nanoscience journal Small by Simmel, F. C., and Dittmer W. U., 2006. 
DNA nanodevices. Small 1:284-299). 
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histone chromosomal proteins, many of which are involved in regulating which 
genes in DNA are transcribed at any given moment. The amount of DNA in a 
diploid mammalian cell is typically more than 1000 times that found in an E. colt 
cell. Some higher plant cells contain more than 50,000 times as much. 


Various Forms of RNA Serve Different Roles in Cells 


Unlike DNA, cellular RNA molecules are almost always single-stranded. However, all 
of them typically contain double-stranded regions formed when stretches of nu- 
cleotides with complementary base sequences align in an antiparallel fashion and 
form canonical A:U and G:C base pairs. (Compare Figures 10.3 and 10.17 to con- 
vince yourself that U would pair with A in the same manner T does.) Such base pair- 
ing creates secondary structure. 


Messenger RNA Carries the Sequence Information for Synthesis of a Protein 
Messenger RNA (mRNA) serves to carry the information or “message” that is en- 
coded in genes to the sites of protein synthesis in the cell, where this information is 
translated into a polypeptide sequence. That is, mRNA molecules are transcribed 
copies of the protein-coding genetic units of DNA. Prokaryotic mRNAs have from 
75 to 3,000 nucleotides; mRNA constitutes about 2% of total prokaryotic RNA. 

Messenger RNA is synthesized during transcription, an enzymatic process in which 
an RNA copy is made of the sequence of bases along one strand of DNA. This mRNA 
then directs the synthesis of a polypeptide chain as the information that is contained 
within its nucleotide sequence is translated into an amino acid sequence by the 
protein-synthesizing machinery of the ribosomes. Ribosomal RNA and tRNA mole- 
cules are also synthesized by transcription of DNA sequences, but unlike mRNA mol- 
ecules, these RNAs are not subsequently translated to form proteins. In prokaryotes, 
a single mRNA may contain the information for the synthesis of several polypeptide 
chains within its nucleotide sequence (Figure 10.20). In contrast, eukaryotic mRNAs 
encode only one polypeptide but are more complex in that they are synthesized in 
the nucleus in the form of much larger precursor molecules called heterogeneous 
nuclear RNA, or hnRNA. hnRNA molecules contain stretches of nucleotide sequence 
that have no protein-coding capacity. These noncoding regions are called intervening 
sequences or introns because they intervene between coding regions, which are called 
exons. Introns interrupt the continuity of the information specifying the amino acid 
sequence of a protein and must be spliced out before the message can be translated. 
In addition, eukaryotic hnRNA and mRNA molecules have a run of 100 to 200 
adenylic acid residues attached at their 3’-ends, so-called poly(A) tails. This poly- 
adenylation occurs after transcription has been completed and is essential for effi- 
cient translation and stability of the mRNA. The properties of mRNA molecules as 
they move through transcription and translation in prokaryotic versus eukaryotic cells 
are summarized in Figure 10.20. 


Ribosomal RNA Provides the Structural and Functional Foundation for Ribosomes 
Ribosomes, the supramolecular assemblies where protein synthesis occurs, are about 
65% RNA of the ribosomal RNA type. Ribosomal RNA (rRNA) molecules fold into 
characteristic secondary structures as a consequence of intramolecular base-pairing 
interactions (Figure 10.21). The different species of rRNA are generally referred to 
according to their sedimentation coefficients! (see the Appendix to Chapter 5), 
which are a rough measure of their relative size (Figure 10.22). 

Ribosomes are composed of two subunits of different sizes that dissociate from 
each other if the Mg?* concentration is below 10~* M. Each subunit is a supramol- 
ecular assembly of proteins and RNA and has a total mass of 10° D or more. E. coli 
ribosomal subunits have sedimentation coefficients of 30S (the small subunit) and 
50S (the large subunit). Eukaryotic ribosomes are somewhat larger than prokary- 


Sedimentation coefficients are a measure of the velocity with which a particle sediments in a cen- 
trifugal force field. Sedimentation coefficients are expressed in Svedbergs (symbolized S), named to 
honor The Svedberg, developer of the ultracentrifuge. One S=107!" sec. 
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CENGAGENOW” ANIMATED FIGURE 10.20 Transcription and translation of mRNA mole- 
cules in prokaryotic versus eukaryotic cells. See this figure animated at www.cengage KO ; 
OL Protein A 


.com/login. 


FIGURE 10.21 Ribosomal RNA has a complex secondary 
structure due to many intrastrand hydrogen bonds. The 
gray line in this figure traces a polynucleotide chain con- 
sisting of more than 1000 nucleotides. Aligned regions 
represent H-bonded complementary base sequences. 


otic ribosomes, consisting of 40S and 60S subunits. More than 80% of total cellular 
RNA is represented by the various forms of rRNA. 

Ribosomal RNAs characteristically contain a number of specially modified nu- 
cleotides, including pseudouridine residues, ribothymidylic acid, and methylated 
bases (Figure 10.23). The central role of ribosomes in the biosynthesis of proteins 
is treated in detail in Chapter 30. Here we briefly note the significant point that ge- 
netic information in the nucleotide sequence of an mRNA is translated into the 
amino acid sequence of a polypeptide chain by ribosomes. 


Transfer RNAs Carry Amino Acids to Ribosomes for Use in Protein Synthesis 
Transfer RNAs (tRNAs) serve as the carrier of amino acids for protein synthesis (see 
Chapter 30). tRNA molecules also fold into a characteristic secondary structure 
(Figure 10.24). tRNAs are small RNA molecules, containing 73 to 94 residues, a sub- 
stantial number of which are methylated or otherwise unusually modified. Each of 
the 20 amino acids in proteins has at least one unique tRNA species dedicated to 
chauffeuring its delivery to ribosomes for insertion into growing polypeptide 
chains, and some amino acids are served by several tRNAs. In eukaryotes, there are 
even discrete sets of tRNA molecules for each site of protein synthesis—the cyto- 
plasm, the mitochondrion, and in plant cells, the chloroplast. All tRNA molecules 
possess a 3’-terminal nucleotide sequence that reads -CCA, and the amino acid is 
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FIGURE 10.22 The organization and composition of prokaryotic and eukaryotic ribosomes. 
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FIGURE 10.23 Unusual bases in RNA. 


carried to the ribosome attached as an acyl ester to the free 3'-OH of the terminal 
A residue. These aminoacyl-tRNAs are the substrates of protein synthesis, the amino 
acid being transferred to the carboxyl end of a growing polypeptide. The peptide 
bond-forming reaction is a catalytic process intrinsic to ribosomes. 


Small Nuclear RNAs Mediate the Splicing of Eukaryotic Gene Transcripts (hnRNA) 
into mRNA Small nuclear RNAs, or snRNAs, are a class of RNA molecules found 
in eukaryotic cells, principally in the nucleus. They are neither tRNA nor small 
rRNA molecules, although they are similar in size to these species. They contain 
from 100 to about 200 nucleotides, some of which, like tRNA and rRNA, are methy- 
lated or otherwise modified. No snRNA exists as naked RNA. Instead, snRNA is 
found in stable complexes with specific proteins forming small nuclear ribonucleo- 
protein particles, or snaRNPs, which are about 10S in size. Their occurrence in eu- 
karyotes, their location in the nucleus, and their relative abundance (1% to 10% of 
the number of ribosomes) are significant clues to their biological purpose: snRNPs 
are important in the processing of eukaryotic gene transcripts (hnRNA) into ma- 
ture messenger RNA for export from the nucleus to the cytoplasm (Figure 10.20). 
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The RNA World and Early Evolution 


Proteins are encoded by nucleotide sequences in DNA. DNA repli- 
cation depends on the activity of protein enzymes. These two state- 
ments form a “chicken and egg” paradox: Which came first in 
evolution—DNA or protein? Neither, it seems. The 1989 Nobel Prize 
in Chemistry was awarded to Thomas Cech and Sidney Altman for 
their discovery that RNA molecules are not only informational but 
also may be catalytic. This discovery gave evidence to earlier specu- 
lation by Carl Woese, Francis Crick, and Leslie Orgel that prebiotic 
evolution (that is, early evolution before cells arose) depended on 
self-replicating and catalytic RNAs, with proteins and DNA appear- 
ing later. Three basic assumptions about the prebiotic RNA world 
are (1) RNA replication maintained information-carrying RNAs, 
(2) Watson—Crick base pairing was essential to RNA replication, and 
(3) genetically encoded proteins were unavailable as catalysts. The 
challenge shifts to explaining the origin of nucleotides and their 
polymerization to form RNA. 

Adenine exists in outer space and is found in comets and me- 
teorites. A likely source is conversion of aminoimidazolecarboni- 
trile to adenine. (Aminoimidazolecarbonitrile is a tetramer of 


Glycolaldehyde can combine with other simple compounds to 
form ribose (and glucose). Glycolaldehyde has been detected in a 
gas cloud at the center of the Milky Way, our galaxy. 
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A Adapted from Glaser, R., et al., 2007. Adenine synthesis in interstellar space: Mechanisms of 
prebiotic pyrimidine-ring formation of monocyclic HCN-pentamers. Astrobiology 7:455-470. 


detected in intergalactic clouds.) Inorganic phosphate, the re- 
maining ingredient in nucleotides, is a common component in 
naturally occurring aqueous solutions. Its negative charge allows it 
to interact readily with positively charged mineral surfaces, upon 
which the first nucleotides may have spontaneously assembled. 
These tantalizing facts are bright spots along the dim thread that 
connects us to our distant past. The RNA world is an attractive 
hypothesis. 


Reference: Gesteland, R. F., Cech, T. R., Atkins, J. F., eds., 2006. The RNA 
World: The Nature of Modern RNA Suggests a Prebiotic RNA World, 3rd ed. Cold 
Spring Harbor, NY: Cold Spring Harbor Laboratory Press. 








Small RNAs Serve a Number of Roles, Including Gene Silencing A class of RNA 
molecules even smaller than tRNAs is the small RNAs, so-called because they are 
only 21 to 28 nucleotides long. (Some refer to this class as the noncoding RNAs [or 
ncRNAs]. Others refer to small RNAs as regulatory RNAs, because virtually every 
step along the pathway of gene expression can be regulated by one or another small 
RNA.) Small RNAs are involved in a number of novel biological functions. These 
small RNAs can target DNA or RNA through complementary base pairing. Base 
pairing of the small RNA with particular nucleotide sequences in the target is called 
direct readout. 

Small RNAs are classified into a number of subclasses on the basis of their 
function. RNA interference (RNAi) is mediated by one subclass, the small inter- 
fering RNAs (siRNAs). siRNAs disrupt gene expression by blocking specific 
protein production, even though the mRNA encoding the protein has been syn- 
thesized. The 21- to 23-nucleotide-long siRNAs act by base pairing with comple- 
mentary sequences within a particular mRNA to form regions of double-stranded 
RNA (dsRNA). These dsRNA regions are then specifically degraded, eliminating 
the mRNA from the cell (see Chapter 12). Thus, RNAi is a mechanism to silence 
the expression of specific genes, even after they have been transcribed, a phe- 
nomenon referred to as gene silencing. RNAi is also implicated in modifying the 
structure of chromatin and causing large-scale influences in gene expression. An- 
other subclass, the micro RNAs (miRNAs) control developmental timing by base 
pairing with and preventing the translation of certain mRNAs, thus blocking syn- 
thesis of specific proteins. Thus, miRNAs also act in gene silencing. However, un- 
like siRNAs, miRNAs (22 nucleotides long) do not cause mRNA degradation. A 
third subclass is the small nucleolar RNAs (snoRNAs). snoRNAs (60 to 300 nu- 
cleotides long) are catalysts that accomplish some of the chemical modifications 
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found in tRNA, rRNA, and even DNA (see Figure 10.23, for example). Small 
RNAs in bacteria (known by the acronym sRNAs) play an important role altering 
gene expression in response to stressful environmental situations. 


The Chemical Differences Between DNA and RNA Have 
Biological Significance 
Two fundamental chemical differences distinguish DNA from RNA: 


1. DNA contains 2-deoxyribose instead of ribose. 
2. DNA contains thymine instead of uracil. 


What are the consequences of these differences, and do they hold any significance 
in common? An argument can be made that, because of these differences, DNA is 
chemically more stable than RNA. The greater stability of DNA over RNA is consis- 
tent with the respective roles these macromolecules have assumed in heredity and 
information transfer. 

Consider first why DNA contains thymine instead of uracil. The key observation is 
that cylosine deaminates to form uracil at a finite rate in vivo (Figure 10.25). Because C 
in one DNA strand pairs with G in the other strand, whereas U would pair with A, 
conversion of a C to a U could potentially result in a heritable change of a C:G pair 
to a U:A pair. Such a change in nucleotide sequence would constitute a mutation in 
the DNA. To prevent this C deamination from leading to permanent changes in nu- 
cleotide sequence, a cellular repair mechanism “proofreads” DNA, and when a U 
arising from C deamination is encountered, it is treated as inappropriate and is re- 
placed by a C. If DNA normally contained U rather than T, this repair system could 
not readily distinguish U formed by C deamination from U correctly paired with A. 
However, the U in DNA is “5-methyl-U” or, as it is conventionally known, thymine 
(Figure 10.26). That is, the 5-methyl group on T labels it as if to say “this U belongs; 
do not replace it.” 

The other chemical difference between RNA and DNA is that the ribose 
2'-OH group on each nucleotide in RNA is absent in DNA. Consequently, the 
ubiquitous 3'-O of polynucleotide backbones lacks a vicinal hydroxyl neighbor in 
DNA. This difference leads to a greater resistance of DNA to alkaline hydrolysis, 
examined in detail in the following section. To view it another way, RNA is less 
stable than DNA because its vicinal 2'-OH group makes the 3’-phosphodiester 
bond susceptible to nucleophilic cleavage (Figure 10.27). For just this reason, it 
is selectively advantageous for the heritable form of genetic information to be 
DNA rather than RNA. 


10:6: Are Nucleic Acids Susceptible to Hydrolysis? 


Most reactions of nucleic acid hydrolysis break phosphodiester bonds in the polynu- 
cleotide backbone even though such bonds are among the most stable chemical 
bonds found in biological molecules. In the laboratory, hydrolysis of polynu- 
cleotides will generate smaller fragments that are easier to manipulate and study. 


RNA Is Susceptible to Hydrolysis by Base, But DNA Is Not 


RNA is relatively resistant to the effects of dilute acid, but gentle treatment of DNA 
with 1 mM HCI leads to hydrolysis of purine glycosidic bonds and the loss of 
purine bases from the DNA. The glycosidic bonds between pyrimidine bases and 
2'-deoxyribose are not affected, and in this case, the polynucleotide’s sugar- 
phosphate backbone remains intact. The purine-free polynucleotide product is 
called apurinic acid. 

DNA is not susceptible to alkaline hydrolysis. On the other hand, RNA is alkali 
labile and is readily hydrolyzed by hydroxide ions (Figure 10.27). DNA has no 
2'-OH; therefore, DNA is alkali stable. 
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FIGURE 10.25 Deamination of cytosine forms uracil. 





FIGURE 10.26 The 5-methyl group on thymine labels it 
as a special kind of uracil. 
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CENGAGENOW” ANIMATED FIGURE 10.27 Alkaline 
hydrolysis of RNA. The vertical lines represent ribose; the 
diagonals the phosphodiester linkages joining succes- 
sive nucleotides. Nucleophilic attack by OH- on the 

P atom leads to 5'-phosphoester cleavage and random 
hydrolysis of the cyclic 2’,3’-phosphodiester intermedi- 
ate to give a mixture of 2’- and 3’-nucleoside mono- 
phosphate products. See this figure animated at 
www.cengage.com/login. 
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A nucleophile such as OH” can abstract the H of the 2'-OH, 
generating 2'-O7 which attacks the 5*P of the phosphodiester bridge: 
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Complete hydrolysis of RNA by alkali yields a random 
mixture of 2'-NMPs and 3'-NMPs. 


The Enzymes That Hydrolyze Nucleic Acids Are Phosphodiesterases 


Enzymes that hydrolyze nucleic acids are called nucleases. Virtually all cells contain 
various nucleases that serve important housekeeping roles in the normal course of 
nucleic acid metabolism. Organs that provide digestive fluids, such as the pancreas, 
are rich in nucleases and secrete substantial amounts to hydrolyze ingested nucleic 
acids. Fungi and snake venom are often good sources of nucleases. As a class, nucle- 
ases are phosphodiesterases because they catalyze the cleavage of phosphodiester 
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Convention: The 3'-side of each phosphodiester 
is termed @; the 5'-side is termed ®. 


(a) Hydrolysis of the @bond yields 5'-PO,4 products: 
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(b) Hydrolysis of the®bond yields 3'-PO,4 products: 
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bonds by HO. Because each internal phosphate in a polynucleotide backbone is 
involved in two phosphoester linkages, cleavage can potentially occur on either side 
of the phosphorus (Figure 10.28). Convention labels the 3'-side as a and the 5’-side 
as b. Enzymes or reactions that hydrolyze nucleic acids are characterized as acting at 
either a or b. A second convention denotes whether the nucleic acid chain was 
cleaved at some internal location, endo, or whether a terminal nucleotide residue was 
hydrolytically removed, exo. Note that exo a cleavage occurs at the 3’-end of the poly- 
mer, whereas exo b cleavage involves attack at the 5'-terminus (Figure 10.28). 


Nucleases Differ in Their Specificity for Different Forms of Nucleic Acid 


Nucleases play an indispensable role in the cellular breakdown of nucleic acids and 
salvage of their constituent parts. Nucleases also participate in many other cellular 
functions, including (1) aspects of DNA metabolism, such as replication and repair; 
(2) aspects of RNA metabolism, such as splicing of the primary gene transcript, pro- 
cessing of mRNA, and RNAi; (3) rearrangements of genetic material, such as re- 
combination and transposition; (4) host defense mechanisms against foreign nucleic 
acid molecules; and (5) the immune response, through assembly of immunoglobu- 
lin genes (these topics are discussed in depth in Part IV). Some nucleases are not 
even proteins but instead are catalytic RNA molecules (see Chapter 13). 

Like most enzymes (see Chapter 13), nucleases exhibit selectivity or specificity 
for the nature of the substance on which they act. That is, some nucleases act 
only on DNA (DNases), whereas others are specific for RNA (the RNases). Still 
others are nonspecific and are referred to simply as nucleases. Nucleases may 
also show specificity for only single-stranded nucleic acids or may act only on 
double helices. Some display a decided preference for acting only at certain 
bases in a polynucleotide, or as we shall see for restriction endonucleases, act only at 
a particular nucleotide sequence four to eight nucleotides (or more) in length. 
To the molecular biologist, nucleases are the surgical tools for the dissection and 
manipulation of nucleic acids in the laboratory. 
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FIGURE 10.28 Cleavage in polynucleotide chains. 

(a) Cleavage on the a side leaves the phosphate 
attached to the 5’-position of the adjacent nucleotide, 
while (b) b-side hydrolysis yields 3’-phosphate products. 
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Restriction Enzymes Are Nucleases That Cleave Double-Stranded 
DNA Molecules 


Restriction endonucleases are enzymes, isolated chiefly from bacteria, that have the 
ability to cleave double-stranded DNA. The term restriction comes from the capacity 
of prokaryotes to defend against or “restrict” the possibility of takeover by foreign 
DNA that might gain entry into their cells. Prokaryotes degrade foreign DNA by us- 
ing their unique restriction enzymes to chop it into relatively large but noninfective 
fragments. Restriction enzymes are classified into three types: I, II, or III. Types I 
and III require ATP to hydrolyze DNA and can also catalyze chemical modification 
of DNA through addition of methyl groups to specific bases. Type I restriction en- 
donucleases cleave DNA randomly, whereas type III recognize specific nucleotide 
sequences within dsDNA and cut the DNA at or near these sites. 


Type II Restriction Endonucleases Are Useful for Manipulating 
DNA in the Lab 


Type II restriction enzymes have received widespread application in the cloning 
and sequencing of DNA molecules. Their hydrolytic activity is not ATP-dependent, 
and they do not modify DNA by methylation or other means. Most important, they 
cut DNA within or near particular nucleotide sequences that they specifically rec- 
ognize. These recognition sequences are typically four or six nucleotides in length 
and have a twofold axis of symmetry. For example, E. coli has a restriction enzyme, 
EcoRI, that recognizes the hexanucleotide sequence GAATTC: 


5'—N—N—N—N—G—A—A—T—T—C—N—N—N—N—3' 








3'—-N—N—N—N—C —T—T—A—A—G—N—N—N—N—' 
Note the twofold symmetry: the sequence read 5'—>3' is the same in both strands. 


When EcoRI encounters this sequence in dsDNA, it causes a staggered, double- 
stranded break by hydrolyzing each chain between the G and A residues: 





5'—N—N—N—N—G A—A—T—T—C—N—N—N-N—3' 
3_N-N-N-N-G—T-T-A-A_ Ġ—N—N—-N-N—v' 





Staggered cleavage results in fragments with protruding single-stranded 5'-ends: 








N-N-N-N—G 5' A—A—T—T—C—N-N-N-N—3' 








3' 


N—N—N—N—C —T—T—A—A 5’ G—N—N—N—N—5’ 


Because the protruding termini of EcoRI fragments have complementary base se- 
quences, they can form base pairs with one another. 


Deal NS ATA ECAN ENEN 








N—N—N—N—C—T—T—A—A G—N—N—N—N 
Therefore, DNA restriction fragments having such “sticky” ends can be joined to- 
gether to create new combinations of DNA sequence. If fragments derived from 
DNA molecules of different origin are combined, novel recombinant forms of DNA 
are created. 

EcoRI leaves staggered 5’-termini. Other restriction enzymes, such as PstI, 
which recognizes the sequence 5'-CTGCAG-3’ and cleaves between A and G, pro- 
duce cohesive staggered 3’-ends. Still others, such as Ball, act at the center of the 
twofold symmetry axis of their recognition site and generate blunt ends that are 
noncohesive. Ball recognizes 5'-TGGCCA-3' and cuts between G and C. 

Table 10.2 lists many of the commonly used restriction endonucleases and their 
recognition sites. Different restriction enzymes sometimes recognize and cleave 
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TABLE 10.2 


About 1000 restriction enzymes have been characterized. They are named by italicized 
three-letter codes; the first is a capital letter denoting the genus of the organism of 
origin, and the next two letters are an abbreviation of the particular species. Because 
prokaryotes often contain more than one restriction enzyme, the various representatives 
are assigned letter and number codes as they are identified. Thus, EcoRI is the initial 
restriction endonuclease isolated from Escherichia coli, strain R. With one exception 
(Neil), all known type II restriction endonucleases generate fragments with 5'-PO, and 











3'-OH ends. 
Common Recognition 
Enzyme Isoschizomers Sequence Compatible Cohesive Ends 
Alul AG|CT Blunt 
Apyl Atul, EcoRI CC|G(4)GG 
Asull TTĻCGAA Clal, Hpall, Taqi 
Aval G|PyCGPuG Sall, Xhol, Xmal 
Avril C|CTAGG 
Ball TGG/CCA Blunt 
BamHI G|GATCC Bell, BglII, Mbol, Sau3A, Xholl 
Bell T)GATCA BamHI, BglII, Mbol, Sau3A, Xholl 
BglII AĻGATCT BamHI, Bell, Mbol, Sau3A, Xholl 
BstEL G|GTNACC 
BstXI CCANNNNN|NTGG 
Clal AT|CGAT Accl, Acyl, Asull, Hpall, TaqI 
Ddel C/TNAG 
EcoRI GJAATTC 
EcoRII Atul, Apyl {CC(4) GG 
FnuDIil Thal CG|CG Blunt 
Hael (4) GG|CC(4) Blunt 
Haell PuGCGC|Py 
Hae GGJ/CC Blunt 
Hincll GTPy|PuAC Blunt 
Hindi AĻAGCTT 
Hpal GTTJAAC Blunt 
Apall CCGG Accl, Acyl, Asull, Clal, TaqI 
KpnI GGTAC|C 
Mbol Sau3A. {GATC BamHI, Bell, BglII, Xholl 
MspI CCGG 
MstI TGCĻGCA Blunt 
NotI GC|GGCCGC 
PstI CTGCA|G 
Sacl SstI GAGCT|C 
Sall GJTCGAC Aval, Xhol 
Sau3A {GATC BamHI, Bell, BglII, Mbol, Xholl 
Sfil GGCCNNNN|NGGCCG 
Smal Xmal CCC|GGG Blunt 
SphI GCATG|C 
SstI Sacl GAGCT|C 
Taq TĻCGA Accl, Acyl, Asull, Clal, Hpall 
Xbal T/CTAGA 
Xhol C\TCGAG Aval, Sall 
Xholl (è) ĻGATC (2) BamHI, Bell, BglII, Mbol, Sau3A 
Xmal Smal CĻCCGGG Aval 
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FIGURE 10.29 Restriction mapping of a DNA molecule as determined by an analysis of the electrophoretic pat- 
tern obtained for different restriction endonuclease digests. (Keep in mind that a dsDNA molecule has a 
unique nucleotide sequence and therefore a definite polarity; thus, fragments from one end are distinctly dif- 
ferent from fragments derived from the other end.) 


within identical target sequences. Such enzymes are called isoschizomers, meaning 
that they cut at the same site; for example, Mbol and Sau3A are isoschizomers. 


Restriction Fragment Size Assuming random distribution and equimolar pro- 
portions for the four nucleotides in DNA, a particular tetranucleotide sequence 
should occur every 4* nucleotides, or every 256 bases. Therefore, the fragments 
generated by a restriction enzyme that acts at a four-nucleotide sequence should 
average about 250 bp in length. “Six-cutters,” enzymes such as EcoRI or BamHI, 
will find their unique hexanucleotide sequences on the average once in every 
4096 (4°) bp of length. Because the genetic code is a triplet code with three suc- 
cessive bases in a DNA strand specifying one amino acid in a polypeptide sequence, 
and because polypeptides typically contain at most 1000 amino acid residues, the 
fragments generated by six-cutters are approximately the size of prokaryotic genes. 
This property makes these enzymes useful in the construction and cloning of ge- 


netically useful recombinant DNA molecules. For the isolation of even larger nu- 
cleotide sequences, such as those of genes encoding large polypeptides (or those 
of eukaryotic genes that are disrupted by large introns), partial or limited digestion 
of DNA by restriction enzymes can be employed. However, restriction endonucle- 
ases that cut only at specific nucleotide sequences 8 or even 13 nucleotides in 
length are also available, such as NotI and Sfil. 


Restriction Endonucleases Can Be Used to Map the Structure 
of a DNA Fragment 


The application of these sequence-specific nucleases to problems in molecular bi- 
ology is considered in detail in Chapter 12, but one prominent application is de- 
scribed here. Because restriction endonucleases cut dsDNA at unique sites to gen- 
erate large fragments, they provide a means for mapping DNA molecules that are 
many kilobase pairs in length. Restriction digestion of a DNA molecule is in many 
ways analogous to proteolytic digestion of a protein by an enzyme such as trypsin 
(see Chapter 5): The restriction endonuclease acts only at its specific sites so that 
a discrete set of nucleic acid fragments is generated. This action is analogous to 
trypsin cleavage only at Arg and Lys residues to yield a particular set of tryptic pep- 
tides from a given protein. The restriction fragments represent a unique collection 
of different-sized DNA pieces. Fortunately, this complex mixture can be resolved 
by electrophoresis (see the Appendix to Chapter 5). Electrophoresis of DNA mole- 
cules on gels of restricted pore size (as formed in agarose or polyacrylamide me- 
dia) separates them according to size, the largest being retarded in their migration 
through the gel pores while the smallest move relatively unhindered. Figure 10.29 
shows a hypothetical electrophoretogram obtained for a DNA molecule treated 
with two different restriction nucleases, alone and in combination. Just as cleavage 
of a protein with different proteases to generate overlapping fragments allows an 
ordering of the peptides, restriction fragments can be ordered or “mapped” ac- 
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cording to their sizes, as deduced from the patterns depicted in Figure 10.29. 


SUMMARY 


Nucleotides and nucleic acids possess heterocyclic nitrogenous bases 
as principal components of their structure. Nucleotides participate as 
essential intermediates in virtually all aspects of cellular metabolism. 
Nucleic acids are the substances of heredity (DNA) and the agents of 
genetic information transfer (RNA). 


10.1 What Are the Structure and Chemistry of Nitrogenous Bases? 
The bases of nucleotides and nucleic acids are derivatives of either 
pyrimidine (cytosine, uracil, and thymine) or purine (adenine and gua- 
nine). The aromaticity of the pyrimidine and purine ring systems and 
the electron-rich nature of their —OH and ring nitrogen substituents 
allow them to undergo keto—-enol tautomeric shifts and endow them 
with the capacity to absorb UV light. 


10.2 What Are Nucleosides? Nucleosides are formed when a base is 
linked to a sugar. The usual sugars of nucleosides are pentoses; ribo- 
nucleosides contain the pentose D-ribose, whereas 2-deoxy-D-ribose is 
found in deoxyribonucleosides. Nucleosides are more water soluble 
than free bases. 


10.3 What Are the Structure and Chemistry of Nucleotides? A nu- 
cleotide results when phosphoric acid is esterified to a sugar —OH 
group of a nucleoside. Successive phosphate groups can be linked to 
the phosphoryl group of a nucleotide through phosphoric anhydride 
linkages. Nucleoside 5'-triphosphates, as carriers of chemical energy, 
are indispensable agents in metabolism because phosphoric anhydride 
bonds are a prime source of chemical energy to do biological work. Vir- 
tually all of the biochemical reactions of nucleotides involve either phos- 
phate or pyrophosphate group transfer. The bases of nucleotides serve as “in- 
formation symbols.” 


10.4 What Are Nucleic Acids? Nucleic acids are polynucleotides: linear 
polymers of nucleotides linked 3’ to 5’ by phosphodiester bridges. The 
only significant variation in the chemical structure of nucleic acids is the 
particular base at each nucleotide position. These bases are not part of 
the sugar—phosphate backbone but instead serve as distinctive side chains. 


10.5 What Are the Different Classes of Nucleic Acids? The two major 
classes of nucleic acids are DNA and RNA. Two fundamental chemical 
differences distinguish DNA from RNA: The nucleotides in DNA con- 
tain 2-deoxyribose instead of ribose as their sugar component, and DNA 
contains the base thymine instead of uracil. These differences confer 
important biological properties on DNA. 

DNA consists of two antiparallel polynucleotide strands wound to- 
gether to form a long, slender, double helix. The strands are held 
together through specific base pairing of A with T and C with G. The 
information in DNA is encoded in digital form in terms of the se- 
quence of bases along each strand. Because base pairing is specific, 
the information in the two strands is complementary. DNA molecules 
may contain tens or even hundreds of millions of base pairs. In eukary- 
otic cells, DNA is complexed with histone proteins to form a nucleo- 
protein complex known as chromatin. 

RNA occurs in multiple forms in cells, almost all of which are 
single-stranded. Nevertheless, the presence of complementary nu- 
cleotide sequences within the strand gives rise to multiple double- 
stranded regions in RNA molecules. Messenger RNA (mRNA) mole- 
cules are direct copies of the base sequences of protein-coding genes. 
Ribosomal RNA (rRNA) molecules provide the structural and func- 
tional foundations for ribosomes, the agents for translating mRNAs 
into proteins. In protein synthesis, the amino acids are delivered to 
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the ribosomes in the form of aminoacyl-tRNA (transfer RNA) deriva- 
tives. Small nuclear RNAs (snRNAs) are characteristic of eukaryotic 
cells and are necessary for processing the RNA transcripts of protein- 
coding genes into mature mRNA molecules. Small RNAs are a recently 
discovered class of regulatory RNA molecules. A prominent role of 
small RNAs is gene silencing, particularly in the phenomenon of RNA 
interference (RNAi). 


10.6 Are Nucleic Acids Susceptible to Hydrolysis? Like all biological 
polymers, nucleic acids are susceptible to hydrolysis, particularly hydrol- 


ysis of the phosphoester bonds in the polynucleotide backbone. RNA is 
susceptible to hydrolysis by base: DNA is not. Nucleases are hydrolytic en- 
zymes that cleave the phosphoester linkages in the sugar—-phosphate 
backbone of nucleic acids. Nucleases abound in nature, with varying 
specificity for RNA or DNA, single- or double-stranded nucleic acids, 
endo versus exo action, and 3’- versus 5’-cleavage of phosphodiesters. Re- 
striction endonucleases of the type II class are sequence-specific endo- 
nucleases useful in mapping the structure of DNA molecules. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 


1. Draw the principal ionic species of 5’-GMP occurring at pH 2. 
2. Draw the chemical structure of pACG. 


3. Chargaff’s results (Table 10.1) yielded a molar ratio of 1.29 for A to 
G in ox DNA, 1.43 for T to C, 1.04 for A to T, and 1.00 for G to C. 
Given these values, what are the approximate mole fractions of A, 
C, G, and T in ox DNA? 


4. Results on the human genome published in Science (Science 291: 
1304-1350 [2001]) indicate that the haploid human genome con- 
sists of 2.91 gigabase pairs (2.91 10° base pairs) and that 27% of 
the bases in human DNA are A. Calculate the number of A, T, G, 
and C residues in a typical human cell. 


5. Adhering to the convention of writing nucleotide sequences in the 
5'—3' direction, what is the nucleotide sequence of the DNA strand 
that is complementary to ddATGGCAACTGTCACTA? 


6. Messenger RNAs are synthesized by RNA polymerases that read 
along a DNA template strand in the 3’5’ direction, polymerizing 
ribonucleotides in the 5'—>3' direction (see Figure 10.20). Give 
the nucleotide sequence (5'—>3') of the DNA template strand 
from which the following mRNA segment was transcribed: 
5’-UAGUGACAGUUGCGAU-3’. 

7. The DNA strand that is complementary to the template strand 
copied by RNA polymerase during transcription has a nucleotide se- 
quence identical to that of the RNA being synthesized (except 
T residues are found in the DNA strand at sites where U residues oc- 
cur in the RNA). An RNA transcribed from this nontemplate DNA 
strand would be complementary to the mRNA synthesized by RNA 
polymerase. Such an RNA is called antisense RNA because its base 
sequence is complementary to the “sense” mRNA. One strategy to 
thwart the deleterious effects of genes activated in disease states 
(such as cancer) is to generate antisense RNAs in affected cells. 
These antisense RNAs would form double-stranded hybrids with 
mRNAs transcribed from the activated genes and prevent their 
translation into protein. Suppose transcription of a cancer-activated 
gene yielded an mRNA whose sequence included the segment 
5'-UACGGUCUAAGCUGA. What is the corresponding nucleotide 
sequence (5'—>3') of the template strand in a DNA duplex that 
might be introduced into these cells so that an antisense RNA could 
be transcribed from it? 

8. A 10-kb DNA fragment digested with restriction endonuclease EcoRI 
yielded fragments 4 kb and 6 kb in size. When digested with BamHI, 
fragments 1, 3.5, and 5.5 kb were generated. Concomitant digestion 
with both EcoRI and BamHI yielded fragments 0.5, 1, 3, and 5.5 kb 
in size. Give a possible restriction map for the original fragment. 

9. Based on the information in Table 10.2, describe two different 20-base 
nucleotide sequences that have restriction sites for BamH1, PstI, Sall, 
and Smal. Give the sequences of the Smal cleavage products of each. 

10. (Integrates with Chapter 3.) The synthesis of RNA can be summa- 
rized by the reaction: 


n NTP—> (NMP), + n PP; 


What is the A G?’ overa for synthesis of an RNA molecule 100 nucleo- 
tides in length, assuming that the A G°’ for transfer of an NMP from 
an NTP to the 3'-O of polynucleotide chain is the same as the A G°” 
for transfer of an NMP from an NTP to HO? (Use data given in 
Table 3.3.) 


11. Gene expression is controlled through the interaction of proteins 
with specific nucleotide sequences in double-stranded DNA. 

a. List the kinds of noncovalent interactions that might take place 
between a protein and DNA. 

b. How do you suppose a particular protein might specifically inter- 
act with a particular nucleotide sequence in DNA? That is, how 
might proteins recognize specific base sequences within the dou- 
ble helix? 


12. Restriction endonucleases also recognize specific base sequences 
and then act to cleave the double-stranded DNA at a defined site. 
Speculate on the mechanisms by which this sequence recognition 
and cleavage reaction might occur by listing a set of requirements 
for the process to take place. 


13. A carbohydrate group is an integral part of a nucleoside. 

a. What advantage does the carbohydrate provide? 
Polynucleotides are formed through formation of a sugar- 
phosphate backbone. 

b. Why might ribose be preferable for this backbone instead of 
glucose? 

c. Why might 2-deoxyribose be preferable to ribose in some situa- 
tions? 

14. Phosphate groups are also integral parts of nucleotides, with the 
second and third phosphates of a nucleotide linked through phos- 
phoric anhydride bonds, an important distinction in terms of the 
metabolic role of nucleotides. 

a. What property does a phosphate group have that a nucleoside 
lacks? 

b. How are phosphoric anhydride bonds useful in metabolism? 

c. How are phosphate anhydride bonds an advantage to the ener- 
getics of polynucleotide synthesis? 


15. The RNAs acting in RNAi are about 21 nucleotides long. To judge 
whether it is possible to uniquely target a particular gene with a 
RNA of this size, consider the following calculation: What is the ex- 
pected frequency of occurrence of a specific 21-nt sequence? 


16. The haploid human genome consists of 3 X 10° base pairs. Using 
the logic in problem 15, one can calculate the minimum length 
of a unique DNA sequence expected to occur by chance just once 
in the human genome. That is, what is the length of an oligonu- 
cleotide whose expected frequency of occurrence is once every 
3 X 10° bp? 

17. Snake venom phosphodiesterase is an a@specific exonuclease (Fig- 
ure 10.28) that acts equally well on single-stranded RNA or DNA. 
Design a protocol based on snake venom phosphodiesterase that 
would allow you to determine the base sequence of an oligonu- 
cleotide. Hint: Adapt the strategy for protein sequencing by Edman 
degradation, as described on pages 80 and 102. 


18. From the answer to problem 4 and the molecular weights of (AMP 
(331 D), dCMP (307 D), dGMP (347 D), and dTMP (322 D), cal- 
culate the mass (in daltons) of the DNA in a typical human cell. 


Preparing for the MCAT Exam 

19. The bases of nucleotides and polynucleotides are “information sym- 
bols.” Their central role in providing information content to DNA 
and RNA is clear. What advantages might bases as “information 
symbols” bring to the roles of nucleotides in metabolism? 
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20. Structural complementarity is the key to molecular recognition, a 
lesson learned in Chapter 1. The principle of structural comple- 
mentarity is relevant to answering problems 5, 6, 7, 11, 12, and 19. 
The quintessential example of structural complementarity in all of 
biology is the DNA double helix. What features of the DNA double 
helix exemplify structural complementarity? 
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Structure of Nucleic Acids 


ESSENTIAL QUESTION 


The nucleotide sequence—the primary structure—of DNA not only determines its 
higher-order structure but it is also the physical representation of genetic informa- 
tion in organisms. RNA sequences, as copies of specific DNA segments, direct both 
the higher-order structure and the function of RNA molecules in information trans- 
fer processes. 


What are the higher-order structures of DNA and RNA, and what methodolo- 
gies have allowed scientists to probe these structures and the functions that 
derive from them? 


Chapter 10 presented the structure and chemistry of nucleotides and how these units 
are joined via phosphodiester bonds to form nucleic acids, the biological polymers for 
information storage and transmission. In this chapter, we investigate biochemical 
methods that reveal this information by determining the sequential order of nucleo- 
tides in a polynucleotide, the so-called primary structure of nucleic acids. Then, we 
consider the higher orders of structure in the nucleic acids: the secondary and tertiary 
levels. Although the focus here is primarily on the structural and chemical properties 
of these macromolecules, it is fruitful to keep in mind the biological roles of these re- 
markable substances. The sequence of nucleotides in nucleic acids is the embodiment 
of genetic information (see Part IV). We can anticipate that the cellular mechanisms 
for accessing this information, as well as reproducing it with high fidelity, will be illu- 
minated by knowledge of the chemical and structural qualities of these polymers. 


dele How Do Scientists Determine the Primary Structure 
of Nucleic Acids? 


Determining the primary structure of nucleic acids (the nucleotide sequence) 
would seem to be a more formidable problem than amino acid sequencing of pro- 
teins, simply because nucleic acids contain only 4 unique monomeric units (A, C, 
G, and T) whereas proteins have 20. With only four, there are apparently fewer spe- 
cific sites for selective cleavage, distinctive sequences are more difficult to recog- 
nize, and the likelihood of ambiguity is greater. The much greater number of 
monomeric units in most polynucleotides as compared to polypeptides is a further 
difficulty. However, two simple tools make nucleic acid sequencing substantially eas- 
ier than polypeptide sequencing. One of these tools is the set of type II restriction en- 
donucleases that cleave DNA at specific oligonucleotide sites, generating unique frag- 
ments of manageable size (see Chapter 10). The second is gel electrophoresis, a 
method capable of separating nucleic acid fragments that differ from one another 
in length by just a single nucleotide. 


The Nucleotide Sequence of DNA Can Be Determined 
from the Electrophoretic Migration of a Defined Set 
of Polynucleotide Fragments 


The most widely used protocol for nucleic acid sequencing is the chain termination 
or dideoxy method of Frederick Sanger, which relies on enzymatic replication of 
the DNA to be sequenced. Very sensitive analytical techniques that can detect the 
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newly synthesized DNA chains following electrophoretic separation are available, so 
Sanger sequencing can be carried out on as little as 1 attomole (amol, 107 '8mol) of 
DNA contained in less than 0.1 uL volume. (107!8 moles of DNA are roughly equiv- 
alent to 107 grams (pg) of I-kb sized DNA molecules.) These analytical techniques 
typically rely on fluorescent detection of the DNA products. 


Sanger’s Chain Termination or Dideoxy Method Uses DNA Replication 
To Generate a Defined Set of Polynucleotide Fragments 


To appreciate the rationale of the chain termination or dideoxy method, we first 
must briefly examine the biochemistry of DNA replication. DNA is a double helical 
molecule. In the course of its replication, the sequence of nucleotides in one strand 
is copied in a complementary fashion to form a new second strand by the enzyme 
DNA polymerase. Each original strand of the double helix serves as a template for 
the biosynthesis that yields two daughter DNA duplexes from the parental double 
helix (Figure 11.1). DNA polymerase carries out this reaction in vitro in the pres- 
ence of the four deoxynucleotide monomers and copies single-stranded DNA, pro- 
vided a double-stranded region of DNA is artificially generated by adding a primer. 
This primer is merely an oligonucleotide capable of forming a short stretch of 
dsDNA by base pairing with the ssDNA (Figure 11.2). The primer must have a free 
3'-OH end from which the new polynucleotide chain can grow as the first residue 
is added in the initial step of the polymerization process. DNA polymerases synthe- 
size new strands by adding successive nucleotides in the 5’->3’ direction. 


The Chain Termination Protocol In the chain termination method of DNA se- 
quencing, a DNA fragment of unknown sequence serves as a template in a polymer- 
ization reaction using some type of DNA polymerase, usually a genetically engi- 
neered version that lacks all traces of exonuclease activity that might otherwise 
degrade the DNA. (DNA polymerases usually have an intrinsic exonuclease activity 
that allows proofreading and correction of the DNA strand being synthesized; see 
Chapter 28.) The primer requirement is met by an appropriate oligonucleotide (this 
method is also known as the primed synthesis method for this reason). The reaction 
is run in the presence of all four deoxynucleoside triphosphates dATP, dGTP, dCTP, 
and dTTP, which are the substrates for DNA polymerase (Figure 11.3). In addition, 
the reaction mixture contains the four corresponding 2',3'-dideoxynucleotides 
(ddATP, ddGTP, ddCTP, and ddTTP); it is these dideoxynucleotides that give the 
method its name. 

Because dideoxynucleotides lack 3’-OH groups, they cannot serve as acceptors 
for 5’nucleotide addition in the polymerization reaction; thus, the chain is termi- 
nated where they become incorporated. The concentrations of the deoxynu- 
cleotides in each reaction mixture are significantly greater than the concentrations 
of the dideoxynucleotides, so incorporation of a dideoxynucleotide is infrequent. 
Therefore, base-specific premature chain termination is only a random, occasional 
event, and a population of new strands of varying length is synthesized. Neverthe- 
less, termination, although random, occurs everywhere in the sequence. Thus, the 
population of newly synthesized DNAs forms a nested set of molecules that differ in 
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FIGURE 11.1 DNA replication yields two daughter DNA 
duplexes identical to the parental DNA molecule. 


CENGAGENOW™ ACTIVE FIGURE 11.2 Primed synthesis 
of a DNA template by DNA polymerase, using the four 
deoxynucleoside triphosphates as the substrates. 

Test yourself on the concepts in this figure at 
www.cengage.com/login. 
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CENGAGENOW” ANIMATED FIGURE 11.3 The chain 
termination or dideoxy method of DNA sequencing. 

A template DNA (the single-stranded DNA to be 
sequenced) with a complementary primer annealed at 
its 3’-end is copied by DNA polymerase in the presence 
of the four deoxynucleotide substrates (dATP, dCTP, 
dGTP, dTTP) and small amounts of the four dideoxynu- 
cleotide analogs of these substrates, each of which car- 
ries a distinctive fluorescence tag (illustrated here as 
orange for ddATP, blue for ddCTP, green for ddGTP, and 
red for ddTTP). Occasional incorporation of a dideoxy- 
nucleotide terminates further synthesis of that comple- 
mentary strand. The nested set of terminated strands 
can be separated by capillary electrophoresis and iden- 
tified by laser fluorescence spectroscopy. Test yourself 
on the concepts in this figure at www.cengage.com/ 
login. 
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length by just one nucleotide. Each newly synthesized strand has a dideoxynu- 
cleotide at its 3’-end, and each of the four dideoxynucleotides used in Sanger se- 
quencing is distinctive because each bears a fluorescent tag of a different color. 
(These fluorescent tags are attached to the 5-position of pyrimidine dideoxynu- 
cleotides or the 7-position of purine dideoxynucleotides, where these tags do not 
impair the ability of DNA polymerase to add them to a growing polynucleotide 
chain.) The color of a particular fluorescence (as in orange for ddA, blue for ddC, 
green for ddG, and red for ddT) reveals which base was specified by the template 
and incorporated by DNA polymerase at that spot. 


Reading Dideoxy Sequencing Gels The sequencing products are visualized by 
fluorescence spectroscopy following their separation according to size by capillary 
electrophoresis (Figure 11.3). Because the smallest fragments migrate fastest upon 
electrophoresis and because fragments differing by only a single nucleotide in 
length are readily resolved, the sequence of nucleotides in the set of newly synthe- 
sized DNA fragments is given by the order of the fluorescent colors emerging from 
the capillary. Thus, the gel in Figure 11.3 is read TTGTCGAAGTCAG (5’-3’). Be- 
cause of the way DNA polymerase acts, this observed sequence is complementary to 
the corresponding unknown template sequence. Knowing this, the template se- 
quence now can be written CTGACTTCGACAA (5'-33’). 

Sanger sequencing has been fully automated. Automation is achieved through the 
use of robotics for preparing the samples, running the DNA sequencing reactions, 
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The enormous significance of DNA sequence information to fun- 
damental questions in biology, medicine, and personal health is a 
compelling force for the development of more rapid and efficient 
DNA sequencing technologies, so-called next-generation sequenc- 
ing, or NGS, methods. One important NGS advance is 454 Tech- 
nology, a methodology developed by 454 Life Sciences, a division 
of Roche Company. Like Sanger sequencing, 454 Technology re- 
lies on DNA polymerase-catalyzed copying of a primed single- 
stranded DNA. (However, because 454 Technology does not rely 
on chain termination or creation of a nested set of DNA frag- 
ments, dideoxynucleotide terminators are not needed.) Multiple 
copies of unique single-stranded template DNA molecules paired 
with primer strands are immobilized on microscopic beads that 
can be loaded into micro-microtiter wells at a scale of 1.6 million 
different wells on a 6 cm X 6 cm platform (see accompanying fig- 
ure). Each well receives a unique DNA template. The reagents for 
primed synthesis are passed over the platform in sequential order: 
First, a reaction mixture with DNA polymerase plus dTTP (but no 
other dNTPs), a wash, then a reaction mixture with enzymes but 
only dATP, a wash, then the dGTP-specific mixture, a wash, and fi- 
nally the dCTP mixture and a wash. Such cycles are repeated up to 
100 times over an 8-hour period. Up to 500 cycles are possible in 
one run. A fiber-optic array to monitor light emission from each 
well is aligned with the platform. 

The methodology is based on detection of DNA polymerase ac- 
tion through light emission. To do this, the technology exploits an 
overlooked product of the polymerase reaction, namely, the pyro- 
phosphate released each time a dNTP contributes the correct 
complementary dNMP in the polymerase reaction. Pyrophosphate 
release is coupled to light emission through two reactions. The 
first is catalyzed by ATP sulfurylase, which uses PP; plus adenosine- 
5’-phosphosulfate (APS) to form ATP. The second reaction, cat- 
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High-Throughput DNA Sequencing by the Light of Fireflies 


alyzed by the ATP-dependent firefly enzyme luciferase, oxidizes 
luciferin to form oxyluciferin with the emission of light. 


Reaction 1: 


PP; + APS — ATP + SO,2- 


(This is the reverse of the ATP sulfurylase reaction shown as reac- 
tion 1 in Figure 25.34.) 


Reaction 2: 
ATP + luciferin + O > AMP + PP; + CO, + oxyluciferin + light 
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Light detection confirms that addition of a dNMP by primed syn- 
thesis has occurred. Using computer recording of light emission to 
keep track of when in each cycle each well emitted a pulse of light al- 
lows reconstruction of sequence information for each of 1.6 million 
templates. Using this methodology, the 580,069-nucleotide sequence 
of Mycoplasma genitalium was confirmed in one run on the 454 
Genome Sequencer. (From Margulies, M., et al., 2005. Genome sequenc- 
ing in microfabricated high-density picolitre reactors. Nature 437:376-380.) 
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loading the chain-terminated DNA fragments onto capillary electrophoresis tubes, 
performing the electrophoresis, and imaging the results for computer analysis. These 
advances have made it feasible to sequence the entire genomes of organisms (see 
Chapter 12). Celera Genomics, the private enterprise that reported a sequence for the 
2.91 billion-bp human genome in 2001, used 300 automated DNA sequencers/ 
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(a) The six degrees of freedom in the 
sugar-PO, backbone: 


Rotation about bonds 1, 2, 3, 4, 5, and 6 
correspond to 6 degrees of freedom designated 
a, B, y, 5, £, and ¢ as indicated. 


(b) Four puckered conformations of 
furanose rings: 





FIGURE 11.4 (a) The six degrees of freedom in the 
deoxyribose—PO, units of the polynucleotide chain. 


(b) Four puckered conformations of the furanose rings. 


(c) Free rotation about the C1’—N glycosidic bond. 


analyzers to sequence more than 1 billion bases every month. Today, the more tedious 
aspect of DNA sequencing is the isolation and preparation of DNA fragments of in- 
terest, such as cloned genes; automated sequencing makes the rest routine. 


Te2s What Sorts of Secondary Structures Can 
Double-Stranded DNA Molecules Adopt? 


Conformational Variation in Polynucleotide Strands 


Polynucleotide strands are inherently flexible. Each deoxyribose—phosphate segment 
of the backbone has six degrees of freedom (Figure 11.4a) as a consequence of the 
six successive single bonds per segment along the chain. Furanose rings of pentoses 
are not planar but instead adopt puckered conformations, four of which are shown 
in Figure 11.4b, A seventh degree of freedom per nucleotide unit arises because of 
free rotation about the C1'-N glycosidic bond. This freedom allows the plane of the 
base to rotate relative to the path of the polynucleotide strand (Figure 11.4c). 


DNA Usually Occurs in the Form of Double-Stranded Molecules 


Double-stranded DNA molecules adopt one of three secondary structures, termed A, 
B, and Z. In a moment, we will address the “ABZs of DNA secondary structure’; first 
we must consider some general features of DNA double helices. Fundamentally, 
double-stranded DNA is a regular two-chain structure with hydrogen bonds formed 


(c) Free rotation about C1'-N 
glycosidic bond (7th degree of freedom): 
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between opposing bases on the two chains (see Chapter 10). Such H bonding is pos- 
sible only when the two chains are antiparallel. The polar sugar-phosphate backbones 
of the two chains are on the outside. The bases are stacked on the inside of the struc- 
ture; these heterocyclic bases, as a consequence of their 7-electron clouds, are hydro- 
phobic on their flat sides. One purely hypothetical conformational possibility for a two- 
stranded arrangement would be a ladderlike structure (Figure 11.5) in which the base 
pairs are fixed at 0.6 nm apart because this is the distance between adjacent sugars 
along a polynucleotide strand. Because HyO molecules could fit into the spaces be- 
tween the hydrophobic surfaces of the bases, this conformation is energetically unfa- 
vorable. This ladderlike structure converts to a double helix when given a simple right- 
handed twist. Helical twisting brings the base-pair rungs of the ladder closer together, 
stacking them 0.34 nm apart, without affecting the sugar—sugar distance of 0.6 nm. Be- 
cause this helix repeats itself approximately every 10 bp, its pitch is 3.4 nm. This is the 
major conformation of DNA in solution, and it is called B-DNA. 


Watson-Crick Base Pairs Have Virtually Identical Dimensions 


As indicated in Chapter 10, the base pairing in DNA is size complementary: Large 
bases (purines) pair with small bases (pyrimidines). Hydrogen bond formation be- 
tween purines and pyrimidines dictates that the purine adenine pairs with the 
pyrimidine thymine; the purine guanine pairs with the pyrimidine cytosine. Size 
complementarity means that the A:T pair and G:C pair have virtually identical di- 
mensions (Figure 11.6). Watson and Crick realized that units of such structural 
equivalence could serve as spatially invariant substructures to build a polymer whose 
exterior dimensions would be uniform along its length, regardless of the sequence 
of bases. That is, the pairing of smaller pyrimidines with larger purines everywhere 
across the double-stranded molecule allows the two polynucleotide strands to as- 
sume essentially identical helical conformations. 


The DNA Double Helix Is a Stable Structure 


Several factors account for the stability of the double helical structure of DNA. 


H Bonds Although it has long been emphasized that the two strands of DNA are 
held together by H bonds formed between the complementary purines and pyrim- 
idines, two in an A:T pair and three in a G:C pair (Figure 11.6), the H bonds be- 
tween base pairs impart little net stability to the double-stranded structure compared 
to the separated strands in solution. When the two strands of the double helix are 
separated, the H bonds between base pairs are replaced by H bonds between indi- 
vidual bases and surrounding water molecules. Polar atoms in the sugar—phosphate 
backbone do form external H bonds with surrounding water molecules, but these 
form with separated strands as well. 


Electrostatic Interactions A prominent feature of the backbone of a DNA 
strand is the repeating array of negatively charged phosphate groups. These ar- 
rays of negative charge along the strands repel each other so that their 
sugar—phosphate backbones are kept apart and the two strands come together 
through Watson—Crick base pairing. As a consequence, the negative charges are 
situated on the exterior surface of the double helix, such that repulsive effects are 
minimized. Further these charges become electrostatically shielded from one an- 
other because divalent cations, particularly Mg?*, bind strongly to the anionic 
phosphates. 


Van der Waals and Hydrophobic Interactions The core of the helix consists of 
the base pairs, and these base pairs stack together through 7, a-electronic interac- 
tions (a form of van der Waals interaction), and hydrophobic forces. These base- 
pair stacking interactions range from —16 to —51 kJ/mol (expressed as the energy 
of interaction between adjacent base pairs), contributing significantly to the overall 
stabilizing energy. 
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FIGURE 11.5 (a) Double-stranded DNA as an imaginary 
ladderlike structure. (b) A simple right-handed twist 
converts the ladder to a helix. 
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FIGURE 11.6 Watson—Crick A:T and G:C base pairs. All 
H bonds in both base pairs are straight. 


A stereochemical consequence of the way A:T and G:C base pairs form is that the 
sugars of the respective nucleotides have opposite orientations. This is why the 
sugar—phosphate backbones of the two chains run in opposite or “antiparallel” direc- 
tions. Furthermore, the two glycosidic bonds holding the bases in each base pair are 
not directly across the helix from each other, defining a common diameter (Figure 
11.7). Consequently, the sugar—-phosphate backbones of the helix are not equally 
spaced along the helix axis and the grooves between them are not the same size. In- 
stead, the intertwined chains create a major groove and a minor groove (Figure 11.7). 


B-DNA Top view 


Radius of 
sugar—phosphate 
backbone 





FIGURE 11.7 The major and minor grooves of B-DNA. 
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The edges of the base pairs have a specific relationship to these grooves. The “top” 
edges of the base pairs (“top” as defined by placing the glycosidic bond at the bottom, 
as in Figure 11.7) are exposed along the interior surface or “floor” of the major 
groove; the base-pair edges nearest to the glycosidic bond form the interior surface of 
the minor groove. Some proteins that bind to DNA can actually recognize specific nu- 
cleotide sequences by “reading” the pattern of H-bonding possibilities presented by 
the edges of the bases in these grooves. Such DNA-protein interactions provide one 
step toward understanding how cells regulate the expression of genetic information 
encoded in DNA (see Chapter 29). 


Double Helical Structures Can Adopt a Number of Stable Conformations 


In solution, DNA ordinarily assumes the familar structure we have been discussing: 
B-DNA. However, nucleic acids also occur naturally in other double helical forms. 
The base-pairing arrangement remains the same, but the inherently flexible sugar— 
phosphate backbone can adopt different conformations. Base-pair rotations are an- 
other kind of conformational variation. Helical twist is the rotation (around the axis 
of the double helix) of one base pair relative to the next (Figure 11.8a). Successive 
base pairs in B-DNA show a mean rotation of 36° with respect to each other. Pro- 
pellor twist involves rotation around a different axis, namely, an axis perpendicular 
to the helix axis (Figure 11.8b). Propellor twist allows greater overlap between suc- 
cessive bases along a strand of DNA and diminishes the area of contact between 
bases and solvent water. 


A-Form DNA Is an Alternative Form of Right-Handed DNA 


An alternative form of the right-handed double helix is A-DNA. A-DNA molecules dif- 
fer from B-DNA molecules in a number of ways. The pitch, or distance required to 
complete one helical turn, is different. In B-DNA, it is 3.4 nm, whereas in 
A-DNA it is 2.46 nm. One turn in A-DNA requires 11 bp to complete. Depending on 
local sequence, 10 to 10.6 bp define one helical turn in B-form DNA. In A-DNA, the 
base pairs are no longer nearly perpendicular to the helix axis but instead are tilted 
19° with respect to this axis. Successive base pairs occur every 0.23 nm along the axis, 
as opposed to 0.332 nm in B-DNA. The B-form of DNA is thus longer and thinner than 
the short, squat A-form, which has its base pairs displaced around, rather than cen- 
tered on, the helix axis. Figure 11.9 and Table 11.1 show the relevant structural char- 
acteristics of the A- and B-forms of DNA. (Z-DNA, another form of DNA to be dis- 
cussed shortly, is also depicted in Figure 11.9 and Table 11.1.) A comparison of the 
structural properties of A-, B-, and Z-DNA is summarized in Table 11.1. 

Relatively dehydrated DNA fibers can adopt the A-conformation, and DNA may 
be in the A-form in dehydrated structures, such as bacterial and fungal spores. The 
pentose conformation in A-DNA is 3'-endo, as opposed to 2'-endo in B-DNA. Dou- 
ble helical DNA:RNA hybrids have an A-like conformation. The 2'’-OH in RNA 
sterically prevents double helical regions of RNA chains from adopting the 
B-form helical arrangement. Importantly, double-stranded regions in RNA chains 
often assume an A-like conformation, with their bases strongly tilted with respect to 
the helix axis. 


Z-DNA Is a Conformational Variation in the Form of a Left-Handed 
Double Helix 


Z-DNA was first discovered when X-ray analysis of crystals of the synthetic deoxynu- 
cleotide d(CpGpCpGpCpG revealed an antiparallel double helix of unexpected con- 
formation. The alternating pyrimidine—purine (Py—Pu) sequence of this oligonu- 
cleotide is the key to its unusual properties. The N-glycosyl bonds of G residues in 
this alternating copolymer are rotated 180° with respect to their conformation in 
B-DNA, so now the purine ring is in the syn rather than the anti conformation (Fig- 
ure 11.10). The C residues remain in the anti form. Because the G ring is “flipped,” 
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a) (2) 
Propellor twist, as in (2), allows greater overlap 
of successive bases along the same strand and 
reduces the area of contact between the bases 
and water. 


(c) 








Propellor-twisted base pairs. Note how the 
hydrogen bonds between bases are distorted by this 
motion, yet remain intact. The minor-groove edges 
of the bases are shaded gray. 


FIGURE 11.8 Helical twist and propellor twist in DNA. 
(a) Successive base pairs in B-DNA show a rotation with 
respect to each other. (b) Rotation in a different 
dimension—propellor twist—allows the hydrophobic 
surfaces of bases to overlap better. Dots represent axes 
perpendicular to the helix axis. The view is from the 
sugar-P backbone. (c) Each of the bases in a base pair 
shows positive propellor twist (a clockwise rotation from 
the horizontal, as viewed along the N-glycosidic bond, 
from the pentose C1’ to the base). (Adapted from Figure 
3.4 in Callandine, C. R., and Drew, H.R., 1992. Understanding DNA: 
The Molecule and How It Works. London: Academic Press.) 
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FIGURE 11.9 Comparison of the A-, B-, and Z-forms of the DNA double helix. The A- and B-structures show 12 bp 
of DNA; the Z-structures, 6 bp. The middle Z-structure shows just one strand of a Z-DNA double helix to illustrate 
better the left-handed zigzag path of the polynucleotide backbones in Z-DNA. (The light blue line was added to 
show the imaginary zigzag path.) A-DNA: pdb id = 2D47, B-DNA: pdb id = 355D, Z-DNA: pdb id = 1DCG. 


the C ring must also flip to maintain normal Watson—Crick base pairing. However, 
pyrimidine nucleosides do not readily adopt the syn conformation because it cre- 
ates steric interference between the pyrimidine C-2 oxy substituent and atoms of the 
pentose. Because the cytosine ring does not rotate relative to the pentose, the whole 
C nucleoside (base and sugar) must flip 180° (Figure 11.11). It is topologically pos- 
sible for the G to go syn and the C nucleoside to undergo rotation by 180° without 
breaking and re-forming the G:C hydrogen bonds. In other words, the B-to-Z struc- 
tural transition can take place without disrupting the bonding relationships among 
the atoms involved. 
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TABLE 11.1 














Double Helix Type 
A B Z 
Overall proportions Short and broad Longer and thinner Elongated and slim 
Rise per base pair 2.3 A 3.32 A + 0.19 A 3.8 A 
Helix packing diameter 25.5A 23.7 A 18.4A 
Helix rotation sense Right-handed Right-handed Left-handed 
Base pairs per helix repeat 1 1 2 
Base pairs per turn of helix oe | ~10 12 
Mean rotation per base pair 33.6° 35.9° + 4.2° —60°/2 
Pitch per turn of helix 24.6 A 33.2 A 45.6 A 
Base-pair tilt from the perpendicular +19° =[.2? 2 4,1? =g: 
Base-pair mean propeller twist +18° +16° + 7° ~0° 
Helix axis location Major groove Through base pairs Minor groove 
Major groove proportions Extremely narrow but very Wide and with Flattened out on helix 
deep intermediate depth surface 
Minor groove proportions Very broad but shallow Narrow and with Extremely narrow but very 
intermediate depth deep 
Glycosyl bond conformation anti anti anti at C, syn at G 








Adapted from Dickerson, R.L., et al., 1983. Helix geometry and hydration in A-DNA, B-DNA, and Z-DNA. Cold Spring Harbor Symposium on Quantitative Biology 47:13-24. 





FIGURE 11.10 Comparison of the deoxyguanosine con- 
Deoxyguanosine in Z-DNA (syn position) formation in B- and Z-DNA. 


B-DNA 





Z-DNA 


FIGURE 11.11 The change in topological relationships 
of base pairs from B- to Z-DNA. A six-base-pair 
GCGCGC segment of B-DNA (1) is converted to 
Z-DNA (2) through rotation of the base pairs, as indi- 
cated by the curved arrows. The purine rings (green) 

B-DNA of the deoxyguanosine nucleosides rotate via an anti 
to syn change in the conformation of the guanine- 
deoxyribose glycosidic bond; the pyrimidine rings 
(blue) are rotated by flipping the entire deoxycyto- 
sine nucleoside (base and deoxyribose). 
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Because alternate nucleotides assume different conformations, the repeating unit 
on a given strand in the Z-helix is the dinucleotide. That is, for any number of bases, 
n, along one strand, n— 1 dinucleotides must be considered. For example, a 
GpCpGpC subset of sequence along one strand is composed of three successive dinu- 
cleotide units: GpC, CpG, and GpC. (In A- and B-DNA, the nucleotide conformations 
are essentially uniform and the repeating unit is the mononucleotide.) It follows that 
the CpG sequence is distinct conformationally from the GpC sequence along the al- 
ternating copolymer chains in the Z-double helix. The conformational alterations go- 
ing from B to Z realign the sugar—phosphate backbone along a zigzag course that has 
a left-handed orientation (Figure 11.9), thus the designation Z-DNA. Note that in any 
GpCpGp subset, the sugar-phosphates of GpC form the horizontal “zig” while the 
CpG backbone segment forms the vertical “zag.” The mean rotation angle circum- 
scribed around the helix axis is —15° for a CpG step and —45° for a GpC step (giving 
—60° for the dinucleotide repeat). The minus sign denotes a left-handed or counter- 
clockwise rotation about the helix axis. Z-DNA is more elongated and slimmer than 
B-DNA. 


Cytosine Methylation and Z-DNA The Z-form can arise in sequences that are not 
strictly alternating Py-Pu. For example, the hexanucleotide ™CGAT™CG, a Py-Pu- 
Pu-Py-Py-Pu sequence containing two 5-methylcytosines (™C), crystallizes as Z-DNA. 
Indeed, the in vivo methylation of C at the 5-position is believed to favor a B-to-Z 
switch because, in B-DNA, these hydrophobic methyl groups would protrude into 
the aqueous environment of the major groove, a destabilizing influence. In Z-DNA, 
the same methyl groups can form a stabilizing hydrophobic patch. It is likely that 
the Z-conformation naturally occurs in specific regions of cellular DNA, which oth- 
erwise is predominantly in the B-form. Furthermore, because methylation is impli- 
cated in gene regulation, the occurrence of Z-DNA may affect the expression of ge- 
netic information (see Part 4). 


The Double Helix Is a Very Dynamic Structure 


The long-range structure of B-DNA in solution is not a rigid, linear rod. Instead, DNA 
behaves as a dynamic, flexible molecule. Localized thermal fluctuations temporarily 
distort and deform DNA structure over short regions. Base and backbone ensembles 
of atoms undergo elastic motions on a time scale of nanoseconds. To some extent, 
these effects represent changes in rotational angles of the bonds comprising the 
polynucleotide backbone. These changes are also influenced by sequence-dependent 
variations in base-pair stacking. The consequence is that the helix bends gently. When 
these variations are summed over the great length of a DNA molecule, these bending 
influences give the double helix a roughly spherical shape, as might be expected for 
a long, semirigid rod undergoing apparently random coiling. It is also worth noting 
that, on close scrutiny, the surface of the double helix is not that of a totally feature- 
less, smooth, regular “barber pole” structure. Different base sequences impart their 
own special signatures to the molecule by subtle influences on such factors as the 
groove width, the angle between the helix axis and base planes, and the mechanical 
rigidity. Certain regulatory proteins bind to specific DNA sequences and participate 
in activating or suppressing expression of the information encoded therein. These 
proteins bind at unique sites by virtue of their ability to recognize novel structural 
characteristics imposed on the DNA by the local nucleotide sequence. 


Intercalating Agents Distort the Double Helix Aromatic macrocycles, flat hydro- 
phobic molecules composed of fused, heterocyclic rings, such as ethidium bromide, 
acridine orange, and actinomycin D (Figure 11.12), can slip between the stacked 
base pairs of DNA. The bases are forced apart to accommodate these so-called 
intercalating agents, causing an unwinding of the helix to a more ladderlike struc- 
ture. The deoxyribose—phosphate backbone is almost fully extended as successive 
base pairs are displaced 0.7 nm from one another, and the rotational angle about 
the helix axis between adjacent base pairs is reduced from 36° to 10°. 
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Dynamic Nature of the DNA Double Helix in Solution Intercalating substances in- 
sert with ease into the double helix, indicating that the van der Waals stacking in- 
teractions that they share with the bases sandwiching them are more favorable than 
similar interactions between the bases themselves. Furthermore, the fact that these 
agents slip in suggests that the double helix must momentarily unwind and present 
gaps for these agents to occupy. That is, the DNA double helix in solution must be 
represented by a set of metastable alternatives to the standard B-conformation. 
These alternatives constitute a flickering repertoire of dynamic structures. 


Alternative Hydrogen-Bonding Interactions Give Rise to Novel 
DNA Structures: Cruciforms, Triplexes and Quadruplexes 


Cruciform Structures Arise from Inverted Repeats Inverted repeats (Figure 
11.13) are duplex DNA sequences showing twofold symmetry (the 5'—>3' sequence 
is identical in both strands). Palindromes are words, phrases, or sentences that read 
the same backward or forward, such as “radar,” “sex at noon taxes,” “Madam, I’m 
Adam,” and “a man, a plan, a canal, Panama.” Inverted repeats are sometimes re- 
ferred to as palindromes (despite the inaccuracy of this description). Inverted re- 
peats have the potential to adopt cruciform (meaning “cross-shaped”) structures if 
the normal interstrand base pairing is replaced by intrastrand pairing. In effect, 
each strand forms a hairpin structure through alignment and pairing of the self- 
complementary sequences along the strand. Cruciforms are never as stable as nor- 
mal DNA duplexes because an unpaired segment must exist in the loop region. Cru- 
ciforms potentially create novel structures that can serve as distinctive recognition 
sites for specific DNA-binding proteins. 
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FIGURE 11.12 The structures of ethidium bromide, acri- 


dine orange, and actinomycin D, three intercalating 


agents, and their effects on DNA structure. 
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FIGURE 11.13 Self-complementary inverted repeats can 


Structure of Nucleic Acids 


rearrange to form hydrogen-bonded cruciform stem- 


loop structures. 


FIGURE 11.14 Hoogsteen base pairs: A:T (left) and Ct :G 


(right). 
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Hoogsteen Base Pairs and DNA Multiplexes The A:T and G:C base pairs first seen 
by Watson (Figure 11.6) are the canonical building blocks for DNA structures. How- 
ever, Karst Hoogsteen found that adenine and thymine do not pair in this way when 
crystallized from aqueous solution. Instead, they form two H bonds in a different 
arrangement (Figure 11.14). Further, Hoogsteen observed that, in mildly acidic solu- 
tions, guanine and cytosine form base pairs different from Watson-Crick G:C base 
pairs. These Hoogsteen base pairs depend upon protonation of cytosine N-3 (Figure 
11.14) and have only two H bonds, not three. In both A:T and G:C Hoogsteen base 
pairs, the purine N-7 atom is an H-bond acceptor. The functional groups of adenine 
and guanine that participate in Watson—Crick H bonds remain accessible in Hoog- 
steen base pairs. Thus, base triplets can form, as shown in Figure 11.15, giving rise 
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FIGURE 11.15 Base triplets formed when a purine interacts with one pyrimidine by Hoogsteen base pairing 
and another by Watson-Crick base pairing. 
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to TAT and a C*GC triplets, where each purine interacts with one of its pyrimidine 
partners through Hoogsteen base pairing and the other through Watson—Crick base 
pairing. 


H-DNA Is Triplex DNA Under certain conditions, triple-stranded DNA struc- 
tures can form. In H-DNA, two of the strands are pyrimidine-rich and the third 
is purine-rich. One pyrimidine-rich strand is hydrogen bonded to the purine-rich 
strand via Watson—Crick base pairing, and the other pyrimidine-rich strand is 
hydrogen bonded to the purine-rich strand by Hoogsteen base pairing. Such 
structures were originally referred to as H-DNA, because protonation of the cy- 
tosine N-3 atom was necessary, but the name also fits because a hinge is present 
between double- and triple-stranded DNA regions when H-DNA forms. Consider, 
for example, a long stretch of alternating C:T sequence in one strand of a DNA 
duplex (Figure 11.16). If the C:T bases in half of this stretch separated from 
their G:A partners and the unpaired C:T segment folded back on the C:T half 
still paired in the C:T/G:A duplex, triplex DNA could form through Hoogsteen 
base pairing. Triple-stranded DNA is implicated in the regulation of some eu- 
karyotic genes. 


DNA Quadruplex Structures Four-stranded DNA structures can form between 
polynucleotide strands rich in guanine. At the heart of such G-quadruplexes are 
cyclic arrays of four G residues united through Hoogsteen base pairing (Figure 
11.17a). The presence of metal cations (K+, Na*, Ca?*) favors their assembly. Free- 
electron pairs contributed by the closely spaced O6 carbonyl oxygens of the 
G-quartet coordinate the centrally located cation. A variety of different G-quadruplex 
structures have been reported, with different G-rich sequences leading to variations 
on a common quadruplex plan. Quadruplexes constructed from dG, strands usually 
form with all four strands in parallel orientation and all bases in the anti conforma- 
tion (Figure 11.17b). Polynucleotides with varying sequence repeats, such as (G3N), 
or (GpNp) „ form G-quadruplexes with variations on the dG, structural theme, such as 
the (dG4T4G4)o structure in which two such strands pair in antiparallel fashion to 
form the G-quadruplex (Figure 11.17¢ and 11.17d). G-quadruplex structures have 
biological significance because they have been found in telomeres (structures that 
define the ends of chromosomes), in regulatory regions of genes, in immunoglobu- 
lin gene regions responsible for antibody diversity, and in sequences associated with 
human diseases. 
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FIGURE 11.16 H-DNA. (a) T 


he pyrimidine-rich strands of 


the duplex regions are blue, and the purine-rich strands 


are green. The Hoogsteen 
strand in the triplex (H-D 


base-paired pyrimidine-rich 
A) structure is yellow. 


(b) Nucleotide sequence representation of H-DNA 


formation.T: A Hoogsteen 
formation is shown by do 
ing leading to triplex form 


base pairing leading to triplex 
s; Ct-G Hoogsteen base pair- 
ation is shown by + signs. 


(Adapted from Htun, H., and Dahlberg, J. E., 1989. Topology and 





formation of triple-stranded H 


-DNA. Science 243:1571-1576.) 
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(a) 


(c) 


FIGURE 11.17 (a) G-quadruplex showing the 
cyclic array of guanines linked by Hoogsteen 
hydrogen bonding. (b) Four G-rich polynucleo- 
tide strands in parallel alignment with all bases 
in anti conformation. (c) Antiparallel dimeric 
hairpin quadruplex formed from d(G4T4Gq)2. (d) 
Structure of d(G4T4G,4)2K* solved by X-ray crystal- 
lography. Two d(G,T,G,) strands come together 
as hairpins to form a G-quadruplex . The back- 
bones of the two strands are traced in violet. 
(Adapted from Keniry, M. A., 2001. Quadruplex structures 
in nucleic acids. Biopolymers 56:123-146.) 
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‘Idee Can the Secondary Structure of DNA Be Denatured 
and Renatured? 


Thermal Denaturation of DNA Can Be Observed by Changes 
in UV Absorbance 


When duplex DNA molecules are subjected to conditions of pH, temperature, or 
ionic strength that disrupt base-pairing interactions, the strands are no longer held 
together. That is, the double helix is denatured, and the strands separate as indi- 
vidual random coils. If temperature is the denaturing agent, the double helix is said 
to melt. The course of this dissociation can be followed spectrophotometrically be- 
cause the relative absorbance of the DNA solution at 260 nm increases as much as 
40% as the bases unstack. This absorbance increase, or hyperchromic shift, is due 
to the fact that the aromatic bases in DNA interact via their 77-electron clouds when 
stacked together in the double helix. Because the UV absorbance of the bases is a 
consequence of 7r-electron transitions, and because the potential for these transi- 
tions is diminished when the bases stack, the bases in duplex DNA absorb less 
260-nm radiation than expected for their numbers. Unstacking alleviates this sup- 
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pression of UV absorbance. The rise in absorbance coincides with strand separa- 
tion, and the midpoint of the absorbance increase is termed the melting tempera- 
ture, Tm (Figure 11.18). DNAs differ in their Tm values because they differ in rela- 
tive G + C content. The higher the G + C content of a DNA, the higher its melting 
temperature because G:C pairs have higher base stacking energies than A:T pairs. 
Also, Tm is dependent on the ionic strength of the solution; the lower the ionic 
strength, the lower the melting temperature. Because cations suppress the electro- 
static repulsion between the negatively charged phosphate groups in the comple- 
mentary strands of the double helix, the double-stranded form of DNA is more sta- 
ble in dilute salt solutions. DNA in pure water melts even at room temperature. 


pH Extremes or Strong H-Bonding Solutes also Denature DNA Duplexes 


At pH values greater than 10, the bases of DNA become deprotonated, which de- 
stroys their base-pairing potential, thus denaturing the DNA duplex. Extensive pro- 
tonation of the bases below pH 2.3 also disrupts base pairing. Alkali is the preferred 
denaturant because, unlike acid, it does not hydrolyze the glycosidic bonds linking 
purine bases to the sugar-phosphate backbone. Small solutes that readily form H 
bonds can also denature duplex DNA at temperatures below Tm, If present in suffi- 
ciently high concentrations, such small solutes will form H bonds with the bases, 
thereby disrupting H-bonding interactions between the base pairs. Examples in- 
clude formamide and urea. 


Single-Stranded DNA Can Renature to Form DNA Duplexes 


Denatured DNA will renature to re-form the duplex structure if the denaturing con- 
ditions are removed (that is, if the solution is cooled, the pH is returned to neutral- 
ity, or the denaturants are diluted out). Renaturation requires reassociation of the 
DNA strands into a double helix, a process termed reannealing. For this to occur, the 
strands must realign themselves so that their complementary bases are once again in 
register and the helix can be zippered up (Figure 11.19). Renaturation is dependent 
on both DNA concentration and time. Many of the realignments are imperfect, and 
thus the strands must dissociate again to allow for proper pairings to be formed. The 
process occurs more quickly if the temperature is warm enough to promote diffusion 
of the large DNA molecules but not so warm as to cause melting. 


The Rate of DNA Renaturation Is an Index of DNA Sequence Complexity 


The renaturation rate of DNA is an excellent indicator of the sequence complexity of 
DNA. For example, the DNA of bacteriophage T4 contains 2 X 10° base pairs; an 
Escherichia coli cell contains more than ten times as much (4.64 X 10° base pairs). 


FIGURE 11.18 Heat denaturation of DNA from various 
sources, so-called melting curves. (From Marmur, J., 1959. 
Heterogenity in deoxyribonucleic acids. Nature 183:1427-1429.) 
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A DEEPER LOOK 


The Buoyant Density of DNA 


Density gradient ultracentrifugation is a variant of the basic tech- 
nique of ultracentrifugation (discussed in the Appendix to Chap- 
ter 5). The densities of DNAs are about the same as those of con- 
centrated solutions of cesium chloride, CsCl (1.6 to 1.8 g/mL). 
Centrifugation of CsCl solutions at very high rotational speeds, 
where the centrifugal force becomes 10° times stronger than the 
force of gravity, causes the formation of a density gradient within 
the solution. This gradient is the result of a balance that is estab- 
lished between the sedimentation of the salt ions toward the bot- 
tom of the tube and their diffusion upward toward regions of 
lower concentration. If DNA is present in the centrifuged CsCl 
solution, it moves to a position of equilibrium in the gradient 
equivalent to its buoyant density (as shown in the figure). For this 
reason, this technique is also called isopycnic centrifugation. 

Cesium chloride centrifugation is an excellent means of re- 
moving RNA and proteins in the purification of DNA. The density 
of DNA is typically slightly greater than 1.7 g/cm*, whereas the 
density of RNA is more than 1.8 g/cm’. Proteins have densities less 
than 1.3 g/cm’. In CsCl solutions of appropriate density, the DNA 
bands near the center of the tube, RNA pellets to the bottom, and 
the proteins float near the top. Single-stranded DNA is denser 
than double helical DNA. The irregular structure of randomly 
coiled ssDNA allows the atoms to pack together through van der 
Waals interactions. These interactions compact the molecule into 
a smaller volume than that occupied by a hydrogen-bonded dou- 
ble helix. 


Cell extract 


Mix CsCl solution 
and cell extract and 


place in centrifuge. Centrifuge at 
high speed for 


~48 hours. 
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CsCl solution 
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Molecules move to positions 
where their density equals that 
of the CsCl solution. 


The net movement of solute particles in an ultracentrifuge is 
the result of two processes: diffusion (from regions of higher con- 
centration to regions of lower concentration) and sedimentation 
due to centrifugal force (in the direction away from the axis of ro- 
tation). In general, diffusion rates for molecules are inversely pro- 
portional to their molecular weight—larger molecules diffuse 
more slowly than smaller ones. On the other hand, sedimentation 
rates increase with increasing molecular weight. A macromolecu- 
lar species that has reached its position of equilibrium in isopycnic 
centrifugation has formed a concentrated band of material. 

Essentially three effects are influencing the movement of the 
molecules in creating this concentration zone: (1) diffusion away 
to regions of lower concentration, (2) sedimentation of molecules 
situated at positions of slightly lower solution density in the den- 
sity gradient, and (3) flotation (buoyancy or “reverse sedimenta- 
tion”) of molecules that have reached positions of slightly greater 
solution density in the gradient. The consequence of the physics 
of these effects is that, at equilibrium, the width of the concentration 
band established by the macromolecular species is inversely proportional to 
the square root of its molecular weight. That is, a population of large 
molecules will form a concentration band that is narrower than 
the band formed by a population of small molecules. For example, 
the bandwidth formed by dsDNA will be less than the bandwidth 
formed by the same DNA when dissociated into ssDNA. 


Proteins and nucleic acids 
absorb UV light. The positions 
of these molecules within the 
centrifuge can be determined 
by ultraviolet optics. 
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E. coli DNA is considerably more complex in that it encodes more information. Ex- 
pressed in another way, for any fixed amount of single-stranded DNA (in grams), the 
nucleotide sequences represented in an E. coli sample will show greater sequence 
variation than those in an equal weight of phage T4 DNA. Thus, it will take longer for 
the E. coli DNA strands to find their complementary partners and reanneal. Because 
the rate of DNA duplex formation depends on complementary DNA sequences en- 
countering one another and beginning the process of sequence alignment and rean- 
nealing, the time necessary for reconstituting double-stranded DNA molecules is an 
excellent index of the degree of sequence complementarity in a DNA sample. 


Nucleic Acid Hybridization: Different DNA Strands of Similar Sequence 
Can Form Hybrid Duplexes 


If DNA from two different species are mixed, denatured, and allowed to cool slowly 
so that reannealing can occur, hybrid duplexes may form, provided the DNA from 
one species is similar in nucleotide sequence to the DNA of the other. The degree 
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of hybridization is a measure of the sequence similarity or relatedness between the 
two species. Depending on the conditions of the experiment, about 25% of the 
DNA from a human forms hybrids with mouse DNA, implying that some of the nu- 
cleotide sequences (genes) in humans are very similar to those in mice (Figure 
11.20). Mixed RNA:DNA hybrids can be created in vitro if single-stranded DNA is 
allowed to anneal with RNA copies of itself, such as those formed when genes are 
transcribed into mRNA molecules. 

Nucleic acid hybridization is a commonly employed procedure in molecular biol- 
ogy. First, it can reveal evolutionary relationships. Second, it gives researchers the 
power to identify specific genes selectively against a vast background of irrelevant ge- 
netic material: An appropriately labeled oligonucleotide or polynucleotide, referred 
to as a probe, is constructed so that its sequence is complementary to a target gene. 
The probe specifically base pairs with the target gene, allowing identification and sub- 
sequent isolation of the gene. Also, the quantitative expression of genes (in terms of 
the amount of mRNA synthesized) can be assayed by hybridization experiments. 


‘Tde4s Can DNA Adopt Structures of Higher Complexity? 


DNA can adopt regular structures of higher complexity in several ways. For example, 
many DNA molecules are circular. Most, but not all, bacterial chromosomes are co- 
valently closed, circular DNA duplexes, as are most plasmid DNAs. Plasmids are nat- 
urally occurring, self-replicating, extrachromosomal DNA molecules found in bacte- 
ria; plasmids carry genes specifying novel metabolic capacities advantageous to the 
host bacterium. Various animal virus DNAs are circular as well. 


Supercoils Are One Kind of Structural Complexity in DNA 


In duplex DNA, the two strands are wound about each other once every 10 bp, that 
is, once every turn of the helix. Double-stranded circular DNA (or linear DNA du- 
plexes whose ends are not free to rotate) form supercoils if the strands are under- 
wound (negatively supercoiled) or overwound (positively supercoiled) (Figure 11.21). Un- 
derwound duplex DNA has fewer than the normal number of turns, whereas 
overwound DNA has more. DNA supercoiling is analogous to twisting or untwisting 
a two-stranded rope so that it is torsionally stressed. Negative supercoiling intro- 
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Renatured DNA 


FIGURE 11.19 Steps in the thermal denaturation and re- 
naturation of DNA. The nucleation phase of the reaction 
is a second-order process depending on sequence 
alignment of the two strands (1). This process takes 
place slowly because it takes time for complementary 
sequences to encounter one another in solution and 
then align themselves in register. Once the sequences 
are aligned, the strands zipper up quickly (2). 





FIGURE 11.20 Solutions of human DNA (red) and mouse 
DNA (blue) are mixed and denatured, and the single 
strands are allowed to reanneal. About 25% of the human 
DNA strands form hybrid duplexes (one red and one blue 
strand) with mouse DNA. 
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FIGURE 11.21 Toroidal and interwound varieties of supercoiling. (a) The DNA is coiled in a spiral fashion about 
an imaginary toroid (yellow circle). (b) The DNA interwinds and wraps about itself. (c) Supercoils in long, linear 
DNA arranged into loops whose ends are restrained—a model for chromosomal DNA. (Adapted from Figures 6.1 
and 6.2 in Callandine, C. R., and Drew, H. R., 1992. Understanding DNA: The Molecule and How It Works. London: Academic Press.) 


duces a torsional stress that favors unwinding of the right-handed B-DNA double he- 
lix, whereas positive supercoiling overwinds such a helix. Both forms of supercoil- 
ing compact the DNA so that it sediments faster upon ultracentrifugation or 
migrates more rapidly in an electrophoretic gel in comparison to relaxed DNA 
(DNA that is not supercoiled). Cellular DNA is almost always negatively supercoiled 
(underwound). 


Linking Number The basic parameter characterizing supercoiled DNA is the linking 
number (L). This is the number of times the two strands are intertwined, and provided 
both strands remain covalently intact, L cannot change. In a relaxed circular DNA du- 
plex of 400 bp, Lis 40 (assuming 10 bp per turn in B-DNA). The linking number for 
relaxed DNA is usually taken as the reference parameter and is written as Lo. L can be 
equated to the twist (T) and writhe (W) of the duplex, where twist is the number of 
helical turns and writhe is the number of supercoils: 


L=T+ W 


Figure 11.22 shows the values of T and W for a simple striped circular tube in var- 
ious supercoiled forms. In any closed, circular DNA duplex that is relaxed, W= 0. 
A relaxed circular DNA of 400 bp has 40 helical turns, T= L = 40. This linking 
number can be changed only by breaking one or both strands of the DNA, winding 
them tighter or looser, and rejoining the ends. Enzymes capable of carrying out 
such reactions are called topoisomerases because they change the topological state 
of DNA. Topoisomerases fall into two basic classes: I and II. Topoisomerases of the 
I type cut one strand of a DNA double helix, pass the other strand through, and 
then rejoin the cut ends. Topoisomerase II enzymes cut both strands of a dsDNA, 
pass a region of the DNA duplex between the cut ends, and then rejoin the ends 
(Figure 11.23). Topoisomerases are important players in DNA replication (see 
Chapter 28). 


DNA Gyrase The bacterial enzyme DNA gyrase is a topoisomerase that introduces 
negative supercoils into DNA in the manner shown in Figure 11.23. Suppose DNA 
gyrase puts four negative supercoils into the 400-bp circular duplex, then W= —4, 
T remains the same, and L = 36 (Figure 11.24). In actuality, the negative supercoils 
cause a torsional stress on the molecule, so T tends to decrease; that is, the helix be- 
comes a bit unwound, so base pairs are separated. The extreme would be that T 
would decrease by 4 and the supercoiling would be removed (T = 36, L = 36, and 
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FIGURE 11.22 Supercoil topology for a simple circular tube with a single stripe along it. (Adapted from Figures 6.5 
and 6.6 in Callandine, C. R., and Drew, H. R., 1992. Understanding DNA: The Molecule and How It Works. London: Academic Press.) 


W= 0). That is, negative supercoiling has the potential to cause localized unwind- 
ing of the DNA double helix so that single-stranded regions (or bubbles) are cre- 
ated (Figure 11.24). Usually the real situation is a compromise in which the nega- 
tive value of Wis reduced, T decreases slightly, and these changes are distributed 
over the length of the circular duplex so that no localized unwinding of the helix 
ensues. Nevertheless, negative supercoiling makes it easier to separate the DNA 
strands and access the information encoded by the nucleotide sequence. 


Superhelix Density The difference between the linking number of a DNA and 
the linking number of its relaxed form is AL = (L — Lo). In our example with four 
negative supercoils, AL = —4. The superhelix density or specific linking difference 
is defined as AL/Lo and is sometimes termed sigma, o. For our example, 
o = —4/40, or —0.1. As a ratio, a is a measure of supercoiling that is independent 
of length. Its sign reflects whether the supercoiling tends to unwind (negative o) or 
overwind (positive o) the helix. In other words, the superhelix density states the 
number of supercoils per 10 bp, which also is the same as the number of supercoils 
per B-DNA repeat. Circular DNA isolated from natural sources is always found in 
the underwound, negatively supercoiled state. 


Toroidal Supercoiled DNA Negatively supercoiled DNA can arrange into a 
toroidal state (Figure 11.25). The toroidal state of negatively supercoiled DNA is sta- 
bilized by wrapping around proteins that serve as spools for the DNA “ribbon.” This 
toroidal conformation of DNA is found in protein-DNA interactions that are the 
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FIGURE 11.23 A model for the action of bacterial DNA 
gyrase (topoisomerase Il). The A-subunits cut the DNA 
duplex (1) and then hold onto the cut ends (2). Confor- 
mational changes in the enzyme allow an intact region 
of the DNA duplex to pass between the cut ends. The 
cut ends are religated (3), and the covalently complete 
DNA duplex is released from the enzyme. The circular 
DNA now contains two negative supercoils (4). 
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FIGURE 11.24 A 400-bp circular DNA molecule in differ- 
ent topological states: (a) relaxed, (b) negative super- 
coils distributed over the entire length, and (c) negative 
supercoils creating a localized single-stranded region. 





T=-2, W=0 
(a) 
Protein spool 
(b) T=0, W=-2 T=0, W=-2 


FIGURE 11.25 Supercoiled DNA in a toroidal form wraps readily around protein “spools.” A twisted segment 
of linear DNA with two negative supercoils (a) can collapse into a toroidal conformation if its ends are 
brought closer together (b). Wrapping the DNA toroid around a protein “spool” stabilizes this conformation 
(c). (Adapted from Figure 6.6 in Callandine, C. R., and Drew, H. R., 1992. Understanding DNA: The Molecule and How It Works. 
London: Academic Press.) 


basis of phenomena as diverse as chromosome structure (see Figure 11.27) and 
gene expression. 


A5 What Is the Structure of Eukaryotic Chromosomes? 


A typical human cell is 20 um in diameter. Its genetic material consists of 23 pairs of ds- 
DNA molecules in the form of chromosomes, the average length of which is 3 X 
109 bp/23 or 1.3 X 108 nucleotide pairs. At 0.34 nm/bp in B-DNA, this represents a 
DNA molecule 5 cm long. Together, these 46 dsDNA molecules amount to more than 
2 m of DNA that must be packaged into a nucleus perhaps 5 um in diameter! Clearly, 
the DNA must be condensed by a factor of more than 10°. The mechanisms by which 
this condensation is achieved are poorly understood at the present time, but it is clear 
that the first stage of this condensation is accomplished by neatly wrapping the DNA 
around protein spools called nucleosomes. The string of nucleosomes is then coiled to 
form a helical filament. Subsequent steps are less clear, but it is believed that this fila- 
ment is arranged in loops associated with the nuclear matrix, a skeleton or scaffold of 
proteins providing a structural framework within the nucleus (see following discussion). 


Nucleosomes Are the Fundamental Structural Unit in Chromatin 


The DNA in a eukaryotic cell nucleus during the interphase between cell divisions ex- 
ists as a nucleoprotein complex called chromatin. The proteins of chromatin fall into 
two classes: histones and nonhistone chromosomal proteins. Histones are abundant 
and play an important role in chromatin structure. In contrast, the nonhistone class is 
defined by a great variety of different proteins, all of which are involved in genetic reg- 
ulation; typically, there are only a few molecules of each per cell. Five distinct histones 
are known: H1, H2A, H2B, H3, and H4 (Table 11.2). All five are relatively small, posi- 








TABLE 11.2 | 
Ratio of 
Histone Lysine to Arginine Size (kD) Copies per Nucleosome 
H1 59/3 21.2 1 (not in core) 
H2A 13/13 14.1 2 
H2B 20/8 13.1 2 
H3 13/17 15.1 2 
H4 11/14 11.4 2 
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FIGURE 11.26 The nucleosome core particle wrapped with 1.65 turns of DNA (147 bp). The DNA is shown as a 
blue and orange double helix. The four types of core histones are shown as different colors. (left) View down the 
axis of the nucleosome; (right) view perpendicular to the axis (pdb id = 1AO)). (Adapted from Luger, K, et al., 1997. 
Crystal structure of the nucleosome core particle at 2.8 A resolution. Nature 389:25 1-260. Photos courtesy of T. J. Richmond, 
ETH-Honggerberg, Zurich, Switzerland.) 


tively charged, arginine- or lysine-rich proteins that interact via ionic bonds with the 
negatively charged phosphate groups on the polynucleotide backbone. Pairs of his- 
tones H2A, H2B, H3, and H4 aggregate to form octameric core structures, and the 
DNA helix is wound about these core octamers, creating nucleosomes. 

If chromatin is swelled suddenly in water and prepared for viewing in the electron 
microscope, the nucleosomes are evident as “beads on a string,” dsDNA being the 
string. The structure of the histone octamer core wrapped with DNA has been solved 
by T. J. Richmond and collaborators (Figure 11.26). The core octamer has surface 
landmarks that guide the course of the DNA; 147 bp of B-DNA in a flat, left-handed 
superhelical conformation make 1.6 turns around the histone core (Figure 11.26), 
which itself is a protein superhelix consisting of a spiral array of the four histone 
dimers. Histone H1, a three-domain protein, organizes an additional 29-43 bp of 
DNA and links consecutive nucleosomes. Each complete nucleosome unit contains 
176-190 bp of DNA. The N-terminal tails of histones H3 and H4 are accessible on the 
surface of the nucleosome. Lysine and serine residues in these tails can be covalently 
modified in myriad ways (lysines may be acetylated, methylated, or ubiquitinated; ser- 
ines may be phosphorylated). These modifications play an important role in chro- 
matin dynamics and gene expression (see Chapter 29). 


Higher-Order Structural Organization of Chromatin Gives Rise 
to Chromosomes 


A higher order of chromatin structure is created when the array of nucleosomes, 
in their characteristic beads-on-a-string motif, is wound in the fashion of a solenoid 
(Figure 11.27). One structure proposed for the resulting 30-nm fiber has a diameter 
of 33 nm and a height of 33 nm. It is formed by 22 nucleosomes arrayed helically. Cur- 
rent evidence indicates that this 30-nm filament then forms long DNA loops of variable 
length, each containing on average between 60,000 and 150,000 bp. Electron micro- 
scopic analysis of human chromosome 4 suggests that 18 such loops are then arranged 
radially about the circumference of a single turn to form a miniband unit of the chro- 
mosome. According to this model, approximately 10° of these minibands are arranged 
along a central axis in each of the chromatids of human chromosome 4 that form at 
mitosis (Figure 11.27). Despite intensive study, much about the higher-order structure 
of chromosomes remains to be discovered. 
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(a) “Beads on a string” chromatin form 
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FIGURE 11.27 A model for chromosome structure, hu- 
man chromosome 4. 





SMC Proteins Establish Chromosome Organization and Mediate 
Chromosome Dynamics 


Although the details remain a mystery, we know that the process of chromatin or- 
ganization into chromosomes involves SMC proteins. SMC stands for structural 
maintenance of chromosomes. SMC proteins are members of the nonhistone chro- 
mosomal protein class. SMC proteins form a large superfamily of ATPases involved 
in higher-order chromosome organization and dynamics. SMC protein representa- 
tives are found in all forms of life—archaea, bacteria, and eukaryotes. Chromoso- 
mal dynamics includes chromosome condensation, sister chromatid cohesion, 
genetic recombination, and DNA repair, as well as other phenomena. SMC proteins 
have a characteristic five-domain organization, consisting of an N-terminal globular 
ATP-binding domain, a rodlike dimerization domain involved in coiled coil 
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(a) SMC protein architecture 
SMC monomer 
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formation, a flexible hinge region, another rodlike and coiled coil-forming region, 
and finally a C-terminal globular domain termed DA for its DNA-binding and 
ATPase abilities (Figure 11.28). Five subgroups of SMC proteins are found in eu- 
karyotes, and functional SMC proteins are heterodimers. SMC2/SMC4 het- 
erodimers are essential for chromatin condensation as part of condensin com- 
plexes; SMC1/SMC3 heterodimers act in sister chromatid cohesion as part of 
cohesin complexes. Current models of SMC protein function suggest that V-shaped 
heterodimers bind to DNA through their DA domains and mediate chromosomal 
dynamics in an ATP-dependent manner. The flexible hinge region of each SMC 
subunit is located at the point of the V, and hinge-bending motions allow the DNA- 
binding parts of the two globular heads to move closer together, compacting the 
DNA into a higher-order structure (Figure 11.28). 


‘Ve60 Can Nucleic Acids Be Synthesized Chemically? 


Laboratory synthesis of oligonucleotide chains of defined sequence presents some of 
the same problems encountered in chemical synthesis of polypeptides (see Chapter 
5). First, functional groups on the monomeric units (in this case, bases) are reactive 
under conditions of polymerization and therefore must be protected by blocking 
agents. Second, to generate the desired sequence, a phosphodiester bridge must be 
formed between the 3’-O of one nucleotide (B) and the 5’-O of the preceding one 
(A) in a way that precludes the unwanted bridging of the 3’-O of A with the 5’-O of 
B. Finally, recoveries at each step must be high so that overall yields in the multistep 
process are acceptable. As in peptide synthesis (see Chapter 5), orthogonal solid- 
phase methods are used to overcome some of these problems. Commercially available 


FIGURE 11.28 SMC protein architecture and function. 

(a) SMC protein architecture. SMC proteins range from 
115 to 165 kD in size. (b) SMC protein function. SMC pro- 
teins are reminiscent of motor proteins. Illustrated in (b) is 
a condensation of DNA into a coiled arrangement 
through SMC2/SMC4-mediated interactions. 
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HUMAN BIOCHEMISTRY 


Telomeres and Tumors 


Eukaryotic chromosomes are linear. The ends of chromosomes 
have specialized structures known as telomeres. The telomeres of 
virtually all eukaryotic chromosomes consist of short, tandemly 
repeated nucleotide sequences at the ends of the chromosomal 
DNA. For example, the telomeres of human germline (sperm and 
egg) cells contain between 1000 and 1700 copies of the hexam- 
eric repeat TTAGGG (see accompanying figure). Telomeres con- 


(a) 


tribute to the maintenance of chromosomal integrity by protect- 
ing against DNA degradation or rearrangement. Telomeres are 
added to the ends of chromosomal DNA by an RNA-containing 
enzyme known as telomerase (see Chapter 28). In human telom- 
erase, the ribonucleotide part is a 962-nucleotide-long RNA. 
Telomerase is an unusual DNA polymerase that was discovered in 
1985 by Elizabeth Blackburn and Carol Greider of the University 
of California, San Francisco. However, most normal somatic cells 
lack telomerase. Consequently, upon every cycle of cell division 


when the cell replicates its DNA, about 50-nucleotide segments 
are lost from the end of each telomere. Thus, over time, the 
telomeres of somatic cells in animals become shorter and shorter, 
eventually leading to chromosome instability and cell death. This 
phenomenon has led some scientists to espouse a “telomere the- 
ory of aging” that implicates telomere shortening as the principal 
factor in cell, tissue, and even organism aging. Interestingly, can- 
cer cells appear “immortal” because they continue to reproduce 
indefinitely. A survey of 20 different tumor types by Geron Cor- 
poration of Menlo Park, California, revealed that all contained 
telomerase activity. 





Site of telomerase 
DNA polymerase 
function 








«í (a) Telomeres on human chromosomes. TTAGGG tandem repeats are 
attached to the 3’-ends of the DNA strands and are paired with the comple- 
mentary sequence 3’-AATCCC-5' on the other DNA strand. Thus, a G-rich region 
is created at the 3’-end of each DNA strand, and a C-rich region is created at the 
5’-end of each DNA strand. Typically, at each end of the chromosome, the G-rich 
strand protrudes 12 to 16 nucleotides beyond its complementary C-rich strand. 
(b) The ribonucleic acid of human telomerase serves as the template for the 
DNA polymerase activity of telomerase. Nucleotides 46 to 56 of this RNA are 
CUAACCCUAAC and provide the template function for the telomerase-catalyzed 
addition of TTAGGG units to the 3’-end of a DNA strand. 


Telomerase 
protein 











automated instruments, called DNA synthesizers or “gene machines,” are capable of 
carrying out the synthesis of oligonucleotides of 150 bases or more. 


Phosphoramidite Chemistry Is Used to Form Oligonucleotides 
from Nucleotides 


Phosphoramidite chemistry is currently the accepted method of oligonucleotide syn- 
thesis. The general strategy involves the sequential addition of nucleotide units as 
nucleoside phosphoramidite derivatives to a nucleoside covalently attached to the insol- 
uble resin. Excess reagents, starting materials, and side products are removed after 
each step by filtration. After the desired oligonucleotide has been formed, it is freed 
of all blocking groups, hydrolyzed from the resin, and purified by gel electrophore- 
sis. The four-step cycle is shown in Figure 11.29. Chemical synthesis takes place in the 
3'—5' direction (the reverse of the biological polymerization direction). 


Genes Can Be Synthesized Chemically 


It is possible to synthesize genes using phosphoramidite chemistry (Table 11.3). Be- 
cause protein-coding genes are characteristically much larger than the 150-bp prac- 
tical limit on oligonucleotide synthesis, their synthesis involves joining a series of 
oligonucleotides to assemble the overall sequence. 
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FIGURE 11.29 Solid-phase oligonucleotide synthesis. The four-step cycle starts with the first base in nucleoside 
form attached by its 3’-OH group to an insoluble support. Its 5’-OH is blocked with a dimethoxytrityl (DMTr) 
group (a). If the base has reactive —NH, functions, as in A, G, or C, then N-benzoyl or N-isobutyry! derivatives are 
used to prevent their reaction (b). In step 1, the DMTr protecting group is removed by trichloroacetic acid treat- 
ment. Step 2 is the coupling step: The second base is added in the form of a nucleoside phosphoramidite deriv- 
ative whose 5'-OH bears a DMTr blocking group so it cannot polymerize with itself (c). continued 


Alw What Are the Secondary and Tertiary Structures 
of RNA? 


RNA molecules (see Chapter 10) are typically single-stranded. The course of a 
single-stranded RNA in three-dimensional space conceivably would have six de- 
grees of freedom per nucleotide, represented by rotation about each of the six sin- 
gle bonds along the sugar-phosphate backbone per nucleotide unit. (Rotation 
about the -glycosidic bond creates a seventh degree of freedom in terms of the 
total conformational possibilities at each nucleotide.) Compare this situation with 
DNA, whose separated strands would obviously enjoy the same degrees of freedom. 
However, the double-stranded nature of DNA imposes great constraint on its con- 
formational possibilities. Compared to dsDNA, an RNA molecule has a much 
greater number of conformational possibilities. Intramolecular interactions and 
other stabilizing influences limit these possibilities, but the higher-order structure 
of RNA remains an area for fruitful scientific discovery. 
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TABLE 11.3 | 

Gene Size (bp) 
tRNA 126 
a-Interferon 542 
Secretin 81 
y-Interferon 453 
Rhodopsin 1057 
Proenkephalin 77 
Connective tissue activating 

peptide III 280 
Lysozyme 385 
Tissue plasminogen activator 1610 
c-Ha-ras 576 
RNase T1 324 
Cytochrome b5 330 
Bovine intestinal Ca-binding 

protein 298 
Hirudin 226 
RNase A 375 
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FIGURE 11.29 continued In step 2, the presence of a weak acid, such as tetrazole, activates the phospho- 
ramidite, and it rapidly reacts with the free 5’-OH of N-1, forming a dinucleotide linked by a phosphite group. 
Unreacted free 5’-OHs of N-1 are blocked from further participation in the polymerization process by acetyla- 
tion with acetic anhydride in step 3, referred to as capping. In step 4, the phosphite linkage between N-1 and 
N-2 is oxidized by aqueous iodine (l2) to form the desired more stable phosphate group. Subsequent cycles 
add successive residues to the resin-immobilized chain.When the chain is complete, it is cleaved from the sup- 
port with NH4OH, which also removes the N-benzoyl— and N-isobutyryl-protecting groups from the amino 
functions on the A, G, and C residues. 


Although single-stranded, RNA molecules are rich in double-stranded regions 
that form when complementary sequences within the chain come together and join 
via intrastrand base pairing. These interactions create hairpin stem-loop structures, 
in which the base-paired regions form the stem and the unpaired regions between 
base pairs are the loop, as in Figures 11.30 and 11.31. Paired regions of RNA can- 
not form B-DNA-type double helices because the RNA 2’-OH groups are a steric 
hindrance to this conformation. Instead, these paired regions adopt a conforma- 
tion similar to the A-form of DNA, having about 11 bp per turn, with the bases 
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FIGURE 11.30 Bulges and loops formed in RNA when aligned sequences are not fully complementary. (Adapted 
from Appendix Figure 1 in Gesteland, R. F, Cech, T.R.,and Atkins, J. F., eds. The RNA World, 2nd ed. New York: Cold Spring Harbor 
Press.) 


strongly tilted from the plane perpendicular to the helix axis (see Figure 11.9). 
A-form double helices are the most prominent secondary structural elements in 
RNA. Both tRNA and rRNA have large amounts of A-form double helix. In addition, 
a number of defined structural motifs recur within the loops of stem-loop struc- 
tures, such as U-turns (a loop motif of consensus sequence UNRN, where N is any 
nucleotide and R is a purine) and tetraloops (another class of four-nucleotide loops 
found at the termini of stem-loop structures). Stems of stem-loop structures may 
also have bulges (or internal loops) where the RNA strand is forced into a short 
single-stranded loop because one or more bases along one strand in an RNA dou- 
ble helix finds no base-pairing partners (Figure 11.30). Regions where several stem- 
loop structures meet are termed junctions (Figure 11.31). Stems, loops, bulges, and 
junctions are the four basic secondary structural elements in RNA. 

The single-stranded loops in RNA stem-loops create base-pairing opportunities 
between distant, complementary, single-stranded loop regions. These interactions, 
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FIGURE 11.31 Junctions and coaxial stacking in RNA. 
Stem junctions (or multibranched loops) are another 
type of RNA secondary structure. Coaxial stacking of 
stems or stem-loops (as in stacking of stem 1 on stem- 
loop 4) is a tertiary structural feature found in many 
RNAs. (Adapted from Figure 1 in Tyagi, R., and Matthews, D. H., 
2007. Predicting coaxial stacking in multibranch loops. RNA 
13:1-13.) 
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on the right (pdb id = 1YMO). (Adapted from Figure 2 in 
Staple, D.W.,and Butcher, S. E., 2005. Pseudoknots: RNA structures 
with diverse functions. PLoS Biology 3:e213.) 





mostly based on Watson—Crick base pairing, lead to tertiary structure in RNA. Other 
tertiary structural motifs arise from coaxial stacking (Figure 11.31), pseudoknot 
formation, and ribose zippers. In coaxial stacking, the blunt, nonloop ends of stem- 
loops situated next to one another in the RNA sequence stack upon each other to 
create an uninterrupted stack of base pairs. A good example of coaxial stacking is 
found in the tertiary structure of tRNAs, where the acceptor end of the L-shaped 
tRNA is formed by coaxial stacking of the acceptor stem on the TWC stem-loop and 
the anticodon end is formed by coaxial stacking of the dihydrouracil stem-loop on 
the anticodon stem-loop (Figures 11.33 and 11.35). Pseudoknots occur when bases 
in the loops of stem-loop structures form a short double helix by base pairing with 
nearby single-stranded regions in the RNA (Figure 11.32). Ribose zippers are found 
when two antiparallel, single-stranded regions of RNA align as an H-bonded net- 
work forms between the 2'-OH groups of the respective strands, the O at the 2'-OH 
position of one strand serving as the H-bond acceptor while the H on the 2'-OH of 
the other strand is the H-bond donor. Ribose zippers and the other RNA structures 
mentioned here are well represented by many examples in the SCOR (Structural 
Classification of RNA) database at http://scor.lbl.gov/ and NDB (Nucleic Acid 
Database) at http://ndbserver.rutgers.edu/. 


Transfer RNA Adopts Higher-Order Structure Through Intrastrand 
Base Pairing 


In tRNA molecules, which contain 73 to 94 nucleotides in a single chain, a majority 
of the bases are hydrogen bonded to one another. Figure 11.33 shows the structure 
that typifies tRNAs. Hairpin turns bring complementary stretches of bases in the 
chain into contact so that double helical regions form, creating stem-loop sec- 
ondary structures. Because of the arrangement of the complementary stretches 
along the chain, the overall pattern of base pairing can be represented as a cloverleaf. 
Each cloverleaf consists of four base-paired segments—three loops and the stem 
where the 3’- and 5’-ends of the molecule meet. These four segments are desig- 
nated the acceptor stem, the D loop, the anticodon loop, and the TYC loop (the latter 
two are U-turn motifs). 
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FIGURE 11.33 A general diagram for the structure of 
tRNA. The positions of invariant bases as well as bases 
that seldom vary are shown in color.R = purine; Y = 
pyrimidine. Dotted lines denote sites in the D loop and 
variable loop regions where varying numbers of nu- 
cleotides are found in different tRNAs. Inset: An aminoa- 
cyl group can add to the 3’-OH to create an aminoacyl- 


Anticodon tRNA. 





tRNA Secondary Structure The acceptor stem is where the amino acid is linked to 
form the aminoacyl-tRNA derivative, which serves as the amino acid—donating 
species in protein synthesis; this is the physiological role of tRNA. The carboxyl 
group of an amino acid is linked to the 3'-OH of the 3’-terminal A nucleotide, thus 
forming an aminoacyl ester (Figure 11.33). The 3’-end of tRNA is invariantly 
CCA-3'-OH. The D loop is so named because this tRNA loop often contains dihy- 
drouridine, or D, residues. In addition to dihydrouridine, tRNAs characteristically 
contain a number of unusual bases, including inosine, thiouridine, pseudouridine, 
and hypermethylated purines (see Figure 10.23). The anticodon stem-loop consists of 
a double helical segment and seven unpaired bases, three of which are the 
anticodon—a three-nucleotide unit that recognizes and base pairs with a particu- 
lar mRNA codon, a complementary three-base unit in mRNA providing the genetic 
information that specifies an amino acid. In the 5’-3' direction beyond the anti- 
codon stem-loop lies a loop that varies from tRNA to tRNA in the number of 
residues that it has, the so-called extra or variable loop. The last loop in the tRNA, 
reading 5'—3’, is within the TYC stem-loop. It contains seven unpaired bases, in- 
cluding the sequence TYC, where y is the symbol for pseudouridine. Most of the 
invariant residues common to tRNAs lie within the non—hydrogen-bonded regions 
of the cloverleaf structure. 


tRNA Tertiary Structure Tertiary structure in tRNA arises from base-pairing inter- 
actions between bases in the D loop with bases in the variable and TYC loops, as 
shown for yeast phenylalanine tRNA in Figure 11.34. Note that these base-pairing in- 
teractions involve the invariant nucleotides of tRNAs. These interactions fold the D 
and TYC arms together and bend the cloverleaf into the stable L-shaped tertiary 
form (Figure 11.35). Many of these base-pairing interactions involve base pairs that 
are not canonical A:T or G:C pairings, as illustrated around the central ribbon dia- 
gram of the tRNA in Figure 11.35. Note that three of the interactions involve three 
bases. The amino acid acceptor stem (highlighted in green) is at one end of the in- 
verted, backward L shape, separated by 7 nm or so from the anticodon at the oppo- 
site end of the L. The D and TYC loops form the corner of the L. Hydrophobic stack- 
ing interactions between the flat faces of the bases contributes significantly to L-form 
stabilization. 
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FIGURE 11.34 Tertiary interactions in yeast phenyl- 
alanine tRNA. The molecule is presented in the conven- 
tional cloverleaf secondary structure generated by in- 
trastrand hydrogen bonding. Solid lines connect bases 
that are hydrogen bonded when this cloverleaf pattern 
is folded into the characteristic tRNA tertiary structure 
(see also Figure 11.35). 
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Ribosomal RNA also Adopts Higher-Order Structure 
Through Intrastrand Base Pairing 


rRNA Secondary Structure A large degree of intrastrand sequence complementarity 
is found in all ribosomal RNA strands, and all assume a highly folded pattern that 
allows base pairing between these complementary segments, giving rise to multiple 
stem-loop structures. Furthermore, the loop regions of stem-loops contain the 
characteristic structural motifs, such as U-turns, tetraloops, and bulges. Figure 
11.36 shows the secondary structure of several 16S rRNAs, based on computer 
alignment of each nucleotide sequence into optimal H-bonding segments. The re- 
liability of these alignments is then tested through a comparative analysis of 
whether very similar secondary structures are observed. If so, then such structures 
are apparently conserved. The approach is based on the thesis that because ribo- 
somal RNA species (regardless of source) serve common roles in protein synthesis, 
it may be anticipated that they share structural features. These secondary struc- 
tures resemble one another, even though the nucleotide sequences of these 16S 
rRNAs exhibit little sequence similarity. Apparently, evolution is acting at the level 
of rRNA secondary structure, not rRNA nucleotide sequence. Similar conserved 
folding patterns are seen for the 5S-like rRNAs and 23S-like rRNAs that reside in 
the large ribosomal subunits of various species. An insightful conclusion may be 
drawn regarding the persistence of such strong secondary structure conservation 
despite the millennia that have passed since these organisms diverged: All ribosomes 
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FIGURE 11.35 (a) The three-dimensional structure of yeast phenylalanine tRNA. The tertiary folding 
is illustrated in the center of the diagram with the ribose-phosphate backbone presented as a con- 
tinuous ribbon; H bonds are indicated by crossbars. Unpaired bases are shown as short, uncon- 
nected rods. The anticodon loop is at the bottom and the -CCA 3’-OH acceptor end is at the top 
right. (b) A space-filling model of the molecule (pdb id = 6TNA). 
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(a) E. coli (a eubacterium) 


(b) H. volcanii (an archaebacterium) (c) S. cerevisiae (yeast, a lower eukaryote) 





FIGURE 11.36 Comparison of secondary structures of 16S-like rRNAs from (a) a bacterium (E.coli), (b) an ar- 
chaeon (H. volcanii), and (c) a eukaryote (S. cerevisiae, a yeast). 


are similar, and all function in a similar manner. As usual with RNAs, the single- 
stranded regions of rRNA create the possibility of base-pairing opportunities with 
distant, complementary, single-stranded regions. Such interactions are the driving 
force for tertiary structure formation in RNAs. 


rRNA Tertiary Structure Recently, the detailed structure of ribosomes has been 
revealed through X-ray crystallography and cryoelectron microscopy of ribo- 
somes (see Chapter 30). These detailed images not only disclose the tertiary 
structure of the rRNAs but also the quaternary interactions that must occur when 
ribosomal proteins combine with rRNAs and when the ensuing ribonucleopro- 
tein complexes, the small and large subunits, come together to form the com- 
plete ribosome. Only the rRNAs of the 50S ribosomal subunit are shown in 
Figure 11.37; no ribosomal proteins are shown. Note that the overall anatomy of 
the 50S ribosomal subunit (shown diagrammatically in Figure 10.22) is essen- 
tially the same as that of the rRNA molecules within this subunit, even though 
these rRNAs account for only 65% of the mass of this particle. An assortment of 
tertiary structural features are found in the rRNAs, including coaxial stacks, 
pseudoknots, and ribose zippers. We will consider the role of rRNA in ribosome 
structure and function in Chapter 30. 


Aptamers Are Oligonucleotides Specifically Selected for Their 
Ligand-Binding Ability 

Aptamers are synthetic oligonucleotides, usually RNA, which fold into very specific 
three-dimensional structures that selectively bind ligands with high affinity. Ligand 
binding by aptamers is based on the fundamental principle of structural comple- 
mentarity. The rich array of interactive possibilities presented by the four bases and 
the sugar—phosphate backbone, coupled with the inherent flexibility of polynu- 
cleotide chains, make nucleic acids very good ligand-binding candidates. The bases 
project polar amino and carbonyl functionalities, and their 7-electron density gives 
them nonpolar properties. The sugar—phosphate backbone presents polar —OH 
groups and regularly spaced, negatively charged phosphate groups. These phos- 
phate groups can coordinate cations and thus provide foci of positive charge. Syn- 
thetic aptamers designed to target a selected protein can be potent inhibitors of 
protein function; they are of interest in drug development. 
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FIGURE 11.37 The secondary and tertiary structures of rRNAs in the 50S ribosomal subunit from the archaeon 
Haloarcula marismortui (pdb id = 1FFk). (a) Secondary structure of the 23S rRNA, with various domains color- 
coded. (b) Secondary structure of 5S rRNA. (c) Tertiary structure of the 5S and 23S rRNAs within the 50S riboso- 
mal subunit. The 5S rRNA (red) lies atop the 23S rRNA. (Adapted from Figure 4 in Ban, N, et al., 2000. The complete 
atomic structure of the large ribosomal subunit at 2.4 A resolution. Science 289:905—-920.) 
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FIGURE 11.38 Structure of the thiamine pyrophosphate 
(TPP) riboswitch, a conserved region within the mRNA 
that encodes enzymes for synthesis of this coenzyme 
(pdb id = 2CKY). TTP a pyrimidine-containing com- 
pound, is shown in orange. (From Figure 1b in Thore, S., 
Leibundgdut, M., and Ban, N., 2006. Structure of the eukaryotic 
thiamine pyrophosphate riboswitch with its regulatory ligand. 
Science 312:1208-1211.) 





Riboswitches, a naturally occurring class of aptamers, are conserved regions of 
mRNAs that reversibly bind specific metabolites and coenzymes and usually act as 
regulators of gene expression. Riboswitches are usually buried within the 5’- or 
3’-untranslated regions of the mRNAs whose expression they regulate. Binding of 
the metabolite to the riboswitch typically blocks expression of the mRNA. Figure 
11.38 shows the structure of the thiamine pyrophosphate riboswitch. 


SUMMARY 


11.1 How Do Scientists Determine the Primary Structure of Nucleic 
Acids? The most widely used protocol for nucleic acid sequencing is 
Sanger’s chain termination (also called the dideoxy or the primed syn- 
thesis) method. A DNA fragment of unknown sequence serves as tem- 
plate in a polymerization reaction using DNA polymerase. Polymeriza- 
tion depends on an oligonucleotide primer base-paired to the unknown 
sequence. All four DNA polymerase deoxynucleotide substrates—dATP, 
dGTP, dCTP, and dTTP—are present. In addition, the reaction mixture 
contains the four corresponding 2’,3’-dideoxynucleotides (ddATP, 
ddGTP, ddCTP, and ddTTP). As synthesis proceeds, a deoxynucleotide is 
usually added to the 3’-OH end of the growing chain as the newly 
formed strand is extended in the 5'—>3' direction. Occasionally, however, 
a dideoxynucleotide is added and, because it lacks a 3'-OH group, it can- 
not serve as a deoxynucleotide acceptor in chain extension. Then syn- 
thesis is terminated. This base-specific premature chain termination is 
only a random, occasional event, and a population of new strands of vary- 
ing length is synthesized. The population of newly synthesized DNAs 
forms a nested set of molecules differing in length by just one nucleo- 
tide. Each has a dideoxynucleotide at its 3’-end. Because each of the four 
dideoxynucleotides bears a different fluorescent tag, the particular fluo- 
rescence (orange for ddA, blue for ddC, green for ddG, and red for 
ddT) indicates which base was specified by the template and incorpo- 
rated by DNA polymerase at that spot. The sequencing products are vi- 
sualized by fluorescence spectroscopy following capillary electrophore- 
sis, revealing the sequence of the newly synthesized strands. This 
observed sequence is complementary to the corresponding unknown 
template sequence. Sanger sequencing has been fully automated. 


11.2 What Sorts of Secondary Structures Can Double-Stranded DNA 
Molecules Adopt? DNA typically occurs as a double helical molecule, 
with the two DNA strands running antiparallel to one another, bases in- 
side, sugar—-phosphate backbone outside. The double helical arrange- 


ment dramatically curtails the conformational possibilities otherwise 
available to single-stranded DNA. DNA double helices can be in a num- 
ber of stable conformations, with the three predominant forms termed 
A-, B-, and Z-DNA. B-DNA, has about 10.5 base pairs per turn, each con- 
tributing about 0.332 nm to the length of the double helix. The base 
pairs in B-DNA are nearly perpendicular to the helix axis. In A-DNA, the 
pitch is 2.46 nm, with 11 bp per turn. A-DNA has its base pairs displaced 
around, rather than centered on, the helix axis. Z-DNA has four dis- 
tinctions: It is left-handed, it is G: C-rich, the repeating unit on a given 
strand is the dinucleotide, and the sugar—phosphate backbone follows a 
zigzag course. Alternative hydrogen-bonding interactions between A:T 
and G:C gives rise to Hoogsteen base pairs. Interstrand Hoogsteen base 
pairing creates novel multiplex structures composed of three or four 
DNA strands. These multiplex structures occur naturally and have bio- 
logical implications. 


11.3 Can the Secondary Structure of DNA Be Denatured and Rena- 
tured? When duplex DNA is subjected to conditions that disrupt base- 
pairing interactions, the double helix is denatured and the two DNA 
strands separate as individual random coils. Denatured DNA will rena- 
ture to re-form a duplex structure if the denaturing conditions are re- 
moved. The rate of DNA renaturation is an index of DNA sequence 
complexity. 

If DNA from two different species are mixed, denatured, and al- 
lowed to anneal, artificial hybrid duplexes may form, provided the DNA 
from one species is similar in nucleotide sequence to the DNA of the 
other. Nucleic acid hybridization can reveal evolutionary relationships, 
and it can be exploited to identify specific DNA sequences. 


11.4 Can DNA Adopt Structures of Higher Complexity? Supercoils are 
one kind of DNA tertiary structure. In relaxed, B-form DNA, the two 
strands wind about each other once every 10 bp or so (once every turn of 


the helix). DNA duplexes form supercoils if the strands are underwound 
(negatively supercoiled) or overwound (positively supercoiled). The basic para- 
meter characterizing supercoiled DNA is the linking number, L. L can be 
equated to the twist (T) and writhe (W), where twist is the number of he- 
lical turns and writhe is the number of supercoils: L= T + W. L can be 
changed only if one or both strands of the DNA are broken, the strands 
are wound tighter or looser, and their ends are rejoined. DNA gyrase is a 
topoisomerase that introduces negative supercoils into bacterial DNA. 


11.5 What Is the Structure of Eukaryotic Chromosomes? The DNA 
in a eukaryotic cell exists as chromatin, a nucleoprotein complex 
mostly composed of DNA wrapped around a protein core consisting 
of eight histone polypeptide chains—two copies each of histones 
H2A, H2B, H3, and H4. This DNA: histone core structure is termed 
a nucleosome, the fundamental structural unit of chromosomes. A 
higher order of chromatin structure is created when the array of nu- 
cleosomes is wound into a solenoid, creating a 30-nm filament. This 
30-nm filament then is formed into long DNA loops, and loops are 
arranged radially about the circumference of a single turn to form a 
miniband unit of a chromosome. SMC proteins mediate chomoso- 
mal dynamics, including chromatin condensation and chromosome 
formation. 


11.6 Can Nucleic Acids Be Synthesized Chemically? Laboratory syn- 
thesis of oligonucleotide chains of defined sequence is accomplished 
through orthogonal solid-phase methods based on phosphoramidite 
chemistry. Chemical synthesis takes place in the 3'—>5' direction (the 
reverse of the biological polymerization direction). Commercially avail- 
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able automated instruments called DNA synthesizers can synthesize 
oligonucleotide chains with 150 bases or more. 


11.7 What Are the Secondary and Tertiary Structures of RNA? 
Compared to double-stranded DNA, single-stranded RNA has many 
more conformational possibilities, but intramolecular interactions 
and other stabilizing influences limit these possibilities. RNA mole- 
cules have many double-stranded regions formed via intrastrand hy- 
drogen bonding. Such double-stranded regions give rise to hairpin 
stem-loop structures. A number of defined structural motifs recur 
within the loops of stem-loop structures, such as U-turns and 
tetraloops. 

Single-stranded loops in RNA stem-loops create base-pairing oppor- 
tunities between distant, complementary, single-stranded loop regions. 
Other tertiary structural motifs arise from coaxial stacking, pseudoknot 
formation, and ribose zippers. 

In tRNAs, the formation of stem-loops leads to a cloverleaf pattern of 
secondary structure formed from four base-paired segments: the acceptor 
stem, the D loop, the anticodon loop, and the TYC loop. Base-pairing in- 
teractions between bases in the D and TyC loops give rise to tertiary 
structure by bending the cloverleaf into the stable L-shaped form. 

Substantial intrastrand sequence complementarity also is found in 
ribosomal RNA molecules, leading to a highly folded pattern based 
on base pairing between complementary segments. The complete 
three-dimensional structure of rRNAs has revealed an assortment of 
the tertiary structural features common to RNAs, including coaxial 
stacks, pseudoknots, and ribose zippers. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. The oligonucleotide ddAGATGCCTGACT was subjected to sequenc- 
ing by Sanger’s dideoxy method, using fluorescent-tagged dideoxy- 
nucleotides and capillary electrophoresis, essentially as shown in 
Figure 11.3. Draw a diagram of the gel-banding pattern within the 
capillary. 

2. The output of an automated DNA sequence determination by the 
Sanger dideoxy chain termination method performed as illustrated 
in Figure 11.3 is displayed at right. What is the sequence of the orig- 
inal oligonucleotide? 

3. X-ray diffraction studies indicate the existence of a novel double- 
stranded DNA helical conformation in which AZ (the rise per base 
pair) = 0.32 nm and P (the pitch) = 3.36 nm. What are the other pa- 
rameters of this novel helix: (a) the number of base pairs per turn, 
(b) Ad@ (the mean rotation per base pair), and (c) c (the true repeat)? 

4. A 41.5nm-long duplex DNA molecule in the B-conformation 
adopts the A-conformation upon dehydration. How long is it now? 
What is its approximate number of base pairs? 

5. If 80% of the base pairs in a duplex DNA molecule (12.5 kbp) are 
in the B-conformation and 20% are in the Z-conformation, what is 
the length of the molecule? 

6. A “relaxed,” circular, double-stranded DNA molecule (1600 bp) is 
in a solution where conditions favor 10 bp per turn. What is the 
value of Lo for this DNA molecule? Suppose DNA gyrase introduces 
12 negative supercoils into this molecule. What are the values of L, 
W, and T now? What is the superhelical density, o? 

7. Suppose one double helical turn of a superhelical DNA molecule 
changes conformation from B- to Z-form. What are the changes in 
L, W, and T? Why do you suppose the transition of DNA from B- to 
Z-form is favored by negative supercoiling? 

8. Assume that there is one nucleosome for every 200 bp of eukaryotic 
DNA. How many nucleosomes are there in a diploid human cell? 
Nucleosomes can be approximated as disks 11 nm in diameter and 














6 nm long. If all the DNA molecules in a diploid human cell are in 
the B-conformation, what is the sum of their lengths? If this DNA is 
now arrayed on nucleosomes in the beads-on-a-string motif, what 
would be the approximate total height of the nucleosome column 
if these disks were stacked atop one another? 


9. The characteristic secondary structures of tRNA and rRNA mole- 
cules are achieved through intrastrand hydrogen bonding. Even for 
the small tRNAs, remote regions of the nucleotide sequence inter- 
act via H bonding when the molecule adopts the cloverleaf pattern. 
Using Figure 11.33 as a guide, draw the primary structure of a tRNA 
and label the positions of its various self-complementary regions. 

10. Using the data in Table 10.1, arrange the DNAs from the follow- 
ing sources in order of increasing Tw: human, salmon, wheat, 
yeast, E. coli. 

11. At 0.2 M Na‘, the melting temperature of double-stranded DNA is 
given by the formula, T„= 69.3 + 0.41 (% G + C). The DNAs from 
mice and rats have (G + C) contents of 44% and 40%, respectively. 
Calculate the Tns for these DNAs in 0.2 M NaCl. If samples of these 
DNAs were inadvertently mixed, how might they be separated from 
one another? 
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12. The buoyant density of DNA is proportional to its (G + C) con- 
tent. (G:C base pairs have more atoms per volume than A:T base 
pairs.) Calculate the density (p) of avian tubercle bacillus 
DNA from the data presented in Table 10.1 and the equation p = 
1.660 + 0.098(GC), where (GC) is the mole fraction of (G + C) 
in DNA. 


13. (Integrates with Chapter 10.) Pseudouridine (y) is an invariant 
base in the TYC loop of tRNA; w is also found in strategic places 
in rRNA. (Figure 10.23 shows the structure of pseudouridine.) 
Draw the structure of the base pair that y might form with G. 


14. The plasmid pBR322 is a closed circular dsDNA containing 4363 
base pairs. What is the length in nm of this DNA (that is, what is its 
circumference if it were laid out as a perfect circle)? The E. coli K12 
chromosome is a closed circular dsDNA of about 4,639,000 base 
pairs. What would be the circumference of a perfect circle formed 
from this chromosome? What is the diameter of a dsDNA molecule? 
Calculate the ratio of the length of the circular plasmid pBR322 
to the diameter of the DNA of which it’s made. Do the same for the 
E. coli chromosome. 


15. Listed below are four DNA sequences. Which one contains a 
type-II restriction endonuclease (“six-cutter”) hexanucleotide site? 
Which one that is likely to form a cruciform structure? Which one 
is likely to be found in Z-DNA? Which one represents the 5’-end of 
a tRNA gene? Which one is most likely to be found in a triplex 
DNA structure? 

a. CGCGCGCCGCGCACGCGCTCGCGCGCCGC 
b. GAACGTCGTATTCCCGTACGACGTTC 

c. CAGGTCTCTCTCTCTCTCTCTC 

d. TGGTGCGAATTCTGTGGAT 

e. ATCGGAATTCATCG 

16. The nucleotide sequence of E. coli tRNA®™ is as follows: 
UGGGGUAUCG))CCAAGC—GGU »,AAGGCACCGG35 
AUUCUGAWYVC4y9CGGCAUUCCG;,AGGTWCGAA U6 
CCUCGUACCC,)CAGCCA;¢, 

From this primary structure information, draw the secondary struc- 
ture (cloverleaf) of this RNA and identify its anticodon. 


17. The Protein Data Bank (PDB) is also a repository for nucleic acid 
structures. Go to the PDB at www.rcsb.org and enter pdb id = 1YI2. 
1YI2 is the PDB ID for the structure of the H. marismortui 50S ribo- 
somal subunit with erythromycin bound. Erythromycin is an antibi- 
otic that acts by inhibiting bacterial protein synthesis. In the list of 
the display options under the image of the 50S subunit, click on the 
“KiNG” viewing option to view the structure. Using the tools of the 
KiNG viewer, zoom in and locate erythromycin within this structure. 
If the 50S ribosomal subunit can be compared to a mitten, where in 
the mitten is erythromycin? 


18. Online resources provide ready access to detailed information about 
the human genome. Go the National Center for Biotechnology Infor- 
mation (NCBI) genome database at http://www.ncbi.nlm.nih.gov/ 
Genomes/index.html and click on Homo sapiens in the Map Viewer 
genome annotation updates list to access the chromosome map and 
organization of the human genome. Next, go to http://www.ncbi 
.nim.nih.gov/genome/. In the “Search For” box, type in the follow- 
ing diseases to discover the chromosomal location of the affected 
gene and, by exploring links highlighted by the search results, dis- 
cover the name of the protein affected by the disease: 

a. Sickle cell anemia 
b. Tay Sachs disease 
c. Leprechaunism 

d. Hartnup disorder 


Preparing for the MCAT Exam 

19. (Integrates with Chapter 10.) Erwin Chargaff did not have any 
DNA samples from thermoacidophilic bacteria such as those that 
thrive in the geothermal springs of Yellowstone National Park. 
(Such bacteria had not been isolated by 1951 when Chargaff re- 
ported his results.) If he had obtained such a sample, what do you 
think its relative G:C content might have been? Why? 

20. Think about the structure of DNA in its most common B-form 
double helical conformation and then list its most important struc- 
tural features (deciding what is “important” from the biological role 
of DNA as the material of heredity). Arrange your answer with the 
most significant features first. 
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Scala/Art Resource, NY 


The Chimera of Arezzo, of Etruscan origin and proba- 
bly from the fifth century B.C., was found near Arezzo, 
Italy, in 1553. Chimeric animals existed only in the 
imagination of the ancients. But the ability to create 
chimeric DNA molecules is a very real technology 
that has opened up a whole new field of scientific 
investigation. 





...how many vain chimeras have you created ?... 
Go and take your place with the seekers after 
gold. 


Leonardo da Vinci 
The Notebooks (1508-1518), Volume I, Chapter 25 





KEY QUESTIONS 


12.1 What Does It Mean “To Clone”? 
12.2 What Is a DNA Library? 


12.3 Can the Cloned Genes in Libraries Be 
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ESSENTIAL QUESTIONS 


Using techniques for the manipulation of nucleic acids in the laboratory, scientists 
can join together different DNA segments from different sources. Such manmade 
products are called recombinant DNA molecules, and the use of such molecules to 
alter the genetics of organisms is termed genetic engineering. 


What are the methods that scientists use to create recombinant DNA mole- 
cules; can scientists create genes from recombinant DNA molecules; and can 
scientists modify the heredity of an organism using recombinant DNA? 


In the early 1970s, technologies for the laboratory manipulation of nucleic acids 
emerged. In turn, these technologies led to the construction of DNA molecules com- 
posed of nucleotide sequences taken from different sources. The products of these 
innovations, recombinant DNA molecules,! opened exciting new avenues of investi- 
gation in molecular biology and genetics, and a new field was born—recombinant 
DNA technology. Genetic engineering is the application of this technology to the 
manipulation of genes. These advances were made possible by methods for amplifi- 
cation of any particular DNA segment, regardless of source, within bacterial host 
cells. Or, in the language of recombinant DNA technology, the cloning of virtually 
any DNA sequence became feasible. 


2st What Does It Mean “To Clone”? 


In classical biology, a clone is a population of identical organisms derived from a sin- 
gle parental organism. For example, the members of a colony of bacterial cells that 
arise from a single cell on a petri plate are a clone. Molecular biology has borrowed 
the term to mean a collection of molecules or cells all identical to an original mol- 
ecule or cell. So, if a single bacterial cell harboring a recombinant DNA molecule 
in the form of a plasmid grows and multiplies on a petri plate to form a colony, the 
plasmids within the millions of cells in the bacterial colony represent a clone of the 
original DNA molecule, and these molecules can be isolated and studied. Further- 
more, if the cloned DNA molecule is a gene (or part of a gene)—that is, it encodes 
a functional product—a new avenue to isolating and studying this product has 
opened. Recombinant DNA methodology offers exciting new vistas in biochemistry. 


Plasmids Are Very Useful in Cloning Genes 


Plasmids are naturally occurring, circular, extrachromosomal DNA molecules (see 
Chapter 11). Natural strains of the common colon bacterium Escherichia coli isolated 
from various sources contain diverse plasmids. Often these plasmids carry genes 
specifying novel metabolic activities that are advantageous to the host bacterium. 
These activities range from catabolism of unusual organic substances to metabolic 
functions that endow the host cells with resistance to antibiotics, heavy metals, or 
bacteriophages. Plasmids that are able to perpetuate themselves in E. coli, the bac- 
terium favored by bacterial geneticists and molecular biologists, are the workhorses 


'The advent of molecular biology, like that of most scientific disciplines, generated a jargon all its own. 
Learning new fields often requires gaining familiarity with a new vocabulary. We will soon see that 
many words—vector, amplification, and insert are but a few examples—have been bent into new mean- 
ings to describe the marvels of molecular biology. 


of recombinant DNA technology. Because restriction endonuclease digestion of 
plasmids can generate fragments with overlapping or “sticky” ends, artificial plas- 
mids can be constructed by ligating different fragments together. Such artificial 
plasmids were among the earliest recombinant DNA molecules. These recombinant 
molecules can be autonomously replicated, and hence propagated, in suitable bac- 
terial host cells, provided they still possess a site signaling where DNA replication 
can begin (a so-called origin of replication or ori sequence). 


Plasmids as Cloning Vectors The idea arose that “foreign” DNA sequences could 
be inserted into artificial plasmids and that these foreign sequences would be car- 
ried into Ł. coli and propagated as part of the plasmid. That is, these plasmids could 
serve as cloning vectors to carry genes. (The word vector is used here in the sense of 
“a vehicle or carrier.”) Plasmids useful as cloning vectors possess three common fea- 
tures: a replicator, a selectable marker, and a cloning site (Figure 12.1). A replicator 
is an origin of replication, or ori. The selectable marker is typically a gene conferring 
resistance to an antibiotic. Only cells containing the cloning vector will grow in the 
presence of the antibiotic. Therefore, growth on antibiotic-containing media “se- 
lects for” plasmid-containing cells. Typically, the cloning site is a sequence of nu- 
cleotides representing one or more restriction endonuclease cleavage sites. Cloning 
sites are located where the insertion of foreign DNA neither disrupts the plasmid’s 
ability to replicate nor inactivates essential markers. 


Virtually Any DNA Sequence Can Be Cloned Nuclease cleavage at a restriction site 
opens, or linearizes, the circular plasmid so that a foreign DNA fragment can be in- 
serted. The ends of this linearized plasmid are joined to the ends of the fragment so 
that the circle is closed again, creating a recombinant plasmid (Figure 12.2). Recom- 
binant plasmids are hybrid DNA molecules consisting of plasmid DNA sequences plus 
inserted DNA elements (called inserts). Such hybrid molecules are also called chimeric 
constructs or chimeric plasmids. (The term chimera is borrowed from mythology and 
refers to a beast composed of the body and head of a lion, the heads of a goat and a 
snake, and the wings of a bat.) The presence of foreign DNA sequences does not ad- 
versely affect replication of the plasmid, so chimeric plasmids can be propagated in 
bacteria just like the original plasmid. Bacteria often harbor several hundred copies of 
common cloning vectors per cell. Hence, large amounts of a cloned DNA sequence 


pBR322 
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FIGURE 12.1 One of the first widely used cloning vec- 
tors, the plasmid pBR322. This 4363-bp plasmid contains 
an ori and genes for resistance to the drugs ampicillin 
(amp') and tetracycline (tet'). The locations of restriction 
endonuclease cleavage sites are indicated. 
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CENGAGENOW” ACTIVE FIGURE 12.2 An EcoRI restric- 
tion fragment of foreign DNA can be inserted into a 
plasmid having an EcoRI cloning site by (1) cutting the 
plasmid at this site with EcoRI, (2) annealing the lin- 
earized plasmid with the EcoRI foreign DNA fragment, 
and (3) sealing the nicks with DNA ligase. Test yourself 
on the concepts in this figure at www.cengage.com/ 
login. 
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can be recovered from bacterial cultures. The enormous power of recombinant DNA 
technology stems in part from the fact that virtually any DNA sequence can be selectively 
cloned and amplified in this manner. DNA sequences that are difficult to clone include in- 
verted repeats, origins of replication, centromeres, and telomeres. The only practical 
limitation is the size of the foreign DNA segment: Most plasmids with inserts larger 
than about 10 kbp are not replicated efficiently. However, bacteriophages such as 
bacteriophage A can be manipulated so that DNA sequences as large as 40 kbp can 
be inserted into the bacteriophage genome. Such recombinant phage DNA mole- 
cules lack essential A genes and replicate in E. coli as plasmids. 


Construction of Chimeric Plasmids Creation of chimeric plasmids requires join- 
ing the ends of the foreign DNA insert to the ends of a linearized plasmid. This 
ligation is facilitated if the ends of the plasmid and the insert have complementary, 
single-stranded overhangs. Then these ends can base-pair with one another, an- 
nealing the two molecules together. One way to generate such ends is to cleave the 
DNA with restriction enzymes that make staggered cuts; many such restriction endo- 
nucleases are available (see Table 10.2). For example, if the sequence to be inserted 


is an EcoRI fragment and the plasmid is cut with EcoRI, the single-stranded sticky 
ends of the two DNAs can anneal (Figure 12.2). The interruptions in the 
sugar—phosphate backbone of DNA can then be sealed with DNA ligase to yield a 
covalently closed, circular chimeric plasmid. DNA ligase is an enzyme that cova- 
lently links adjacent 3’-OH and 5’-PO, groups. An inconvenience of this strategy is 
that any pair of EcoRI sticky ends can anneal with each other. So, plasmid molecules 
can reanneal with themselves, as can the foreign DNA restriction fragments. These 
DNAs can be eliminated by selection schemes designed to identify only those bac- 
teria containing chimeric plasmids. 

Blunt-end ligation is an alternative method for joining different DNAs. The most 
widely used DNA ligase, bacteriophage T4 DNA ligase, is an ATP-dependent enzyme 
that can even ligate two DNA fragments whose ends lack overhangs (blunt-ended 
DNAs). Many restriction endonucleases cut double-stranded DNA so that blunt 
ends are formed. 

A great number of variations on these basic themes have emerged. For example, 
short synthetic DNA duplexes whose nucleotide sequence consists of little more 
than a restriction site can be blunt-end ligated onto any DNA. These short DNAs are 
known as linkers. Cleavage of the ligated DNA with the restriction enzyme then 
leaves tailor-made sticky ends useful in cloning reactions (Figure 12.3). Similarly, 
many vectors contain a polylinker cloning site, a short region of DNA sequence 
bearing numerous restriction sites. 


Promoters and Directional Cloning Note that the strategies discussed thus far 
create hybrids in which the orientation of the DNA insert within the chimera is ran- 
dom. Sometimes it is desirable to insert the DNA in a particular orientation. For ex- 
ample, an experimenter might wish to insert a particular DNA (a gene) in a vector 
so that its gene product is synthesized. To do this, the DNA must be placed down- 
stream from a promoter. A promoter is a nucleotide sequence lying upstream of a 
gene. The promoter controls expression of the gene. RNA polymerase molecules 
bind specifically at promoters and initiate transcription of adjacent genes, copying 
template DNA into RNA products. One way to insert DNA so that it will be properly 
oriented with respect to the promoter is to create DNA molecules whose ends have 
different overhangs. Ligation of such molecules into the plasmid vector can only 
take place in one orientation to give directional cloning (Figure 12.4). 
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(b) A vector cloning site containing multiple restriction sites, 
a so-called polylinker. 
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CENGAGENOW" Go to CengageNOW at 
www.cengage.com/login and click 
BiochemistryInteractive to explore blunt-end 
ligation. 


CENGAGENOW™ ANIMATED FIGURE 12.3 (a) The use of 
linkers to create tailor-made ends on cloning fragments. 
Note that the ligation reaction can add multiple linkers 
on each end of the blunt-ended DNA. EcoRI digestion 
removes all but the terminal one, leaving the desired 
5’-overhangs. (b) Cloning vectors often have polylinkers 
consisting of a multiple array of restriction sites at their 
cloning sites, so restriction fragments generated by a 
variety of endonucleases can be incorporated into the 
vector. Note that the polylinker is engineered not only 
to have multiple restriction sites but also to have an 
uninterrupted sequence of codons, so this region of the 
vector has the potential for translation into protein (see 
Figure 12.15). (Adapted from Figure 1.14.2 in Greenwich, D, and 
Brent, R., 2003. UNIT 1.14 Introduction to Vectors Derived from 
Filamentous Phages, in Current Protocols in Molecular Biology, 
Ausubel, F.M, Brent, R, Kingston, R. E., Moore, D. D., Seidman, J. G, 
Smith, J. A, and Struhl, K, eds. New York: John Wiley and Sons.) See 
this figure animated at www.cengage.com/login. 
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CENGAGENOW” ANIMATED FIGURE 12.4 Directional 
cloning. DNA molecules whose ends have different over- 
hangs can be used to form chimeric constructs in which 
the foreign DNA can enter the plasmid in only one ori- 
entation. The foreign DNA and the plasmid are digested 
with the same two enzymes. pUC stands for universal 
cloning plasmid. See this figure animated at www 
.cengage.com/login. HindIII 


Biologically Functional Chimeric Plasmids The first biologically functional 
chimeric DNA molecules constructed in vitro were assembled from parts of differ- 
ent plasmids in 1973 by Stanley Cohen, Annie Chang, Herbert Boyer, and Robert 
Helling. These plasmids were used to transform recipient £. coli cells (transformation 
means the uptake and replication of exogenous DNA by a recipient cell). To facili- 
tate transformation, the bacterial cells were rendered somewhat permeable to DNA 
by Ca?* treatment and a brief 42°C heat shock. Although less than 0.1% of the 
Ca**-treated bacteria became competent for transformation, transformed bacteria 
could be selected by their resistance to certain antibiotics (Figure 12.5). Conse- 
quently, the chimeric plasmids must have been biologically functional in at least two 
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© tet" gene is inactivated by the 
insertion of DNA fragment. 
amp" gene remains intact. 


Chimeric plasmid 
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@ Only tet" colonies appear; @ Using velvet-covered disc, bacterial © Only ampicillin-resistant 
tetë colonies can be colonies are lifted from surface of agar (amp") bacterial colonies 
recovered from amp" plate amp" plate and pressed briefly to surface grow. 
by comparing two plates. of plate containing tetracycline media. 


CENGAGENOW” ACTIVE FIGURE 12.5 A typical bacterial transformation experiment. Here the plasmid pBR322 
is the cloning vector. (1) Cleavage of pBR322 with BamHI, followed by (2) annealing and ligation of inserts 
generated by BamHI cleavage of some foreign DNA, (3) creates a chimeric plasmid. (4) The chimeric plasmid is 
then used to transform Ca**-treated heat-shocked E. coli cells, and the bacterial sample is plated on a petri 
plate. (5) Following incubation of the petri plate overnight at 37°C, (6) colonies of amp' bacteria are evident. 
(7) Replica plating of these bacteria on plates of tetracycline-containing media (8) reveals which colonies are 

tet‘ and which are tetracycline sensitive (tet’). Only the tet’ colonies possess plasmids with foreign DNA inserts. 
Test yourself on the concepts in this figure at www.cengage.com/login. 


aspects: They replicated stably within their hosts, and they expressed the drug re- 
sistance markers they carried. 

In general, plasmids used as cloning vectors are engineered to be small (2.5 kbp 
to about 10 kbp in size) so that the size of the insert DNA can be maximized. These 
plasmids have only a single origin of replication, so the time necessary for complete 
replication depends on the size of the plasmid. Under selective pressure in a grow- 
ing culture of bacteria, overly large plasmids are prone to delete any nonessential 
“genes,” such as any foreign inserts. Such deletion would thwart the purpose of 
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CENGAGENOW” ANIMATED FIGURE 12.6 A typical shuttle vector. LEU2* is a gene in the yeast pathway for 
leucine biosynthesis. The recipient yeast cells are LEU2~ (defective in this gene) and thus require leucine for 
growth. LEU2~ yeast cells transformed with this shuttle vector can be selected on medium lacking any leucine 
supplement. See this figure animated at www.cengage.com/login. 


most cloning experiments. The useful upper limit on cloned inserts in plasmids is 
about 10 kbp. Many eukaryotic genes exceed this size. 


Shuttle Vectors Are Plasmids That Can Propagate in Two 
Different Organisms 


Shuttle vectors are plasmids capable of propagating and transferring (“shuttling”) 
genes between two different organisms, one of which is typically a prokaryote 
(E. coli) and the other a eukaryote (for example, yeast). Shuttle vectors must have 
unique origins of replication for each cell type as well as different markers for se- 
lection of transformed host cells harboring the vector (Figure 12.6). Shuttle vectors 
have the advantage that eukaryotic genes can be cloned in bacterial hosts, yet the 
expression of these genes can be analyzed in appropriate eukaryotic backgrounds. 


Artificial Chromosomes Can Be Created from Recombinant DNA 


DNA molecules 2 megabase pairs in length have been successfully propagated in 
yeast by creating yeast artificial chromosomes or YACs. Furthermore, such YACs 
have been transferred into transgenic mice for the analysis of large genes or multi- 
genic DNA sequences in vivo, that is, within the living animal. For these large DNAs 
to be replicated in the yeast cell, YAC constructs must include not only an origin of 
replication (known in yeast terminology as an autonomously replicating sequence or 
ARS) but also a centromere and telomeres. Recall that centromeres provide the site 
for attachment of the chromosome to the spindle during mitosis and meiosis, and 
telomeres are nucleotide sequences defining the ends of chromosomes. Telomeres 
are essential for proper replication of the chromosome. 


12:28 What Is a DNA Library? 


A DNA library is a set of cloned fragments that collectively represent the genes of a 
specific organism. Particular genes can be isolated from DNA libraries, much as 
books can be obtained from conventional libraries. The secret is knowing where 
and how to look. 
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Combinatorial Libraries 


Specific recognition and binding of other molecules is a defining 
characteristic of any protein or nucleic acid. Often, target ligands 
of a particular protein are unknown, or in other instances, a 
unique ligand for a known protein may be sought in the hope of 
blocking the activity of the protein or otherwise perturbing its func- 
tion. Or, the hybridization of nucleic acids with each other accord- 
ing to base-pairing rules, as an act of specific recognition, can be 
exploited to isolate or identify pairing partners. Combinatorial li- 
braries are the products of strategies to facilitate the identification 
and characterization of macromolecules (proteins, DNA, RNA) 
that interact with small-molecule ligands or with other macromole- 
cules. Unlike genomic libraries, combinatorial libraries consist of 
synthetic oligomers. Arrays of synthetic oligonucleotides printed as 
tiny dots on miniature solid supports are known as DNA chips. (See 
the section titled “DNA Microarrays (Gene Chips) Are Arrays of Dif- 
ferent Oligonucleotides Immobilized on a Chip.”) 

Specifically, combinatorial libraries contain very large numbers 
of chemically synthesized molecules (such as peptides or oligonu- 
cleotides) with randomized sequences or structures. Such libraries 
are designed and constructed with the hope that one molecule 
among a vast number will be recognized as a ligand by the protein 
(or nucleic acid) of interest. If so, perhaps that molecule will be 
useful in a pharmaceutical application. For instance, the synthetic 
oligomer may serve as a drug to treat a disease involving the pro- 
tein to which it binds. 

An example of this strategy is the preparation of a synthetic 
combinatorial library of hexapeptides. The maximum number of 
sequence combinations for hexapeptides is 20°, or 64,000,000. 
One approach to simplify preparation and screening possibilities 


for such a library is to specify the first two amino acids in the hexa- 
peptide while the next four are randomly chosen. In this approach, 
400 libraries (20°) are synthesized, each of which is unique in terms 
of the amino acids at positions 1 and 2 but random at the other 
four positions (as in AAXXXX, ACXXXX, ADXXXX, etc.), so each 
of the 400 libraries contains 20‘, or 160,000, different sequence 
combinations. Screening these libraries with the protein of interest 
reveals which of the 400 libraries contains a ligand with high affin- 
ity. Then, this library is expanded systematically by specifying the 
first three amino acids (knowing from the chosen 1-of-400 libraries 
which amino acids are best as the first two); only 20 synthetic 
libraries (each containing 20°, or 8000, hexapeptides) are made 
here (one for each third-position possibility, the remaining three 
positions being randomized). Selection for ligand binding, again 
with the protein of interest, reveals the best of these 20, and this 
particular library is then varied systematically at the fourth posi- 
tion, creating 20 more libraries (each containing 207, or 400, 
hexapeptides). This cycle of synthesis, screening, and selection is 
repeated until all six positions in the hexapeptide are optimized 
to create the best ligand for the protein. A variation on this basic 
strategy using synthetic oligonucleotides rather than peptides 
identified a unique 15-mer (sequence GGTTGGTGTGGTTGG) 
with high affinity (Kp = 2.7 nM) toward thrombin, a serine pro- 
tease in the blood coagulation pathway. Thrombin is a major target 
for the pharmacological prevention of clot formation in coronary 
thrombosis. 


From Cortese, R., 1996. Combinatorial Libraries: Synthesis, Screening and Ap- 
plication Potential. Berlin: Walter de Gruyter. 





Genomic Libraries Are Prepared from the Total DNA in an Organism 


Any particular gene constitutes only a small part of an organism’s genome. For ex- 
ample, if the organism is a mammal whose entire genome exceeds 10° kbp and the 
gene is 10 kbp, then the gene represents less than 0.001% of the total nuclear DNA. 
It is impractical to attempt to recover such rare sequences directly from isolated nu- 
clear DNA because of the overwhelming amount of extraneous DNA sequences. In- 
stead, a genomic library is prepared by isolating total DNA from the organism, di- 
gesting it into fragments of suitable size, and cloning the fragments into an 
appropriate vector. This approach is called shotgun cloning because the strategy has no 
way of targeting a particular gene but instead seeks to clone all the genes of the or- 
ganism at one time. The intent is that at least one recombinant clone will contain at 
least part of the gene of interest. Usually, the isolated DNA is only partially digested 
by the chosen restriction endonuclease so that not every restriction site is cleaved in 
every DNA molecule. Then, even if the gene of interest contains a susceptible re- 
striction site, some intact genes might still be found in the digest. Genomic libraries 
have been prepared from thousands of different species. 

Many clones must be created to be confident that the genomic library contains 
the gene of interest. The probability, P, that some number of clones, N, contains a 
particular fragment representing a fraction, f, of the genome is 


P=L= (T=) 
Thus, 
Z In (1 — P) 


In (1 = f) 


For example, if the library consists of 10-kbp fragments of the E. coli genome (4640 kbp 
total), more than 2000 individual clones must be screened to have a 99% probability 
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(P = 0.99) of finding a particular fragment. Since f = 10/4640 = 0.0022 and P = 0.99, 
N = 2093. For a 99% probability of finding a particular sequence within the 3 X 10° 
kbp human genome, N would equal almost 1.4 million if the cloned fragments aver- 
aged 10 kbp in size. The need for cloning vectors capable of carrying very large DNA 
inserts becomes obvious from these numbers. 


Libraries Can Be Screened for the Presence of Specific Genes 


A common method of screening genomic libraries is to carry out a colony 
hybridization experiment. In a typical experiment, host bacteria containing a 
plasmid-based library are plated out on a petri dish and allowed to grow overnight 
to form colonies (Figure 12.7). A replica of the bacterial colonies is then obtained 
by overlaying the plate with a flexible, absorbent disc. The disc is removed, treated 
with alkali to dissociate bound DNA duplexes into single-stranded DNA, dried, and 
placed in a sealed bag with labeled probe (see the Critical Developments in Bio- 
chemistry box on page 364). If the probe DNA is duplex DNA, it must be denatured 
by heating at 70°C. The probe and target DNA complementary sequences must be 
in a single-stranded form if they are to hybridize with one another. Any DNA se- 
quences complementary to probe DNA will be revealed by autoradiography of the 
absorbent disc. Bacterial colonies containing clones bearing target DNA are identi- 
fied on the film and can be recovered from the master plate. 


Probes for Southern Hybridization Can Be Prepared 
in a Variety of Ways 


Clearly, specific probes are essential reagents if the goal is to identify a particular 
gene against a background of innumerable DNA sequences. Usually, the probes that 
are used to screen libraries are nucleotide sequences that are complementary to 
some part of the target gene. Making useful probes requires some information about 
the gene’s nucleotide sequence. Sometimes such information is available. Alterna- 
tively, if the amino acid sequence of the protein encoded by the gene is known, it is 
possible to work backward through the genetic code to the DNA sequence (Figure 
12.8). Because the genetic code is degenerate (that is, several codons may specify the 
same amino acid; see Chapter 30), probes designed by this approach are usually 
degenerate oligonucleotides about 17 to 50 residues long (such oligonucleotides are 
so-called 17- to 50-mers). The oligonucleotides are synthesized so that different bases 
are incorporated at sites where degeneracies occur in the codons. The final prepa- 
ration thus consists of a mixture of equal-length oligonucleotides whose sequences 
vary to accommodate the degeneracies. Presumably, one oligonucleotide sequence 
in the mixture will hybridize with the target gene. These oligonucleotide probes are 
at least 17-mers because shorter degenerate oligonucleotides might hybridize with 
sequences unrelated to the target sequence. 

A piece of DNA from the corresponding gene in a related organism can also be 
used as a probe in screening a library for a particular gene. Such probes are termed 


CENGAGENOW” ACTIVE FIGURE 12.7 Screening a genomic library by colony hybridization. Host bacteria 
transformed with a plasmid-based genomic library are plated on a petri plate and incubated overnight to 
allow bacterial colonies to form. A replica of the colonies is obtained by overlaying the plate with a flexible disc 
composed of absorbent material (such as nitrocellulose or nylon) (1). Nitrocellulose strongly binds nucleic 
acids; single-stranded nucleic acids are bound more tightly. Once the disc has taken up an impression of the 
bacterial colonies, it is removed and the petri plate is set aside and saved. The disc is treated with 2 M NaOH, 
neutralized, and dried (2). NaOH both lyses any bacteria (or phage particles) and dissociates the DNA strands. 
When the disc is dried, the DNA strands become immobilized on the filter. The dried disc is placed in a sealable 
plastic bag, and a solution containing heat-denatured (single-stranded), labeled probe is added (3). The bag is 
incubated to allow annealing of the probe DNA to any target DNA sequences that might be present on the 
disc. The filter is then washed, dried, and placed on a piece of X-ray film to obtain an autoradiogram (4). The 
position of any spots on the X-ray film reveals where the labeled probe has hybridized with target DNA (5). 
The location of these spots can be used to recover the genomic clone from the bacteria on the original petri 
plate. Test yourself on the concepts in this figure at www.cengage.com/login. 





heterologous probes because they are not derived from the homologous (same) 
organism. 

Problems arise if a complete eukaryotic gene is the cloning target; eukaryotic 
genes can be tens or even hundreds of kilobase pairs in size. Genes this size are frag- 
mented in most cloning procedures. Thus, the DNA identified by the probe may 
represent a clone that carries only part of the desired gene. However, most cloning 
strategies are based on a partial digestion of the genomic DNA, a technique that 
generates an overlapping set of genomic fragments. This being so, DNA segments 
from the ends of the identified clone can now be used to probe the library for 
clones carrying DNA sequences that flanked the original isolate in the genome. Re- 
peating this process ultimately yields the complete gene among a subset of overlap- 
ping clones. 


cDNA Libraries Are DNA Libraries Prepared from mRNA 


cDNAs are DNA molecules copied from mRNA templates. cDNA libraries are con- 
structed by synthesizing cDNA from purified cellular mRNA. These libraries pre- 
sent an alternative strategy for gene isolation, especially eukaryotic genes. Because 
most eukaryotic mRNAs carry 3’-poly(A) tails, mRNA can be selectively isolated 
from preparations of total cellular RNA by oligo(dT)-cellulose chromatography 
(Figure 12.9). DNA copies of the purified mRNAs are synthesized by first anneal- 
ing short oligo(dT) chains to the poly(A) tails. These oligo(dT) chains serve as 
primers for reverse transcriptase—driven synthesis of DNA (Figure 12.10). [Ran- 
dom oligonucleotides can also be used as primers, with the advantages being less 
dependency on poly(A) tracts and increased likelihood of creating clones repre- 
senting the 5’-ends of mRNAs.] Reverse transcriptase is an enzyme that synthe- 
sizes a DNA strand, copying RNA as the template. DNA polymerase is then used 
to copy the DNA strand and form a double-stranded (duplex DNA) molecule. 
Linkers are then added to the DNA duplexes rendered from the mRNA 
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CENGAGENOW” ANIMATED FIGURE 12.9 Isolation of eukaryotic mRNA via oligo(dT)-cellulose chromatography. 
(a) In the presence of 0.5 M NaCl, the poly(A) tails of eukaryotic MRNA anneal with short oligo(dT) chains cova- 
lently attached to an insoluble chromatographic matrix such as cellulose. Other RNAs, such as rRNA (green), pass 
right through the chromatography column. (b) The column is washed with more 0.5 M NaCl to remove residual 
contaminants. (c) Then the poly(A) mRNA (red) is recovered by washing the column with water because the 
base pairs formed between the poly(A) tails of the mRNA and the oligo(dT) chains are unstable in solutions of 
low ionic strength. See this figure animated at www.cengage.com/login. 
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Known amino acid sequence: 
Phe Met Glu Trp His Lys Asn 


Possible mRNA sequence: 


UUU AUG GAA UGG CAU AGG AAU 
CAC AAA AAC 


UUC GAG 
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CRITICAL DEVELOPMENTS IN BIOCHEMISTRY 


Identifying Specific DNA Sequences by Southern Blotting (Southern Hybridization) 


Any given DNA fragment is unique solely by virtue of its specific 
nucleotide sequence. The only practical way to find one particu- 
lar DNA segment among a vast population of different DNA frag- 
ments (such as you might find in genomic DNA preparations) is 
to exploit its sequence specificity to identify it. In 1975, E. M. 
Southern invented a technique capable of doing just that. 


Electrophoresis 


Southern first fractionated a population of DNA fragments 
according to size by gel electrophoresis (see step 2 in figure). The 
electrophoretic mobility of a nucleic acid is inversely proportional 
to its molecular mass. Polyacrylamide gels are suitable for separa- 
tion of nucleic acids of 25 to 2000 bp. Agarose gels are better if the 
DNA fragments range up to 10 times this size. Most preparations 
of genomic DNA show a broad spectrum of sizes, from less than 1 
kbp to more than 20 kbp. Typically, no discrete-size fragments are 
evident following electrophoresis, just a “smear” of DNA through- 
out the gel. 


Blotting 


Once the fragments have been separated by electrophoresis 
(step 3), the gel is soaked in a solution of NaOH. Alkali dena- 
tures duplex DNA, converting it to single-stranded DNA. After 
the pH of the gel is adjusted to neutrality with buffer, a sheet of 
absorbent material soaked in a concentrated salt solution is then 
placed over the gel, and salt solution is drawn through the gel in 
a direction perpendicular to the direction of electrophoresis 
(step 4). The salt solution is pulled through the gel in one of 
three ways: capillary action (blotting), suction (vacuum blotting), or 
electrophoresis (electroblotting). The movement of salt solution 
through the gel carries the DNA to the absorbent sheet, which 
binds the single-stranded DNA molecules very tightly, effectively 
immobilizing them in place on the sheet. Note that the distribu- 


tion pattern of the electrophoretically separated DNA is main- 
tained when the single-stranded DNA molecules bind to the ab- 
sorbent sheet (step 5 in figure). The sheet is then dried. Next, in 
the prehybridization step, the sheet is incubated with a solution 
containing protein (serum albumin, for example) and/or a 
detergent such as sodium dodecylsulfate. The protein and 
detergent molecules saturate any remaining binding sites 
for DNA on the absorbent sheet, so no more DNA can bind 
nonspecifically. 


Hybridization 

To detect a particular DNA within the electrophoretic smear of 
countless DNA fragments, the prehybridized sheet is incubated in 
a sealed plastic bag with a solution of specific probe molecules 
(step 6 in figure). A probe is usually a single-stranded DNA of de- 
fined sequence that is distinctively labeled, either with a radioac- 
tive isotope (such as °P) or some other easily detectable tag. The 
nucleotide sequence of the probe is designed to be complemen- 
tary to the sought-for or target DNA fragment. The single-stranded 
probe DNA anneals with the single-stranded target DNA bound 
to the sheet through specific base pairing to form a DNA duplex. 
This annealing, or hybridization as it is usually called, labels the 
target DNA, revealing its position on the sheet. For example, if 
the probe is **P-labeled, its location can be detected by autoradi- 
ographic exposure of a piece of X-ray film laid over the sheet 
(step 7 in figure). 

Southern’s procedure has been extended to the identification of 
specific RNA and protein molecules. In a play on Southern’s name, 
the identification of particular RNAs following separation by gel 
electrophoresis, blotting, and probe hybridization is called North- 
ern blotting. The analogous technique for identifying protein mol- 
ecules is termed Western blotting. In Western blotting, the probe of 
choice is usually an antibody specific for the target protein. 


> The Southern blotting technique involves the transfer of electrophoretically 
separated DNA fragments to an absorbent sheet and subsequent detection of 
specific DNA sequences. A preparation of DNA fragments [typically a restriction 
digest, (1)] is separated according to size by gel electrophoresis (2). The separa- 
tion pattern can be visualized by soaking the gel in ethidium bromide to stain 
the DNA and then illuminating the gel with UV light (3). Ethidium bromide mol- 
ecules intercalated between the hydrophobic bases of DNA are fluorescent 
under UV light. The gel is soaked in strong alkali to denature the DNA and then 
neutralized in buffer. Next, the gel is placed on a sheet of DNA-binding material 
and concentrated salt solution is passed through the gel (4) to carry the DNA 
fragments out of the gel where they are bound tightly to the sheet (5). 
Incubation of the sheet with a solution of labeled, single-stranded probe DNA 
(6) allows the probe to hybridize with target DNA sequences complementary to 
it. The location of these target sequences is then revealed by an appropriate 
means of detection, such as autoradiography (7). 
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CENGAGENOW” ACTIVE FIGURE 12.10 Reverse 
transcriptase—driven synthesis of cDNA from oligo(dT) 
primers annealed to the poly(A) tails of purified eukary- 
otic MRNA. (a) Oligo(dT) chains serve as primers for 
synthesis of a DNA copy of the mRNA by reverse tran- 
scriptase. Following completion of first-strand CDNA 
synthesis by reverse transcriptase, RNase H and DNA 
polymerase are added (b). RNase H specifically digests 
RNA strands in DNA:RNA hybrid duplexes. DNA poly- 
merase copies the first-strand cDNA, using as primers the 
residual RNA segments after RNase H has created nicks 
and gaps (c). DNA polymerase has a 5'—3’ exonuclease 
activity that removes the residual RNA as it fills in with 
DNA. The nicks remaining in the second-strand DNA are 
sealed by DNA ligase (d), yielding duplex cDNA. EcoRI 
adapters with 5’-overhangs are then ligated onto the 
cDNA duplexes (e) using phage T4 DNA ligase to create 
EcoRl-ended cDNA for insertion into a cloning vector. 
Test yourself on the concepts in this figure at 
www.cengage.com/login. 
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templates, and the cDNA is cloned into a suitable vector. Once a cDNA derived 
from a particular gene has been identified, the cDNA becomes an effective probe 
for screening genomic libraries for isolation of the gene itself. 

Because different cell types in eukaryotic organisms express selected subsets of 
genes, RNA preparations from cells or tissues in which genes of interest are selec- 
tively transcribed are enriched for the desired mRNAs. cDNA libraries prepared 
from such mRNA are representative of the pattern and extent of gene expression 
that uniquely define particular kinds of differentiated cells. cDNA libraries of many 
normal and diseased human cell types are commercially available, including cDNA 
libraries of many tumor cells. Comparison of normal and abnormal cDNA libraries, 
in conjunction with two-dimensional gel electrophoretic analysis (see Appendix to 
Chapter 5) of the proteins produced in normal and abnormal cells, is a promising 
new Strategy in clinical medicine to understand disease mechanisms. 


Expressed Sequence Tags When a cDNA library is prepared from the mRNAs syn- 
thesized in a particular cell type under certain conditions, these cDNAs represent 
the nucleotide sequences (genes) that have been expressed in this cell type under 
these conditions. Expressed sequence tags (ESTs) are relatively short (~200 nucleo- 
tides or so) sequences obtained by determining a portion of the nucleotide se- 
quence for each insert in randomly selected cDNAs. An EST represents part of a 
gene that is being expressed. Probes derived from ESTs can be labeled, radioactively 
or otherwise, and used in hybridization experiments to identify which genes in a ge- 
nomic library are being expressed in the cell. For example, labeled ESTs can be hy- 
bridized to a gene chip (see following discussion). 
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HUMAN BIOCHEMISTRY 


The Human Genome Project 


Completed in 2003, the Human Genome Project was a 13-year col- 
laborative international, government- and private-sponsored effort 
to map and sequence the entire human genome, some 3 billion 
base pairs distributed among the two sex chromosomes (X and Y) 
and 22 autosomes (chromosomes that are not sex chromosomes). 
A primary goal was to identify and map at least 3000 genetic mark- 
ers (genes or other recognizable loci on the DNA), which were 
evenly distributed throughout the chromosomes at roughly 100-kb 
intervals. At the same time, determination of the entire nucleotide 
sequence of the human genome was undertaken. J. Craig Venter 
and colleagues working at Celera, a private corporation, took an 
alternative approach based on computer alignment of sequenced 
human DNA fragments. A working draft of the human genome 
was completed in June 2000 and published in February 2001. An 
ancillary part of the project has focused on sequencing the 
genomes of other species (such as yeast, Drosophila melanogaster 
[the fruit fly], mice, and Arabidopsis thaliana [a plant]) to reveal 
comparative aspects of genetic and sequence organization (Table 
12.1). Information about whole genome sequences of organisms 
has created a new branch of science called bioinformatics: the 
study of the nature and organization of biological information. 
Bioinformatics includes such approaches as functional genomics 
and proteomics. Functional genomics addresses global issues of gene 
expression, such as looking at all the genes that are activated dur- 
ing major metabolic shifts (as from growth under aerobic to 
growth under anaerobic conditions) or during embryogenesis and 
development of organisms. Transcriptome is the word used in 
functional genomics to define the entire set of genes expressed (as 
mRNAs transcribed from DNA) in a particular cell or tissue under 
defined conditions. Functional genomics also provides new in- 
sights into evolutionary relationships between organisms. Pro- 
teomics is the study of all the proteins expressed by a certain cell or 
tissue under specified conditions. Typically, this set of proteins is 
revealed by running two-dimensional polyacrylamide gel elec- 
trophoresis on a cellular extract or by coupling protein separation 
techniques to mass spectrometric analysis. 

The Human Genome Project has proven to be very beneficial 
to medicine. Many human diseases have been traced to genetic de- 
fects whose position within the human genome has been identi- 
fied. As of 2007, the Human Gene Mutation Database (HGMD) 
listed more than 56,000 mutations in more than 2100 nuclear 
genes associated with human disease. Among these are 


cystic fibrosis gene 
the breast cancer genes, BRCA] and BRCA2 


Duchenne muscular dystrophy gene* (at 2.4 megabases, one of the 
largest known genes in any organism) 


*X-chromosome-linked gene. As of 2007, more than 295 disease-related 
genes have been mapped to the X chromosome (source: the GeneCards 
website at the Weizmann Institute of Science, Israel.) 


Huntington's disease gene 
neurofibromatosis gene 
neuroblastoma gene (a form of brain cancer) 
amyotrophic lateral sclerosis gene (Lou Gehrig’s disease) 
melanocortin-4 receptor gene (obesity and binge eating) 
fragile X-linked mental retardation gene* 
as well as genes associated with the development of diabetes, a 


variety of other cancers, and affective disorders such as schizophre- 
nia and bipolar affective disorder (manic depression). 











Genome Year 
Genome Size? Completed 
Bacteriophage ¢X174 0.0054 1977 
Bacteriophage A 0.048 1982 
Marchantia chloroplast genome 0.187 1986 
Vaccinia virus 0.192 1990 
Cytomegalovirus (CMV) 0.229 1991 
Marchantia mitochondrial genome 0.187 1992 
Variola (smallpox) virus 0.186 1993 
Haemophilus influenzae 1.830 1995 
(Gram-negative bacterium) 
Mycobacterium genitalium 0.58 1995 
(mycobacterium) 
Escherichia coli (Gram-negative 4.64 1996 
bacterium) 
Saccharomyces cerevisiae (yeast) 12.1 1996 
Methanococcus jannaschii 1.66 1998 
(archaeon) 
Arabidopsis thaliana (green plant) 115 2000 
Caenorhabditis elegans (simple 88 1998 
animal: nematode worm) 
Drosophila melanogaster (fruit fly) 117 2000 
Homo sapiens (human) 3038 2001 
Pan troglodytes (chimpanzee) 3109 2005 











'Data available from the National Center for Biotechnology Information at the National 
Library of Medicine. Website: http://vww.ncbi.nim.nih.gov/ 

?Genome size is given as millions of base pairs (mb). 

3Marchantia is a bryophyte (a nonvascular green plant). 

‘The first complete sequence for the genome of a free-living organism. 





DNA Microarrays (Gene Chips) Are Arrays of Different Oligonucleotides 
Immobilized on a Chip 


Robotic methods can be used to synthesize combinatorial libraries of DNA 
oligonucleotides directly on a solid support, such that the completed library is a 
two-dimensional array of different oligonucleotides (see the Critical Developments 
in Biochemistry box on combinatorial libraries, page 361). Synthesis is performed 
by phosphoramidite chemistry (Figure 11.29) adapted into a photochemical 
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process that can be controlled by light. Computer-controlled masking of the light 
allows chemistry to take place at some spots in the two-dimensional array of grow- 
ing oligonucleotides and not at others, so each spot on the array is a population of 
identical oligonucleotides of unique sequence. The final products of such proce- 
dures are referred to as “gene chips” because the oligonucleotide sequences syn- 
thesized upon the chip represent the sequences of chosen genes. Typically, the 
oligonucleotides are up to 25 nucleotides long (there are more than 101° possible 
sequence arrangements for 25-mers made from four bases), and as many as 
100,000 different oligonucleotides can be arrayed on a chip 1 cm square. The 
oligonucleotides on such gene chips are used as the probes in a hybridization ex- 
periment to reveal gene expression patterns. Figure 12.11 shows one design for 
gene chip analysis of gene expression. 
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hips (DNA microarrays) in the analysis of gene expression. Here 

is one of many analytical possibilities based on DNA microarray technology: (1) Gene 
e, ESTs) are isolated and amplified by PCR (see Figure 12.18), 
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create a gene chip. The gene chip usually is considered the “probe” in a “target: probe" Emission 
screening experiment. (2) Target preparation: Total RNA from two sets of cell treat- 

ments (control and test treatment) are isolated, and cDNA is produced from the two 

batches of RNA via reverse transcriptase. During CDNA production, the control is la- 

F ; oe © © 0e@ © 
beled with a specific fluorescent marker (green, for example) and the test treatment © 0000 © © ee 
is labeled with a different fluorescent marker (red, for example), so the wavelength of © 00 © 000000 

t ieas : © 0000 0000 © 
fluorescence allows discrimination between the two different sets of cDNAs. The two e © o © o o 
batches of labeled cDNA are pooled and hybridized to the gene chip. Laser excitation 
of the hybridized gene chip with light of appropriate wavelength allows collection of ý 
data indicating the intensities of fluorescence, and hence the degree of hybridization 
of the two different probes with the gene chips. Because the location of genes on : 
the gene chip is known, which genes are expressed (or not) and the degree to which @e@0000 
they are expressed is revealed by the fluorescent patterns. (Adapted from Figure 1 in @e © ce 
Duggan, D.J., et al., 1999. Expression profiling using CDNA microarrays. Nature Genetics 21 supple- ý 
ment:10-14.) Computer analysis 
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Expression Vectors Are Engineered So That the RNA or Protein Products 
of Cloned Genes Can Be Expressed 


Expression vectors are engineered so that any cloned insert can be transcribed into 
RNA, and, in many instances, even translated into protein. cDNA expression li- 
braries can be constructed in specially designed vectors. Proteins encoded by the 
various CDNA clones within such expression libraries can be synthesized in the host 
cells, and if suitable assays are available to identify a particular protein, its corre- 
sponding cDNA clone can be identified and isolated. Expression vectors designed 
for RNA expression or protein expression, or both, are available. 


RNA Expression A vector for in vitro expression of DNA inserts as RNA transcripts 
can be constructed by putting a highly efficient promoter adjacent to a versatile 
cloning site. Figure 12.12 depicts such an expression vector. Linearized recombi- 
nant vector DNA is transcribed in vitro using SP6 RNA polymerase. Large amounts 
of RNA product can be obtained in this manner; if radioactive or fluorescent- 
labeled ribonucleotides are used as substrates, labeled RNA molecules useful as 
probes are made. 


Protein Expression Because cDNAs are DNA copies of mRNAs, cDNAs are unin- 
terrupted copies of the exons of expressed genes. Because cDNAs lack introns, it 
is feasible to express these cDNA versions of eukaryotic genes in prokaryotic hosts 
that cannot process the complex primary transcripts of eukaryotic genes. To ex- 
press a eukaryotic protein in FE. coli, the eukaryotic cDNA must be cloned in an ex- 
pression vector that contains regulatory signals for both transcription and transla- 
tion. Accordingly, a promoter where RNA polymerase initiates transcription as well 
as a ribosome-binding site to facilitate translation are engineered into the vector just 
upstream from the restriction site for inserting foreign DNA. The AUG initiation 
codon that specifies the first amino acid in the protein (the translation start site) is 
contributed by the insert (Figure 12.13). 

Strong promoters have been constructed that drive the synthesis of foreign pro- 
teins to levels equal to 30% or more of total E. coli cellular protein. An example is the 
hybrid promoter, fac, Which was created by fusing part of the promoter for the E. colt 
genes encoding the enzymes of lactose metabolism (the lac promoter) with part of 
the promoter for the genes encoding the enzymes of tryptophan biosynthesis (the 
trp promoter) (Figure 12.14). In cells carrying pac expression vectors, the pac pro- 
moter is not induced to drive transcription of the foreign insert until the cells are ex- 
posed to inducers that lead to its activation. Analogs of lactose (a B-galactoside) such 
as isopropyl-B-thiogalactoside, or IPTG, are excellent inducers of pac Thus, expression 
of the foreign protein is easily controlled. (See Chapter 29 for detailed discussions 
of inducible gene expression.) 

Perhaps the most widely used protein expression system is based on the pET plas- 
mid. Transcription of the cloned gene insert is under the control of the bacterio- 
phage T7 RNA polymerase promoter in pET. This promoter is not recognized by 
the E. coli RNA polymerase, so transcription can only occur if the T7 RNA po- 
lymerase is present in host cells. Host E. coli cells are engineered so that the T7 RNA 
polymerase gene is inserted in the host chromosome under the control of the lac 
promoter. IPTG induction triggers T7 RNA polymerase production and subsequent 
transcription and translation of the pET insert. The bacteriophage T7 RNA po- 
lymerase is so active that most of the host cell’s resources are directed into protein 
expression and levels of expressed protein approach 50% of total cellular protein. 

The bacterial production of valuable eukaryotic proteins represents one of the 
most important uses of recombinant DNA technology. For example, human insulin 
for the clinical treatment of diabetes is now produced in bacteria. 
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CENGAGENOW” ANIMATED FIGURE 12.12 Expression 
vectors carrying the promoter recognized by the RNA 
polymerase of bacteriophage SP6 are useful for the pro- 
duction of multiple RNA copies of any DNA inserted at 
the polylinker. Before transcription is initiated, the circu- 
lar expression vector is linearized by a single cleavage at 
or near the end of the insert so that transcription termi- 
nates at a fixed point. See this figure animated at 
www.cengage.com/login. 
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CENGAGENOW” ANIMATED FIGURE 12.14 A p< protein 
expression vector contains the hybrid promoter Prac 
derived from fusion of the lac and trp promoters. 
Isopropyl-B-b-thiogalactoside, or IPTG, induces expres- 
sion from pc. See this figure animated at www 
.cengage.com/login. 





Analogous systems for expression of foreign genes in eukaryotic cells include vec- 
tors carrying promoter elements derived from mammalian viruses, such as simian 
virus 40 (SV40), the Epstein—Barr virus, and the human cytomegalovirus (CMV). A sys- 
tem for high-level expression of foreign genes uses insect cells infected with the bac- 
ulovirus expression vector. Baculoviruses infect lepidopteran insects (butterflies and 
moths). In engineered baculovirus vectors, the foreign gene is cloned downstream 
of the promoter for polyhedrin, a major viral-encoded structural protein, and the 
recombinant vector is incorporated into insect cells grown in culture. Expression 
from the polyhedrin promoter can lead to accumulation of the foreign gene prod- 
uct to levels as high as 500 mg/L. 

Technologies for the expression of recombinant proteins in mammalian cell cul- 
tures are commercially available. These technologies have the advantage that the 
unique post-translational modifications of proteins (such as glycosylation; see Chap- 
ter 31) seen in mammalian cells take place in vivo so that the expressed protein is 
produced in its naturally occurring form. 


Screening cDNA Expression Libraries with Antibodies Antibodies that specifically 
cross-react with a particular protein of interest are often available. If so, these anti- 
bodies can be used to screen a cDNA expression library to identify and isolate cDNA 
clones encoding the protein. The cDNA library is introduced into host bacteria, 
which are plated out and grown overnight, as in the colony hybridization scheme pre- 
viously described. DNA-binding nylon membranes are placed on the plates to obtain 
a replica of the bacterial colonies. The nylon membrane is then incubated under con- 
ditions that induce protein synthesis from the cloned cDNA inserts, and the cells are 
treated to release the synthesized protein. The synthesized protein binds tightly to the 
nylon membrane, which can then be incubated with the specific antibody. Binding of 
the antibody to its target protein product reveals the position of any cDNA clones ex- 
pressing the protein, and these clones can be recovered from the original plate. Like 
other libraries, expression libraries can be screened with oligonucleotide probes, too. 


Fusion Protein Expression Some expression vectors carry cDNA inserts cloned 
directly into the coding sequence of a vector-borne protein-coding gene (Figure 
12.15). Translation of the recombinant sequence leads to synthesis of a hybrid pro- 
tein or fusion protein. The N-terminal region of the fused protein represents amino 
acid sequences encoded in the vector, whereas the remainder of the protein is en- 
coded by the foreign insert. Keep in mind that the triplet codon sequence within 
the cloned insert must be in phase with codons contributed by the vector se- 
quences to make the right protein. The N-terminal protein sequence contributed 
by the vector can be chosen to suit purposes. Furthermore, adding an N-terminal 
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Codon: Cys Gln Lys Gly Asp Pro Ser Thr Leu Glu Ser Leu Ser Met 


Cloning site: TGT CAA AAA GGG GAT CCG TCG ACT CTA GAA AGC TTA TCG ATG 
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CENGAGENOW” ANIMATED FIGURE 12.15 A typical expression vector for the synthesis of a hybrid protein. 
The cloning site is located at the end of the coding region for the protein B-galactosidase. Insertion of foreign 
DNAs at this site fuses the foreign sequence to the B-galactosidase coding region (the lacZ gene). IPTG 
induces the transcription of the lacZ gene from its promoter Pia, causing expression of the fusion protein. 
(Adapted from Figure 2, Ruther, U., and Muller-Hill, B., 1983. EMBO Journal 2:1791-1794. See this figure animated at 
www.cengage.com/login. 


signal sequence that targets the hybrid protein for secretion from the cell simpli- 
fies recovery of the fusion protein. A variety of gene fusion systems have been de- 
veloped to facilitate isolation of a specific protein encoded by a cloned insert. The 
isolation procedures are based on affinity chromatography purification of the fu- 
sion protein through exploitation of the unique ligand-binding properties of the 
vector-encoded protein (Table 12.2). 


Reporter Gene Constructs Are Chimeric DNA Molecules Composed 
of Gene Regulatory Sequences Positioned Next to an Easily 
Expressible Gene Product 


Potential regulatory regions of genes (such as promoters) can be investigated by 
placing these regulatory sequences into plasmids upstream of a gene, called a 
reporter gene, whose expression is easy to measure. Such chimeric plasmids are 












TABLE 12.2 | 

Fusion Protein Secreted?* Affinity Ligand 

B-Galactosidase No p-Aminophenyl-£-p-thiogalactoside 
(APTG) 

Protein A Yes Immunoglobulin G (IgG) 

Chloramphenicol acetyltransferase Yes Chloramphenicol 

(CAT) 

Streptavidin Yes Biotin 

Glutathione-S-transferase (GST) No Glutathione 

Maltose-binding protein (MBP) Yes Starch 

Hexahistidine tag No Nickel or cobalt 

Hemagglutinin (HA) peptide No HA-peptide antibody 











*This indicates whether combined secretion-fusion gene systems have led to secretion of the protein product from the 
cells, which simplifies its isolation and purification. 
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FIGURE 12.16 Green fluorescent protein (GFP) as a re- 
porter gene. In the experiment here, GFP expression 
depends on the promoter for the Drosophilia melano- 
gaster Tdc2 gene. Tdc2 encodes a neuronal tyrosine de- 
carboxylase (TDC) whose expression is necessary for 
egg laying in fruit flies. (Bottom) Green fluorescence 
highlights neuronal projections expressing the Tdc2 
gene. (Top) Diagram of a fly, its nervous system, and 
ovaries. Note that Tdc2 neurons innervate the ovaries 
and oviducts of flies. (See Cole, S. H., et al., 2005. Two func- 
tional but noncomplementing Drosophila tyrosine decarboxy- 
lase genes. Journal of Biological Chemistry 280:14948-14955. GFP 
image courtesy of Shannon H. Cole and Jay Hirsh, the University 
of Virginia. Fly image derived from the Atlas of Drosophila Devel- 
opment by Volker Hartenstein, http://flybase.bio.indiana.edu/ 
allied-data/Ik/interactive-fly/atlas/OOcontents.htm.) 


FIGURE 12.17 The yeast two-hybrid system for identify- 
ing protein-protein interactions. If proteins X and Y 
interact, the lacZ reporter gene is expressed. Cells 
expressing lacZ exhibit B-galactosidase activity. 


then introduced into cells of choice (including eukaryotic cells) to assess the po- 
tential function of the nucleotide sequence in regulation because expression of 
the reporter gene serves as a report on the effectiveness of the regulatory ele- 
ment. A number of different genes have been used as reporter genes. A reporter 
gene with many inherent advantages is that encoding the green fluorescent pro- 
tein (or GFP), described in Chapter 4. Unlike the protein expressed by other re- 
porter gene systems, GFP does not require any substrate to measure its activity, 
nor is it dependent on any cofactor or prosthetic group. Detection of GFP re- 
quires only irradiation with near-UV or blue light (400-nm light is optimal), and 
the green fluorescence (light of 500 nm) that results is easily observed with the 
naked eye, although it can also be measured precisely with a fluorometer. Figure 
12.16 demonstrates the use of GFP as a reporter gene. EGFP is an engineered ver- 
sion of GFP that shows enhanced fluorescent properties. 


Specific Protein—Protein Interactions Can Be Identified Using 
the Yeast Two-Hybrid System 


Specific interactions between proteins (so-called protein-protein interactions) lie at 
the heart of many essential biological processes. One method to identify specific 
protein-protein interactions in vivo is through expression of a reporter gene whose 
transcription is dependent on a functional transcriptional activator, the GAL4 pro- 
tein. The GAL4 protein consists of two domains: a DNA-binding (or DB) domain and 
a transcriptional activation (or TA) domain. Even if expressed as separate proteins, 
these two domains will still work, provided they can be brought together. The method 
depends on two separate plasmids encoding two hybrid proteins, one consisting of 
the GAL4 DB domain fused to protein X and the other consisting of the GAL4TA do- 
main fused to protein Y (Figure 12.17a). If proteins X and Y interact in a specific 
protein-protein interaction, the GAL4 DB and TA domains are brought together so 


(a) 





(b) 
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that transcription of a reporter gene driven by the GAL4 promoter can take place 
(Figure 12.17b). Protein X, fused to the GAL4-DNA-binding domain (DB), serves as 
the “bait” to fish for the protein Y “target” and its fused GAL4 TA domain. This 
method can be used to screen cells for protein “targets” that interact specifically with 
a particular “bait” protein. To do so, cDNAs encoding proteins from the cells of in- 
terest are inserted into the TA-containing plasmid to create fusions of the cDNA cod- 
ing sequences with the GAL4 TA domain coding sequences, so a fusion protein library 
is expressed. Identification of a target of the “bait” protein by this method also yields 
directly a cDNA version of the gene encoding the “target” protein. 


Identifying Protein-Protein Interactions Through Immunoprecipitation If anti- 
bodies against one protein of a multiprotein complex are available, the entire com- 
plex can be immunoprecipitated and its composition analyzed. Attachment of such 
antibodies to glass or agarose beads, which easily sediment in a centrifuge, makes re- 
covery of the complex very simple. Because antibodies against it are commercially 
available, the hemagglutinin (HA) peptide, sequence YPYDVPDYA, is a useful protein 
fusion tag, not only for fusion protein purification (Table 12.2) but also for analysis 
of protein-protein interactions. Expressing an HA-tagged protein in vivo, followed by 
immunoprecipitation, allows the isolation of protein complexes of which the HA- 
tagged protein is a member. The other members of the complex can then be identi- 
fied to establish the various interacting partners within the multiprotein complex. 


12:4: What Is the Polymerase Chain Reaction (PCR)? 


Polymerase chain reaction, or PCR, is a technique for dramatically amplifying the 
amount of a specific DNA segment. A preparation of denatured DNA containing 
the segment of interest serves as template for DNA polymerase, and two specific 
oligonucleotides serve as primers for DNA synthesis (as in Figure 12.18). These 
primers, designed to be complementary to the two 3’-ends of the specific DNA 
segment to be amplified, are added in excess amounts of 1000 times or greater 
(Figure 12.18). They prime the DNA polymerase-catalyzed synthesis of the two 
complementary strands of the desired segment, effectively doubling its concen- 
tration in the solution. Then the DNA is heated to dissociate the DNA duplexes 
and then cooled so that primers bind to both the newly formed and the old 
strands. Another cycle of DNA synthesis follows. The protocol has been auto- 
mated through the invention of thermal cyclers that alternately heat the reaction 
mixture to 95°C to dissociate the DNA, followed by cooling, annealing of primers, 
and another round of DNA synthesis. The isolation of heat-stable DNA po- 
lymerases from thermophilic bacteria (such as the Taq DNA polymerase from 
Thermus aquaticus) has made it unnecessary to add fresh enzyme for each round 
of synthesis. Because the amount of target DNA theoretically doubles each round, 
25 rounds would increase its concentration about 33 million times. In practice, 
the increase is actually more like a million times, which is more than ample for 
gene isolation. Thus, starting with a tiny amount of total genomic DNA, a partic- 
ular sequence can be produced in quantity in a few hours. 

PCR amplification is an effective cloning strategy if sequence information for the 
design of appropriate primers is available. Because DNA from a single cell can be 
used as a template, the technique has enormous potential for the clinical diagnosis of 
infectious diseases and genetic abnormalities. With PCR techniques, DNA from a sin- 
gle hair or sperm can be analyzed to identify particular individuals in criminal cases 
without ambiguity. RT-PCR, a variation on the basic PCR method, is useful when the 
nucleic acid to be amplified is an RNA (such as mRNA). Reverse transcriptase (RT) 
is used to synthesize a cDNA strand complementary to the RNA, and this cDNA serves 
as the template for further cycles of PCR. (RT-PCR is also used to refer to yet another 
variation on PCR whose full name is real-time PCR. Real-time PCR uses PCR amplifi- 
cation to measure the relative amounts of mRNAs expressed in vivo.) 
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In Vitro Mutagenesis 
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The advent of recombinant DNA technology has made it possible to clone genes, ma- 
nipulate them in vitro, and express them in a variety of cell types under various con- 
ditions. The function of any protein is ultimately dependent on its amino acid se- 
quence, which in turn can be traced to the nucleotide sequence of its gene. The 
introduction of purposeful changes in the nucleotide sequence of a cloned gene rep- 
resents an ideal way to make specific structural changes in a protein. The effects of 
these changes on the protein’s function can then be studied. Such changes constitute 


12.6 Is It Possible to Make Directed Changes in the Heredity of an Organism? 


mutations introduced in vitro into the gene. In vitro mutagenesis makes it possible to 
alter the nucleotide sequence of a cloned gene systematically, as opposed to the 
chance occurrence of mutations in natural genes. 

One efficient technique for in vitro mutagenesis is PCR-based mutagenesis. Mu- 
tant primers are added to a PCR reaction in which the gene (or segment of a gene) 
is undergoing amplification. The mutant primers are primers whose sequence has been 
specifically altered to introduce a directed change at a particular place in the nu- 
cleotide sequence of the gene being amplified (Figure 12.19). Mutant versions of the 
gene can then be cloned and expressed to determine any effects of the mutation on 
the function of the gene product. 


12055 How Is RNA Interference Used to Reveal the Function 
of Genes? 


RNA interference (RNAi) has emerged as a method of choice in eukaryotic gene in- 
activation. RNAi leads to targeted destruction of a selected gene’s transcript. The con- 
sequences following loss of gene function reveal the role of the gene product in cell 
metabolism. Inactivation of gene expression by RNAi is sometimes referred to as gene 
knockdown. (Gene knockdown is a term that contrasts the method with gene knock- 
out, a procedure that inactivates a gene by disrupting its nucleotide sequence; see 
Chapter 28.) 

Procedures for silencing gene expression via RNAi depend on the introduction 
of double-stranded RNA (dsRNA) molecules into target cells by transfection, viral 
infection, or artificial expression. One strand of the dsRNA is designed to be an an- 
tisense RNA, in that its nucleotide sequence is complementary to the RNA tran- 
script of the gene selected for silencing. An ATP-dependent endogenous cellular 
protein system known as Dicer processes the dsRNA. Dicer is an RNase III family 
member that catalyzes endonucleolytic cleavage of both strands of dsRNA mole- 
cules to produce a double-stranded small interfering RNA (siRNA) 21 to 23 nucleo- 
tides long and having 2-nucleotide-long 3’-overhangs on each strand (Figure 
12.20). The siRNA is then passed to another protein complex known as RNA- 
induced silencing complex (RISC). In an ATP-dependent process, RISC unwinds 
the double-stranded siRNA and selects the antisense strand, which is referred to as 
the guide strand. The other strand, referred to as the passenger strand, is discarded. 
RISC pairs the single-stranded guide strand with a complementary region on the 
targeted gene transcript. RISC then carries out its “slicer function” by cleaving the 
RNA transcript between nucleotides 10 and 11 of the mRNA region that is base- 
paired with the guide strand. Such cleavage prevents expression of the product en- 
coded by the mRNA. The guide strand remains associated with RISC, and RISC can 
use it for multiple cycles of mRNA cleavage and post-transcriptional gene silencing. 


12:61 Is It Possible to Make Directed Changes in the Heredity 
of an Organism? 


Recombinant DNA technology is a powerful tool for the genetic modification of or- 
ganisms. The strategies and methodologies described in this chapter are but an 
overview of the repertoire of experimental approaches that have been devised by 
molecular biologists in order to manipulate DNA and the information inherent in 
it. The enormous success of recombinant DNA technology means that the molecu- 
lar biologist’s task in searching genomes for genes is now akin to that of a lexicog- 
rapher compiling a dictionary, a dictionary in which the “letters” (the nucleotide se- 
quences), spell out not words but rather genes and what they mean. Molecular 
biologists have no index or alphabetic arrangement to serve as a guide through the 
vast volume of information in a genome; nevertheless, this information and its or- 
ganization is rapidly being disclosed by the imaginative efforts and diligence of 
these scientists and their growing arsenal of analytical schemes. 
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CENGAGENOW” ANIMATED FIGURE 12.19 One method 
of PCR-based site-directed mutagenesis. (1) Template 
DNA strands are separated and amplified by PCR using 
mutagenic primers (represented as bent arrows) whose 
sequences introduce a single base substitution at site X 
(and its complementary base X’; thus, the desired amino 
acid change in the protein encoded by the gene). 
Ideally, the mutagenic primers also introduce a unique 
restriction site into the plasmid that was not present 
before. (2) Following many cycles of PCR, the DNA prod- 
uct can be used to transform E. coli cells. (3) The plasmid 
DNA can be isolated and screened for the presence of 
the mutation by screening for the presence of the 
unique restriction site by restriction endonuclease 
cleavage. For example, the nucleotide sequence 
GGATCT within a gene codes for amino acid residues 
Gly-Ser. Using mutagenic primers of nucleotide 
sequence AGATCT (and its complement AGATCT) 
changes the amino acid sequence from Gly-Ser to Arg- 
Ser and creates a Bg/ll restriction site (see Table 10.2). 
Gene expression of the isolated mutant plasmid in E. coli 
allows recovery and analysis of the mutant protein. See 
this figure animated at www.cengage.com/login. 
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FIGURE 12.20 Gene knockdown by RNAi. The dsRNA is 
processed by Dicer, which cleaves both strands of the 
dsRNA to form an siRNA, a ~20-nucleotide dsRNA with 
3’-overhangs. A helicase activity associated with Dicer 
unwinds the siRNA, and the guide strand is delivered to 
the RISC protein complex. An Argonaute protein family 
member (Ago) is the catalytic subunit of RISC. Ago has a 
dsRNA-binding domain that brings together the guide 
strand and a complementary nucleotide sequence on 
the targeted gene transcript. Ago also has a RNase 
H-type catalytic domain that cleaves the gene tran- 
script, rendering it incapable of translation by ribo- 
somes. This RNase H activity of Ago is whimsically 
referred to as the “slicer” function in RNAi. 
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Recombinant DNA technology now verges on the ability to engineer at will the 
heredity (or genetic makeup) of organisms for desired ends. The commercial pro- 
duction of therapeutic biomolecules in microbial cultures is already established (for 
example, the production of human insulin in quantity in Æ. coli cells). Agricultural 
crops with desired attributes, such as enhanced resistance to herbicides or elevated vi- 
tamin levels, are in cultivation. Transgenic mice are widely used as experimental ani- 
mals to investigate models of human disease and physiology (see Chapter 28). Al- 
ready, transgenic versions of domestic animals such as pigs, sheep, and even fish have 
been developed for human benefit. Perhaps most important, in a number of in- 
stances, clinical trials have been approved for gene replacement therapy (or, more 
simply, gene therapy) to correct particular human genetic disorders. 


Human Gene Therapy Can Repair Genetic Deficiencies 


Human gene therapy seeks to repair the damage caused by a genetic deficiency 
through introduction of a functional version of the defective gene. To achieve this 
end, a cloned variant of the gene must be incorporated into the organism in such a 
manner that it is expressed only at the proper time and only in appropriate cell types. 
At this time, these conditions impose serious technical and clinical difficulties. Many 
gene therapies have received approval from the National Institutes of Health for tri- 
als in human patients, including the introduction of gene constructs into patients. 
Among these are constructs designed to cure ADA™ SCID (severe combined im- 
munodeficiency due to adenosine deaminase [ADA] deficiency), neuroblastoma, or 
cystic fibrosis or to treat cancer through expression of the EIA and p53 tumor sup- 
pressor genes. 

A basic strategy in human gene therapy involves incorporation of a functional 
gene into target cells. The gene is typically in the form of an expression cassette con- 
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sisting of a cDNA version of the gene downstream from a promoter that will drive ex- 
pression of the gene in one of two ways. One way, the ex-vivo route, is to introduce a 
vector carrying the expression cassette into cells isolated from a patient and cultured 
in the laboratory. The modified cells are then reintroduced into the patient. The 
other way involves direct incorporation of the gene by treating the patient with a 
viral vector carrying the expression cassette. 

Retroviruses are RNA viruses that replicate their RNA genome by first making a 
DNA intermediate. Because retroviruses can transfer their genetic information di- 
rectly into the genome of host cells, retroviruses provide a route for permanent 
modification of host cells ex vivo. A replication-deficient mutant of Maloney murine 
leukemia virus (MMLV) can be generated by deleting the gag, pol, and env genes. 
This mutant retrovirus can introduce expression cassettes up to 9 kb (Figure 12.21). 
In the cytosol of the patient’s cells, a DNA copy of the viral RNA is synthesized by vi- 
ral reverse transcriptase. This DNA is then randomly integrated into the host cell 
genome, where its expression leads to synthesis of the expression cassette gene 
product (Figure 12.21). 

In 2000, scientists at the Pasteur Institute in Paris used such an ex vivo approach 
to successfully treat infants with X-linked SCID. The gene encoding the yc cytokine 
receptor subunit gene was defective in these infants, and gene therapy was used to 
deliver a functional yc cytokine receptor subunit gene to stem cells harvested from 
the infants. Transformed stem cells were reintroduced into the patients, who were 
then able to produce functional lymphocytes and lead normal lives. This achieve- 
ment represents the first successful outcome in human gene therapy. 

Adenovirus vectors, which can carry expression cassettes up to 7.5 kb, are a possi- 
ble in vivo approach to human gene therapy (Figure 12.22). Adenoviruses are DNA 
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CENGAGENOW” ANIMATED FIGURE 12.21 Retrovirus- 
mediated gene delivery ex vivo using MMLV. Deletion of 
the essential genes gag, pol, and env from MMLV (1) cre- 
ates a space for insertion of an expression cassette (2). 
The modified MMLV acts as a vector for the expression 
cassette. A second virus (the packaging cell line) that car- 
ries intact gag, pol, and env genes allows the modified 
MMLY to reproduce (3), and the packaged recombinant 
viruses can be collected and used to infect a patient (4). 
(Adapted from Figure 1 in Crystal, R. G., 1995. Transfer of genes to 
humans: Early lessons and obstacles to success. Science 270:404,) 
See this figure animated at www.cengage.com/ 
login. 
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CENGAGENOW” ANIMATED FIGURE 12.22 Adenovirus- 
mediated gene delivery in vivo. Adenoviruses are DNA 
viruses. The adenovirus genome (1). Adenovirus vectors 
are generated by deleting gene E1 (and sometimes E3 if 
more space for an expression cassette is needed) (2). 
Insertion of an expression cassette (3). Adenovirus 
progeny from the complementing cell line can be iso- 
lated and used to infect a patient (4). The recombinant 
viral DNA gains access to the cell nucleus (5), where the 
gene carried by the cassette is expressed (6). (Adapted 
from Figure 2 in Crystal, R.G., 1995. Transfer of genes to humans: 
Early lessons and obstacles to success. Science 270:404.) See 
this figure animated at www.cengage.com/login. 
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HUMAN BIOCHEMISTRY 


The gene defective in cystic fibrosis codes for CFTR (cystic fibro- 
sis transmembrane conductance regulator), a membrane protein 
that pumps CI” out of cells. If this Cl” pump is defective, Cl” ions 
remain in cells, which then take up water from the surrounding 
mucus by osmosis. The mucus thickens and accumulates in various 
organs, including the lungs, where its presence favors infections 
such as pneumonia. Left untreated, children with cystic fibrosis 
seldom survive past the age of 5 years. 

ADA” SCID (adenosine deaminase-defective severe combined 
immunodeficiency) is a fatal genetic disorder caused by defects in 
the gene that encodes ADA. The consequence of ADA deficiency 
is accumulation of adenosine and 2'-deoxyadenosine, substances 
toxic to lymphocytes, important cells in the immune response. 
2’-Deoxyadenosine is particularly toxic because its presence leads 
to accumulation of its nucleotide form, dATP, an essential sub- 
strate in DNA synthesis. Elevated levels of dATP actually block 
DNA replication and cell division by inhibiting synthesis of the 
other deoxynucleoside 5’-triphosphates (see Chapter 26). Accu- 
mulation of dATP also leads to selective depletion of cellular ATP, 
robbing cells of energy. Children with ADA™ SCID fail to develop 
normal immune responses and are susceptible to fatal infections, 
unless kept in protective isolation. 








The Biochemical Defects in Cystic Fibrosis and ADA- SCID 
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David, the Boy in the Bubble. David was born with SCID and lived all 
12 years of his life inside a sterile plastic “bubble” to protect him from 
germs common in the environment. He died in 1984 following an 
unsuccessful bone marrow transplant. 
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viruses. The 36-kb adenovirus genome is divided into early genes (E1 to E4) and late 
genes (L1 to L5). Deletion of El renders the adenovirus incapable of replication un- 
less introduced into a complementing cell line carrying the El gene. The comple- 
menting cell line produces adenovirus particles that can be used to infect patients. 
The recombinant adenoviruses enter the patient’s cells via specific receptors on the 
target cell surface; the transferred genetic information is expressed directly from 
the adenovirus recombinant DNA and is never incorporated into the host cell 
genome. Although many problems remain to be solved, human gene therapy as a 


clinical strategy is feasible. 


SUMMARY 


12.1 What Does It Mean “To Clone”? A clone is a collection of mole- 
cules or cells all identical to an original molecule or cell. Plasmids (nat- 
urally occurring, circular, extrachromosomal DNA molecules) are very 
useful in cloning genes. Artificial plasmids can be created by ligating dif- 
ferent DNA fragments together. In this manner, “foreign” DNA se- 
quences can be inserted into artificial plasmids, carried into £. col, and 
propagated as part of the plasmid. Recombinant plasmids are hybrid 
DNA molecules consisting of plasmid DNA sequences plus inserted 
DNA elements. A great number of cloning strategies have emerged to 
make recombinant plasmids for different purposes. 


12.2 What Is a DNA Library? A DNA library is a set of cloned frag- 
ments representing all the genes of an organism. Particular genes can 
be isolated from DNA libraries, even though a particular gene consti- 
tutes only a small part of an organism’s genome. Genomic libraries have 
been prepared from thousands of different species. Libraries can be 
screened for the presence of specific genes. A common method of 
screening plasmid-based genomic libraries is colony hybridization. Mak- 
ing useful probes requires some information about the gene’s nu- 
cleotide sequence (or the amino acid sequence of a protein whose gene 
is sought). DNA from the corresponding gene in a related organism can 
also be used as a probe in screening a library for a particular gene. 

cDNA libraries are DNA libraries prepared from mRNA. Because dif- 
ferent cell types in eukaryotic organisms express selected subsets of 
genes, cDNA libraries prepared from such mRNA are representative of 
the pattern and extent of gene expression that uniquely define particu- 
lar kinds of differentiated cells. 

Expressed sequence tags (ESTs) are relatively short (~200 nucleo- 
tides or so) sequences derived from determining a portion of the 
nucleotide sequence for each insert in randomly selected cDNAs. ESTs 
can be used to identify which genes in a genomic library are being 
expressed in the cell. For example, labeled ESTs can be hybridized to 
DNA microarrays (gene chips). DNA microarrays are arrays of different 
oligonucleotides immobilized on a solid support, or chip. The oligonu- 
cleotides on the chip represent a two-dimensional array of different 
oligonucleotides. Such gene chips are used to reveal gene expression 
patterns. 


12.3 Can the Cloned Genes in Libraries Be Expressed? Expression 
vectors are engineered so that any cloned insert can be transcribed into 
RNA and, in many instances, translated into protein. Strong promoters 
have been constructed that drive the synthesis of foreign proteins to lev- 
els equal to 30% or more of total E. coli cellular protein. cDNA expres- 
sion libraries can also be screened with antibodies to identify and isolate 
cDNA clones encoding a particular protein. 

Reporter gene constructs are chimeric DNA molecules composed of 
gene regulatory sequences positioned next to an easily expressible gene 


product, such as green fluorescent protein. Reporter gene constructs 
introduced into cells of choice (including eukaryotic cells) can reveal 
the function of nucleotide sequences involved in regulation. 


12.4 What Is the Polymerase Chain Reaction (PCR)? PCR is a technique 
for dramatically amplifying the amount of a specific DNA segment. De- 
natured DNA containing the segment of interest serves as template for 
DNA polymerase, and two specific oligonucleotides serve as primers for 
DNA synthesis. The protocol has been automated through the invention 
of thermal cyclers that alternately heat the reaction mixture to 95°C to 
dissociate the DNA, followed by cooling, annealing of primers, and an- 
other round of DNA synthesis. Because DNA from a single cell can be 
used as a template, the technique has enormous potential for the clinical 
diagnosis of infectious diseases and genetic abnormalities. 

Recombinant DNA technology makes it possible to clone genes, ma- 
nipulate them in vitro, and express them in a variety of cell types under 
various conditions. The introduction of changes in the nucleotide se- 
quence of a cloned gene represents an ideal way to make specific struc- 
tural changes in a protein; such changes constitute mutations intro- 
duced in vitro into the gene. One efficient technique for in vitro 
mutagenesis is PCR-based mutagenesis. 


12.5 How Is RNA Interference Used to Reveal the Function of Genes? 
RNAi can be used to selectively inactivate the expression of a target 
gene in a host cell (gene knockdown). Such inactivation reveals the 
function of the gene. RNAi relies on processing of an introduced 
double-stranded RNA molecule (dsRNA), one of whose strands (the 
guide strand) is complementary to a region of the RNA transcript made 
from the gene destined for knockdown. The dsRNA is processed by the 
host cell Dicer protein complex to yield a ~20-nucleotide-long siRNA, 
followed by delivery of the siRNA guide strand sequence to the RISC 
protein complex. RISC then aligns the guide strand with its comple- 
mentary RNA transcript and cleaves the RNA transcript between nu- 
cleotides 10 and 11 of the region that is base-paired with the guide 
strand. Transcript cleavage causes post-transcriptional gene silencing 
because the cleaved transcript cannot be translated into protein. 


12.6 Is It Possible to Make Directed Changes in the Heredity of an 
Organism? Recombinant DNA technology now verges on the ability to 
engineer at will the heredity (or genetic makeup) of organisms for de- 
sired ends. In a number of instances, clinical trials have been approved 
for gene replacement therapy (or, more simply, gene therapy) to correct 
particular human genetic disorders. Human gene therapy seeks to re- 
pair the damage caused by a genetic deficiency through the introduc- 
tion of a functional version of the defective gene. In 2000, scientists at 
the Pasteur Institute in Paris used an ex vivo approach to successfully 
treat infants with X-linked SCID. 
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PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. A DNA fragment isolated from an EcoRI digest of genomic DNA was 
combined with a plasmid vector linearized by EcoRI digestion so 
that sticky ends could anneal. Phage T4 DNA ligase was then added 
to the mixture. List all possible products of the ligation reaction. 

2. The nucleotide sequence of a polylinker in a particular plasmid 
vector is 


-GAATTCCCGGGGATCCTCTAGAGTCGACCTGCAGGCATGC- 


This polylinker contains restriction sites for BamHI, EcoRI, PstI, 
Sall, Smal, Sphi, and Xbal. Indicate the location of each restriction 
site in this sequence. (See Table 10.2 of restriction enzymes for their 
cleavage sites.) 

3. A vector has a polylinker containing restriction sites in the follow- 
ing order: Hind Ill, Sacl, Xhol, BglII, Xbal, and Clal. 

a. Give a possible nucleotide sequence for the polylinker. 

b. The vector is digested with HindIII and Clal. A DNA segment 
contains a HindIII restriction site fragment 650 bases upstream 
from a Clal site. This DNA fragment is digested with HindIII 
and Clal, and the resulting Hind I-Clal fragment is direction- 
ally cloned into the HindUI—Clal-digested vector. Give the nu- 
cleotide sequence at each end of the vector and the insert and 
show that the insert can be cloned into the vector in only one 
orientation. 

4. Yeast (Saccharomyces cerevisiae) has a genome size of 1.21 X 10’ bp. If 
a genomic library of yeast DNA was constructed in a vector capable 
of carrying 16-kbp inserts, how many individual clones would have 
to be screened to have a 99% probability of finding a particular 
fragment? 

5. The South American lungfish has a genome size of 1.02 X 10™ bp. 
If a genomic library of lungfish DNA was constructed in a vector ca- 
pable of carrying inserts averaging 45 kbp in size, how many indi- 
vidual clones would have to be screened to have a 99% probability 
of finding a particular DNA fragment? 


6. Given the following short DNA duplex of sequence (5'—>3') 


ATGCCGTAGTCGATCATTACGATAGCATAGCACAGGGATCCA- 
CATGCACACACATGACATAGGACAGATAGCAT 


what oligonucleotide primers (17-mers) would be required for PCR 
amplification of this duplex? 

7. Figure 12.3 shows a polylinker that falls within the B-galactosidase 
coding region of the lacZ gene. This polylinker serves as a cloning 
site in a fusion protein expression vector where the closed insert is 
expressed as a -galactosidase fusion protein. Assume the vector 
polylinker was cleaved with BamHI and then ligated with an insert 
whose sequence reads 


GATCCATTTATCCACCGGAGAGCTGGTATCCCCAAAAGACG- 
GCC... 

What is the amino acid sequence of the fusion protein? Where is 

the junction between -galactosidase and the sequence encoded by 


the insert? (Consult the genetic code table on the inside front cover 
to decipher the amino acid sequence.) 


8. The amino acid sequence across a region of interest in a protein is 
Asn-Ser-Gly-Met-His-Pro-Gly-Lys-Leu-Ala-Ser-Trp-Phe-Val-Gly-Asn-Ser 


The nucleotide sequence encoding this region begins and ends 
with an EcoRI site, making it easy to clone out the sequence and am- 
plify it by the polymerase chain reaction (PCR). Give the nucleotide 
sequence of this region. Suppose you wished to change the middle 


Ser residue to a Cys to study the effects of this change on the pro- 
tein’s activity. What would be the sequence of the mutant oligonu- 
cleotide you would use for PCR amplification? 


9. Combinatorial chemistry can be used to synthesize polymers such as 
oligopeptides or oligonucleotides. The number of sequence possi- 
bilities for a polymer is given by x, where x is the number of differ- 
ent monomer types (for example, 20 different amino acids in a pro- 
tein or 4 different nucleotides in a nucleic acid) and y is the 
number of monomers in the oligomers. 

a. Calculate the number of sequence possibilities for RNA oligomers 
15 nucleotides long. 

b. Calculate the number of amino acid sequence possibilities for 
pentapeptides. 

10. Imagine that you are interested in a protein that interacts with pro- 
teins of the cytoskeleton in human epithelial cells. Describe an ex- 
perimental protocol based on the yeast two-hybrid system that 
would allow you to identify proteins that might interact with your 
protein of interest. 

11. Describe an experimental protocol for the preparation of two 
cDNA libraries, one from anaerobically grown yeast cells and the 
second from aerobically grown yeast cells. 


12. Describe an experimental protocol based on DNA microarrays 
(gene chips) that would allow you to compare gene expression in 
anaerobically grown yeast versus aerobically grown yeast. 

13. You have an antibody against yeast hexokinase A (hexokinase is the 
first enzyme in the glycolytic pathway). Describe an experimental 
protocol using the cDNA libraries prepared in problem 11 that 
would allow you to identify and isolate the cDNA for hexokinase. 
Consulting Chapter 5 for protein analysis protocols, describe an ex- 
perimental protocol to verify that the protein you have identified is 
hexokinase A. 

14. In your experiment in problem 12, you discover a gene that is 
strongly expressed in anaerobically grown yeast but turned off in 
aerobically grown yeast. You name this gene nox (for “no oxygen”). 
You have the “bright idea” that you can engineer a yeast strain that 
senses Oy levels if you can isolate the nox promoter. Describe how 
you might make a reporter gene construct using the nox promoter 
and how the yeast strain bearing this reporter gene construct might 
be an effective oxygen sensor. 


Biochemistry on the Web 


15. Search the National Center for Biotechnology Information (NCBI) 
website at http://www.ncbi.nlm.nih. gou/sites/entrez?db=Genome to dis- 
cover the number of organisms whose genome sequences have 
been completed. Explore the rich depository of sequence informa- 
tion available here by selecting one organism from the list and 
browsing through the contents available. 


Preparing for the MCAT Exam 


16. Figure 12.1 shows restriction endonuclease sites for the plasmid 
pBR322. You want to clone a DNA fragment and select for it in 
transformed bacteria by using resistance to tetracycline and sensi- 
tivity to ampicillin as a way of identifying the recombinant plasmid. 
What restriction endonucleases might be useful for this purpose? 

17. Suppose in the amino acid sequence in Figure 12.8, tryptophan was 
replaced by cysteine. How would that affect the possible mRNA se- 
quence? (Consult the inside front cover of this textbook for amino 
acid codons.) How many nucleotide changes are necessary in re- 
placing Trp with Cys in this coding sequence? What is the total 
number of possible oligonucleotide sequences for the mRNA if Cys 
replaces Trp? 
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The space shuttle must accelerate from zero velocity 
to a velocity of more than 25,000 miles per hour in 
order to escape earth's gravity. 





There is more to life than increasing tts speed. 


Mahatma Gandhi (1869-1948) 
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Enzymes—Kinetics 
and Specificity 





ESSENTIAL QUESTIONS 


At any moment, thousands of chemical reactions are taking place in any living cell. 
Enzymes are essential for these reactions to proceed at rates fast enough to 
sustain life. 


What are enzymes, and what do they do? 


Living organisms seethe with metabolic activity. Thousands of chemical reactions 
are proceeding very rapidly at any given instant within all living cells. Virtually all of 
these transformations are mediated by enzymes—proteins (and occasionally RNA) 
specialized to catalyze metabolic reactions. The substances transformed in these re- 
actions are often organic compounds that show little tendency for reaction outside 
the cell. An excellent example is glucose, a sugar that can be stored indefinitely on 
the shelf with no deterioration. Most cells quickly oxidize glucose, producing car- 
bon dioxide and water and releasing lots of energy: 


CsH1206 + 6 O.—>6 CO, + 6 HO + 2870 kJ of energy 


(—2870 kJ/mol is the standard-state free energy change [AG°’] for the oxidation 
of glucose.) In chemical terms, 2870 kJ is a large amount of energy, and glucose 
can be viewed as an energy-rich compound even though at ambient temperature 
it is not readily reactive with oxygen outside of cells. Stated another way, glucose 
represents thermodynamic potentiality: Its reaction with oxygen is strongly exer- 
gonic, but it doesn’t occur under normal conditions. On the other hand, en- 
zymes can catalyze such thermodynamically favorable reactions, causing them to 
proceed at extraordinarily rapid rates (Figure 13.1). In glucose oxidation and 
countless other instances, enzymes provide cells with the ability to exert kinetic 
control over thermodynamic potentiality. That is, living systems use enzymes to accel- 
erate and control the rates of vitally important biochemical reactions. 
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FIGURE 13.1 Reaction profile showing the large AG* for glucose oxidation. Enzymes lower AG*, thereby 
accelerating rate. 
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Enzymes Are the Agents of Metabolic Function 


Acting in sequence, enzymes form metabolic pathways by which nutrient mole- 
cules are degraded, energy is released and converted into metabolically useful 
forms, and precursors are generated and transformed to create the literally thou- 
sands of distinctive biomolecules found in any living cell (Figure 13.2). Situated 
at key junctions of metabolic pathways are specialized regulatory enzymes capa- 
ble of sensing the momentary metabolic needs of the cell and adjusting their cat- 
alytic rates accordingly. The responses of these enzymes ensure the harmonious 
integration of the diverse and often divergent metabolic activities of cells so that 
the living state is promoted and preserved. 


ABl What Characteristic Features Define Enzymes? 


Enzymes are remarkably versatile biochemical catalysts that have in common three 
distinctive features: catalytic power, specificity, and regulation. 


Catalytic Power Is Defined as the Ratio of the Enzyme-Catalyzed Rate 
of a Reaction to the Uncatalyzed Rate 


Enzymes display enormous catalytic power, accelerating reaction rates as much as 107! 
over uncatalyzed levels, which is far greater than any synthetic catalysts can achieve, 
and enzymes accomplish these astounding feats in dilute aqueous solutions under 
mild conditions of temperature and pH. For example, the enzyme jack bean urease 
catalyzes the hydrolysis of urea: 


O 
|l 
HəN—C—NHp + 2 HO + H* —> 2 NH,4* + HCO3- 


At 20°C, the rate constant for the enzyme-catalyzed reaction is 3 X 10t/sec; the rate 
constant for the uncatalyzed hydrolysis of urea is 3 X 107!°/sec. Thus, 10" is the ratio 
of the catalyzed rate to the uncatalyzed rate of reaction. Such a ratio is defined as the 
relative catalytic power of an enzyme, so the catalytic power of urease is 10!*. 


Specificity Is the Term Used to Define the Selectivity of Enzymes 
for Their Substrates 


A given enzyme is very selective, both in the substances with which it interacts and in 
the reaction that it catalyzes. The substances upon which an enzyme acts are tradi- 
tionally called substrates. In an enzyme-catalyzed reaction, none of the substrate is di- 
verted into nonproductive side reactions, so no wasteful by-products are produced. It 
follows then that the products formed by a given enzyme are also very specific. This 
situation can be contrasted with your own experiences in the organic chemistry labo- 
ratory, where yields of 50% or even 30% are viewed as substantial accomplishments 
(Figure 13.3). The selective qualities of an enzyme are collectively recognized as its 
specificity. Intimate interaction between an enzyme and its substrates occurs through 
molecular recognition based on structural complementarity; such mutual recognition 
is the basis of specificity. The specific site on the enzyme where substrate binds and 
catalysis occurs is called the active site. 


Regulation of Enzyme Activity Ensures That the Rate of Metabolic 
Reactions Is Appropriate to Cellular Requirements 


Regulation of enzyme activity is essential to the integration and regulation of me- 
tabolism. Enzyme regulation is achieved in a variety of ways, ranging from controls 
over the amount of enzyme protein produced by the cell to more rapid, reversible 
interactions of the enzyme with metabolic inhibitors and activators. Chapter 15 is 
devoted to discussions of this topic. Because most enzymes are proteins, we can 
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FIGURE 13.2 The breakdown of glucose by glycolysis 
provides a prime example of a metabolic pathway. 
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FIGURE 13.3 A 90% yield over 10 steps, for example, in a 
metabolic pathway, gives an overall yield of 35%. There- 
fore, yields in biological reactions must be substantially 
greater; otherwise, unwanted by-products would accu- 
mulate to unacceptable levels. 
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anticipate that the functional attributes of enzymes are due to the remarkable ver- 
satility found in protein structures. 


Enzyme Nomenclature Provides a Systematic Way of Naming 
Metabolic Reactions 


Traditionally, enzymes were named by adding the suffix -ase to the name of the sub- 
strate upon which they acted, as in wrease for the urea-hydrolyzing enzyme or phos- 
phatase for enzymes hydrolyzing phosphoryl groups from organic phosphate com- 
pounds. Other enzymes acquired names bearing little resemblance to their activity, 
such as the peroxide-decomposing enzyme catalase or the proteolytic enzymes (pro- 
teases) of the digestive tract, trypsin and pepsin. Because of the confusion that arose from 
these trivial designations, an International Commission on Enzymes was established to 
create a systematic basis for enzyme nomenclature. Although common names for 
many enzymes remain in use, all enzymes now are classified and formally named ac- 
cording to the reaction they catalyze. Six classes of reactions are recognized (Table 
13.1). Within each class are subclasses, and under each subclass are sub-subclasses 
within which individual enzymes are listed. Classes, subclasses, sub-subclasses, and in- 
dividual entries are each numbered so that a series of four numbers serves to specify a 
particular enzyme. A systematic name, descriptive of the reaction, is also assigned to 
each entry. To illustrate, consider the enzyme that catalyzes this reaction: 


ATP + D-glucose—> ADP + b-glucose-6-phosphate 








TABLE 13.1 n 

E.C. Number Systematic Name and Subclasses E.C. Number Systematic Name and Subclasses 

1 Oxidoreductases (oxidation -reduction reactions) 4 Lyases (addition to double bonds) 

1.1 Acting on CH—OH group of donors 4.1 C=C lyases 

1.1.1 With NAD or NADP as acceptor 4.1.1 Carboxy lyases 

1.1.3 With Os as acceptor 4.1.2 Aldehyde lyases 

1.2 Acting on the =O group of donors 4.2 C=O lyases 

1.2.3 With Os» as acceptor 4.2.1 Hydrolases 

1.3 Acting on the CH—CH group of donors 4.3 C=N lyases 

1.3.1 With NAD or NADP as acceptor 4.3.1 Ammonia-lyases 

2 Transferases (transfer of functional groups) 5 Isomerases (isomerization reactions) 

2.1 Transferring C-1 groups 5.1 Racemases and epimerases 

2.1.1 Methyltransferases 5.1.3 Acting on carbohydrates 

2.1.2 Hydroxymethyltransferases and 5.2 Cis-trans isomerases 
formyltransferases 6 Ligases (formation of bonds with ATP cleavage) 

2.1.3 Carboxyltransferases and 6.1 Forming C—O bonds 
carbamoyltransferases 6.1.1 Amino acid—RNA ligases 

2.2 Transferring aldehydic or ketonic residues 6.2 Forming C—S bonds 

2.3 Acyltransferases 6.3 Forming C—N bonds 

2.4 Glycosyltransferases 6.4 Forming C—C bonds 

2.6 Transferring N-containing groups 6.4.1 Carboxylases 

2.6.1 Aminotransferases 

2.7 Transferring P-containing groups 

2.7.1 With an alcohol group as acceptor 

3 Hydrolases (hydrolysis reactions) 

3.1 Cleaving ester linkage 

3.1.1 Carboxylic ester hydrolases 

3.1.3 Phosphoric monoester hydrolases 

3.1.4 Phosphoric diester hydrolases 
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A phosphate group is transferred from ATP to the C-6-OH group of glucose, so the 
enzyme is a transferase (class 2, Table 13.1). Subclass 7 of transferases is enzymes trans- 
Jerring phosphorus-containing groups, and sub-subclass 1 covers those phosphotransferases 
with an alcohol group as an acceptor. Entry 2 in this sub-subclass is ATP: D-glucose-6- 
phosphotransferase, and its classification number is 2.7.1.2. In use, this number is 
written preceded by the letters E.C., denoting the Enzyme Commission. For exam- 
ple, entry 1 in the same sub-subclass is E.C.2.7.1.1, ATP :D-hexose-6-phosphotrans- 
ferase, an ATP-dependent enzyme that transfers a phosphate to the 6-OH of hexoses 
(that is, it is nonspecific regarding its hexose acceptor). These designations can be 
cumbersome, so in everyday usage, trivial names are commonly used. The glucose- 
specific enzyme E.C.2.7.1.2 is called glucokinase, and the nonspecific E.C.2.7.1.1 is 
known as hexokinase. Kinase is a trivial term for enzymes that are ATP-dependent 
phosphotransferases. 


Coenzymes and Cofactors Are Nonprotein Components Essential 
to Enzyme Activity 


Many enzymes carry out their catalytic function relying solely on their protein struc- 
ture. Many others require nonprotein components, called cofactors (Table 13.2). 
Cofactors may be metal ions or organic molecules referred to as coenzymes. Coen- 
zymes and cofactors provide proteins with chemically versatile functions not found 
in amino acid side chains. Many coenzymes are vitamins or contain vitamins as part 
of their structure. Usually coenzymes are actively involved in the catalytic reaction 
of the enzyme, often serving as intermediate carriers of functional groups in the 
conversion of substrates to products. In most cases, a coenzyme is firmly associated 
with its enzyme, perhaps even by covalent bonds, and it is difficult to separate the 
two. Such tightly bound coenzymes are referred to as prosthetic groups of the en- 
zyme. The catalytically active complex of protein and prosthetic group is called the 
holoenzyme. The protein without the prosthetic group is called the apoenzyme; it 
is catalytically inactive. 


TABLE 13.2 | 








Metal lons and Some 
Enzymes That Require Them 


Coenzymes Serving as Transient Carriers 
of Specific Atoms or Functional Groups 








Mg?* Hexokinase 
Glucose-6-phosphatase 
Mn?* Arginase 


Kt Pyruvate kinase 
(also requires Mg?*) 

Ni?* Urease 

Mo Nitrate reductase 


Se Glutathione peroxidase 








5’-Deoxyadenosylcobalamin 
(vitamin By») 

Biotin (biocytin) 
Tetrahydrofolate (THF) 


H atoms and alkyl groups 


CO» 

Other one-carbon groups, 
such as formyl and methyl 
groups 


Metal Representative Enzymes 
lon Enzyme Coenzyme Entity Transferred Using Coenzymes 
Fe?* or Cytochrome oxidase Thiamine pyrophosphate (TPP) Aldehydes Pyruvate dehydrogenase 
Fe?+ Catalase Flavin adenine dinucleotide (FAD) Hydrogen atoms Succinate dehydrogenase 
Peroxidase Nicotinamide adenine dinucleotide Hydride ion (:H7) Alcohol dehydrogenase 
Cu?* Cytochrome oxidase (NAD) 
Zn?* DNA polymerase Coenzyme A (CoA) Acyl groups Acetyl-CoA carboxylase 
Carbonic anhydrase Pyridoxal phosphate (PLP) Amino groups Aspartate 
Alcohol dehydrogenase aminotransferase 


Methylmalonyl-CoA mutase 


Propionyl-CoA carboxylase 
Thymidylate synthase 
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1302: Can the Rate of an Enzyme-Catalyzed Reaction 
Be Defined in a Mathematical Way? 


Kinetics is the branch of science concerned with the rates of reactions. The study of 
enzyme kinetics addresses the biological roles of enzymatic catalysts and how they ac- 
complish their remarkable feats. In enzyme kinetics, we seek to determine the maxi- 
mum reaction velocity that the enzyme can attain and its binding affinities for sub- 
strates and inhibitors. Coupled with studies on the structure and chemistry of the 
enzyme, analysis of the enzymatic rate under different reaction conditions yields in- 
sights regarding the enzyme’s mechanism of catalytic action. Such information is es- 
sential to an overall understanding of metabolism. 

Significantly, this information can be exploited to control and manipulate the 
course of metabolic events. The science of pharmacology relies on such a strategy. 
Pharmaceuticals, or drugs, are often special inhibitors specifically targeted at a par- 
ticular enzyme in order to overcome infection or to alleviate illness. A detailed 
knowledge of the enzyme’s kinetics is indispensable to rational drug design and 
successful pharmacological intervention. 


Chemical Kinetics Provides a Foundation for Exploring Enzyme Kinetics 


Before beginning a quantitative treatment of enzyme kinetics, it will be fruitful to re- 
view briefly some basic principles of chemical kinetics. Chemical kinetics is the study 
of the rates of chemical reactions. Consider a reaction of overall stoichiometry: 


A—>P 


Although we treat this reaction as a simple, one-step conversion of A to P, it more 
likely occurs through a sequence of elementary reactions, each of which is a simple 
molecular process, as in 


A— I—>J—>P 


where I and J represent intermediates in the reaction. Precise description of all of 
the elementary reactions in a process is necessary to define the overall reaction 
mechanism for AP. 

Let us assume that A—>P is an elementary reaction and that it is spontaneous and 
essentially irreversible. Irreversibility is easily assumed if the rate of P conversion to A 
is very slow or the concentration of P (expressed as [P]) is negligible under the con- 
ditions chosen. The velocity, v, or rate, of the reaction A—>P is the amount of P 
formed or the amount of A consumed per unit time, t. That is, 

_ d[P] _ —dlA] 

v= u or v= “u (13.1) 
The mathematical relationship between reaction rate and concentration of reac- 
tant(s) is the rate law. For this simple case, the rate law is 

—d[A] 
v=— y “MA (13.2) 

From this expression, it is obvious that the rate is proportional to the concentration 
of A, and k is the proportionality constant, or rate constant. k has the units of 
(time) ~!, usually sec™!. vis a function of [A] to the first power, or in the terminol- 
ogy of kinetics, v is first-order with respect to A. For an elementary reaction, the or- 
der for any reactant is given by its exponent in the rate equation. The number of 
molecules that must simultaneously interact is defined as the molecularity of the re- 
action. Thus, the simple elementary reaction of A—P is a first-order reaction. 
Figure 13.4 portrays the course of a first-order reaction as a function of time. The 
rate of decay of a radioactive isotope, like !*C or “°P, is a first-order reaction, as is an 
intramolecular rearrangement, such as AP. Both are unimolecular reactions (the 
molecularity equals 1). 
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Bimolecular Reactions Are Reactions Involving Two Reactant Molecules 


Consider the more complex reaction, where two molecules must react to yield 
products: 


A+B—>P+Q 


Assuming this reaction is an elementary reaction, its molecularity is 2; that is, it is a 
bimolecular reaction. The velocity of this reaction can be determined from the rate 
of disappearance of either A or B, or the rate of appearance of P or Q: 





—d[A] —d[B] d[P] d[Q] 
Vo wu a u u i 
The rate law is 
v = k[A][B] (13.4) 


Since A and B must collide in order to react, the rate of their reaction will be pro- 
portional to the concentrations of both A and B. Because it is proportional to the 
product of two concentration terms, the reaction is second-order overall, first- 
order with respect to A and first-order with respect to B. (Were the elementary 
reaction 2A—>P + Q, the rate law would be v = &[A]?, second-order overall and 
second-order with respect to A.) Second-order rate constants have the units of 
(concentration) ~! (time) ~!, as in M`! sec7!. 

Molecularities greater than 2 are rarely found (and greater than 3, never). (The 
likelihood of simultaneous collision of three molecules is very, very small.) When 
the overall stoichiometry of a reaction is greater than two (for example, as in 
A + B + C— or 2A + B—), the reaction almost always proceeds via unimolecular 
or bimolecular elementary steps, and the overall rate obeys a simple first- or second- 
order rate law. 

At this point, it may be useful to remind ourselves of an important caveat 
that is the first principle of kinetics: Kinetics cannot prove a hypothetical mechanism. Ki- 
netic experiments can only rule out various alternative hypotheses because they 
don’t fit the data. However, through thoughtful kinetic studies, a process of elimi- 
nation of alternative hypotheses leads ever closer to the reality. 


Catalysts Lower the Free Energy of Activation for a Reaction 


In a first-order chemical reaction, the conversion of A to P occurs because, at any 
given instant, a fraction of the A molecules has the energy necessary to achieve a re- 
active condition known as the transition state. In this state, the probability is very 
high that the particular rearrangement accompanying the A—>P transition will oc- 
cur. This transition state sits at the apex of the energy profile in the energy diagram 
describing the energetic relationship between A and P (Figure 13.5). The average 
free energy of A molecules defines the initial state, and the average free energy of 


FIGURE 13.4 Plot of the course of a first-order reaction. 
The half-time, ty/2, is the time for one-half of the starting 
amount of A to disappear. 
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FIGURE 13.5 Energy diagram for a chemical reaction (AP) and the effects of (a) raising the temperature from 
T, to T> or (b) adding a catalyst. 


P molecules is the final state along the reaction coordinate. The rate of any chemi- 
cal reaction is proportional to the concentration of reactant molecules (A in this 
case) having this transition-state energy. Obviously, the higher this energy is above 
the average energy, the smaller the fraction of molecules that will have this energy 
and the slower the reaction will proceed. The height of this energy barrier is called 
the free energy of activation, AG". Specifically, A G} is the energy required to raise 
the average energy of 1 mol of reactant (at a given temperature) to the transition- 
state energy. The relationship between activation energy and the rate constant of 
the reaction, k, is given by the Arrhenius equation: 


k = Ag AGy/kr (13.5) 


where A is a constant for a particular reaction (not to be confused with the reac- 
tant species, A, that we’re discussing). Another way of writing this is 1/k = 
(1/A)e^®/RT, That is, k is inversely proportional to e^4®/RT, Therefore, if the 
energy of activation decreases, the reaction rate increases. 


Decreasing AG* Increases Reaction Rate 


We are familiar with two general ways that rates of chemical reactions may be 
accelerated. First, the temperature can be raised. This will increase the kinetic en- 
ergy of reactant molecules, and more reactant molecules will possess the energy to 
reach the transition state (Figure 13.5a). In effect, increasing the average energy 
of reactant molecules makes the energy difference between the average energy and 
the transition-state energy smaller. (Also note that the equation k= Ae 4/8? 
demonstrates that k increases as T increases.) The rates of many chemical reac- 
tions are doubled by a 10°C rise in temperature. Second, the rates of chemical re- 
actions can also be accelerated by catalysts. Catalysts work by lowering the energy 
of activation rather than by raising the average energy of the reactants (Figure 
13.5b). Catalysts accomplish this remarkable feat by combining transiently with the 
reactants in a way that promotes their entry into the reactive, transition-state con- 
dition. Two aspects of catalysts are worth noting: (1) They are regenerated after 
each reaction cycle (A—P), and therefore can be used over and over again; and 
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(2) catalysts have no effect on the overall free energy change in the reaction, the 
free energy difference between A and P (Figure 13.5b). 
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Examination of the change in reaction velocity as the reactant concentration is var- 
ied is one of the primary measurements in kinetic analysis. Returning to AP, a 
plot of the reaction rate as a function of the concentration of A yields a straight 
line whose slope is k (Figure 13.6). The more A that is available, the greater the 
rate of the reaction, v. Similar analyses of enzyme-catalyzed reactions involving 
only a single substrate yield remarkably different results (Figure 13.7). At low con- 
centrations of the substrate S, vis proportional to [S], as expected for a first-order 
reaction. However, v does not increase proportionally as [S] increases, but instead 
begins to level off. At high [S], v becomes virtually independent of [S] and ap- 
proaches a maximal limit. The value of v at this limit is written Vmax: Because rate 
is no longer dependent on [S] at these high concentrations, the enzyme-catalyzed 
reaction is now obeying zero-order kinetics; that is, the rate is independent of the 
reactant (substrate) concentration. This behavior is a saturation effect: When v 
shows no increase even though [S] is increased, the system is saturated with sub- 
strate. Such plots are called substrate saturation curves. The physical interpreta- 
tion is that every enzyme molecule in the reaction mixture has its substrate- 
binding site occupied by S. Indeed, such curves were the initial clue that an 
enzyme interacts directly with its substrate by binding it. 


The Substrate Binds at the Active Site of an Enzyme 


An enzyme molecule is often (but not always) orders of magnitude larger than its 
substrate. In any case, its active site, that place on the enzyme where S binds, com- 
prises only a portion of the overall enzyme structure. The conformation of the 
active site is structured to form a special pocket or cleft whose three-dimensional 
architecture is complementary to the structure of the substrate. The enzyme and 
the substrate molecules “recognize” each other through this structural comple- 
mentarity. The substrate binds to the enzyme through relatively weak forces— 
H bonds, ionic bonds (salt bridges), and van der Waals interactions between steri- 
cally complementary clusters of atoms. 
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Enzyme 
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Slope = k 


Velocity, v 


Reactant concentration, [A] 


FIGURE 13.6 A plot of v versus [A] for the unimolecular 
chemical reaction, AP yields a straight line having a 
slope equal to k. 


FIGURE 13.7 Substrate saturation curve for an enzyme- 
catalyzed reaction. The amount of enzyme is constant, 
and the velocity of the reaction is determined at vari- 
ous substrate concentrations. The reaction rate, v, as a 
function of [S] is described mathematically by a 
rectangular hyperbola. The H,O molecule provides a 
rough guide to scale. 
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CENGAGENOW” ANIMATED FIGURE 13.8 Time course 
for a typical enzyme-catalyzed reaction obeying the 
Michaelis-Menten, Briggs—-Haldane models for enzyme 
kinetics. The early stage of the time course is shown in 


greater magnification in the bottom graph. See this fig- 


ure animated at www.cengage.com/login. 


The Michaelis—Menten Equation Is the Fundamental Equation 
of Enzyme Kinetics 


Lenore Michaelis and Maud L. Menten proposed a general theory of enzyme action 
in 1913 consistent with observed enzyme kinetics. Their theory was based on the as- 
sumption that the enzyme, E, and its substrate, S, associate reversibly to form an 
enzyme-substrate complex, ES: 


kı 
E + S==ES (13.6) 
=] 


This association/dissociation is assumed to be a rapid equilibrium, and K, is the 
enzyme: substrate dissociation constant. At equilibrium, 





k_,[ES] = k [E] [S] (13.7) 
and 
[E] [S] ky 
K, = ES] & (13.8) 


Product, P, is formed in a second step when ES breaks down to yield E + P. 
—>5E +P (13.9) 


E is then free to interact with another molecule of S. 


Assume That [ES] Remains Constant During an Enzymatic Reaction 


The interpretations of Michaelis and Menten were refined and extended in 1925 by 
Briggs and Haldane, who assumed the concentration of the enzyme-—substrate com- 
plex ES quickly reaches a constant value in such a dynamic system. That is, ES is 
formed as rapidly from E + S as it disappears by its two possible fates: dissociation to 
regenerate E + S and reaction to form E + P. This assumption is termed the steady- 
state assumption and is expressed as 


d[ES] 
dt 





=0 (13.10) 


That is, the change in concentration of ES with time, t, is 0. Figure 13.8 illustrates 
the time course for formation of the ES complex and establishment of the steady- 
state condition. 


Assume That Velocity Measurements Are Made Immediately 
After Adding S 


One other simplification will be advantageous. Because enzymes accelerate the rate 
of the reverse reaction as well as the forward reaction, it would be helpful to ignore 
any back reaction by which E + P might form ES. The velocity of this back reaction 
would be given by v = k_»[E][P]. However, if we observe only the initial velocity for 
the reaction immediately after E and S are mixed in the absence of P, the rate of any 
back reaction is negligible because its rate will be proportional to [P], and [P] is es- 
sentially 0. Given such simplification, we now analyze the system described by Equa- 
tion 13.9 in order to describe the initial velocity v as a function of [S] and amount 
of enzyme. 
The total amount of enzyme is fixed and is given by the formula 


Total enzyme, [E7] = [E] + [ES] (13.11) 
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where [E] is free enzyme and [ES] is the amount of enzyme in the enzyme- 
substrate complex. From Equation 13.9, the rate of [ES] formation is 

vy = k ([Er] — [ES]) [S] 
where 
[E7] — [ES] = [E] (13.12) 
From Equation 13.9, the rate of [ES] disappearance is 
va = k [ES] + ko[ES] = (k-ı + ko) [ES] (13.13) 
At steady state, d[ES]/dt = 0, and therefore, v= v4. So, 
ki([Er] — [ES])[S] = (k-ı + kə) LES] (13.14) 
Rearranging gives 


({E7] — [ES])[S] (k-1 + ko) 
[ES] E kı 





(13.15) 


The Michaelis Constant, K,,, Is Defined as (k_, + k,)/k, 


The ratio of constants (k_; + kə)/kı is itself a constant and is defined as the 
Michaelis constant, K, 
_ (ha + hy) 


= 13.1 
m hi (13.16) 


Note from Equation 13.15 that K„ is given by the ratio of two concentrations 
(({E7] — [ES]) and [S]) to one ([ES]), so K,, has the units of molarity. (Also, because 
the units of k_, and ko are in time! and the units of kı are M7'!time!, it becomes ob- 
vious that the units of K,, are M.) From Equation 13.15, we can write 


(LE7] — [ES])[S] 
[ES] 





= Ka (13.17) 


which rearranges to 


[Er] [S] 
ES] = > 13.18 
[ESI K+ [S] an 
Now, the most important parameter in the kinetics of any reaction is the rate of 
product formation. This rate is given by 


v= d (13.19) 
and for this reaction 
v = k[ES] (13.20) 
Substituting the expression for [ES] from Equation 13.18 into Equation 13.20 
gives 
— beLE7I[S] m 
"= Kn + [S] a 


The product ko[Er] has special meaning. When [S] is high enough to saturate all 
of the enzyme, the velocity of the reaction, v, is maximal. At saturation, the 
amount of [ES] complex is equal to the total enzyme concentration, Er, its maxi- 
mum possible value. From Equation 13.20, the initial velocity v then equals 
ko[Er] = Vmax: Written symbolically, when [S] >> [E7] (and K,,), [E7] = [ES] and 
v= Vmax. Therefore, 


Vinax = kə [E7] (13:22) 
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Substituting this relationship into the expression for v gives the Michaelis-Menten 
equation: 
Vinax LS] 


*~ Ka+ [S] 


(13.23) 


This equation says that the initial rate of an enzyme-catalyzed reaction, v, is de- 
termined by two constants, K,, and Vma» and the initial concentration of substrate. 


When [S] = Km, V = Vinax/2 


We can provide an operational definition for the constant K,, by rearranging Equa- 
tion 13.23 to give 
Vinax 
Ky, = [S] Gs = 1) (13.24) 


U 


Then, at v = Vinnax/2, Km = [S]. That is, K,, is defined by the substrate concentration 
that gives a velocity equal to one-half the maximal velocity. Table 13.3 gives the K» 
values of some enzymes for their substrates. 


Plots of v Versus [S] Illustrate the Relationships Between Vmax, Km, 
and Reaction Order 


The Michaelis-Menten equation (Equation 13.23) describes a curve known from 
analytical geometry as a rectangular hyperbola. In such curves, as [S] is increased, v 
approaches the limiting value, Vmax, in an asymptotic fashion. Vinax can be approx- 
imated experimentally from a substrate saturation curve (Figure 13.7), and K,, 











TABLE 13.3 
Enzyme Substrate Km (mM) 
Carbonic anhydrase CO2 12 
Chymotrypsin N-Benzoyltyrosinamide 2.5 
Acetyl-L-tryptophanamide 5 
N-Formyltyrosinamide 12 
N-Acetyltyrosinamide 32 
Glycyltyrosinamide 122 
Hexokinase Glucose 0.15 
Fructose 1.5 
B-Galactosidase Lactose 4 
Glutamate dehydrogenase NH,* 57 
Glutamate 0.12 
a-Ketoglutarate 2 
NAD* 0.025 
NADH 0.018 
Aspartate aminotransferase Aspartate 0.9 
a-Ketoglutarate 0.1 
Oxaloacetate 0.04 
Glutamate 4 
Threonine deaminase Threonine 5 
Arginyl-tRNA synthetase Arginine 0.003 
tRNAs 0.0004 
ATP 0.3 
Pyruvate carboxylase HCO; 1.0 
Pyruvate 0.4 
ATP 0.06 
Penicillinase Benzylpenicillin 0.05 
Lysozyme Hexa-N-acetylglucosamine 0.006 
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can be derived from Vpnax/2, so the two constants of the Michaelis-Menten equa- 
tion can be obtained from plots of v versus [S]. Note, however, that actual esti- 
mation of V,,,x, and consequently K,,, is only approximate from such graphs. That 
is, according to Equation 13.23, to get v = 0.99 Vinax, [S] must equal 99 K,,, a con- 
centration that may be difficult to achieve in practice. 

From Equation 13.23, when [S] >> Km then v = Vmax: That is, vis no longer de- 
pendent on [S], so the reaction is obeying zero-order kinetics. Also, when [S] < Km 
then v = (Vinax/ Km) [S]. That is, the rate, v, approximately follows a first-order rate 
equation, v = k' [A], where k' = Vnax/ Km 

K,, and Vna once known explicitly, define the rate of the enzyme-catalyzed re- 
action, provided: 


1. The reaction involves only one substrate, or if the reaction is multisubstrate, the 
concentration of only one substrate is varied while the concentrations of all other 
substrates are held constant. 

2. The reaction ES—>E + P is irreversible, or the experiment is limited to observing 
only initial velocities where [P] = 0. 

3. [S]o > [Ez] and [E7] is held constant. 

4. All other variables that might influence the rate of the reaction (temperature, 
pH, ionic strength, and so on) are held constant. 


Turnover Number Defines the Activity of One Enzyme Molecule 


The turnover number of an enzyme, kean is a measure of its maximal catalytic activ- 
ity. kea is defined as the number of substrate molecules converted into product per 
enzyme molecule per unit time when the enzyme is saturated with substrate. The 
turnover number is also referred to as the molecular activity of the enzyme. For the 
simple Michaelis-Menten reaction (Equation 13.9) under conditions of initial ve- 
locity measurements, ky = kea. Provided the concentration of enzyme, [E7], in the 
reaction mixture is known, kea can be determined from Vmax: At saturating [S], v = 
Vmax = ko [E7]. Thus, 
T E 

* [Er] 
The term kea represents the kinetic efficiency of the enzyme. Table 13.4 lists 
turnover numbers for some representative enzymes. Catalase has the highest 
turnover number known; each molecule of this enzyme can degrade 40 million mol- 
ecules of HO» in 1 second! At the other end of the scale, lysozyme requires 2 sec- 
onds to cleave a glycosidic bond in its glycan substrate. 

In many situations, the actual molar amount of the enzyme is not known. How- 
ever, its amount can be expressed in terms of the activity observed. The International 
Commission on Enzymes defines one international unit as the amount that catalyzes the 
formation of 1 micromole of product in 1 minute. (Because enzymes are very sensitive to 
factors such a pH, temperature, and ionic strength, the conditions of assay must be 
specified.) In the process of purifying enzymes from cellular sources, many extrane- 
ous proteins may be present. Then, the units of enzyme activity are expressed as en- 
zyme units per mg protein, a term known as specific activity (see Table 5.1). 


(13.25) 


The Ratio, k,a:/K, Defines the Catalytic Efficiency of an Enzyme 


Under physiological conditions, [S] is seldom saturating and kea itself is not partic- 
ularly informative. That is, the in vivo ratio of [S]/ K„ usually falls in the range of 0.01 
to 1.0, so active sites often are not filled with substrate. Nevertheless, we can derive a 
meaningful index of the efficiency of Michaelis-Menten-type enzymes under these 
conditions by using the following equations. As presented in Equation 13.23, if 


O VenaxlS] 
°K, + [S] 
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TABLE 13.4 

Enzyme koa (Sec!) 
Catalase 40,000,000 
Carbonic anhydrase 1,000,000 
Acetylcholinesterase 14,000 
Penicillinase 2,000 
Lactate dehydrogenase 1,000 
Chymotrypsin 100 
DNA polymerase I 15 
Lysozyme 0.5 
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and Vise = Rest [Ev], then 


= kealE7] [S] (13 26) 
°K, + [S] 
When [S] < K,,, the concentration of free enzyme, [E], is approximately equal to 
[Er], so 
= (Ba [E][S] 13.27 
í 7 ( Ky ( ` ) 


That is, kat/ Kn is an apparent second-order rate constant for the reaction of E and 
S to form product. Because K,„ is inversely proportional to the affinity of the enzyme 
for its substrate and kea is directly proportional to the kinetic efficiency of the en- 
zyme, keat/ Km provides an index of the catalytic efficiency of an enzyme operating at 
substrate concentrations substantially below saturation amounts. 

An interesting point emerges if we restrict ourselves to the simple case where 
Rea = ko. Then 





K, Ene ko (13.28) 
But k; must always be greater than or equal to kıko/ (k-1 + ko). That is, the reaction can 
go no faster than the rate at which E and S come together. Thus, kı sets the upper limit 
for kea/ Km In other words, the catalytic efficiency of an enzyme cannot exceed the diffusion- 
controlled rate of combination of E and S to form ES. In H:O, the rate constant for such dif- 
fusion is approximately 10°/ M: sec for small substrates (for example, glyceraldehyde 
3-P) and an order of magnitude smaller (~ 108/M- sec) for substrates the size of nu- 
cleotides. Those enzymes that are most efficient in their catalysis have k,.,/ Kn ratios ap- 
proaching this value. Their catalytic velocity is limited only by the rate at which they en- 
counter S; enzymes this efficient have achieved so-called catalytic perfection. All E and 
S encounters lead to reaction because such “catalytically perfect” enzymes can channel 
S to the active site, regardless of where S hits E. Table 13.5 lists the kinetic parameters 
of several enzymes in this category. Note that kea and K„ both show a substantial range 
of variation in this table, even though their ratio falls around 108/ M- sec. 


Linear Plots Can Be Derived from the Michaelis-Menten Equation 


Because of the hyperbolic shape of v versus [S] plots, Vina, can be determined only 
from an extrapolation of the asymptotic approach of v to some limiting value as [S] 
increases indefinitely (Figure 13.7); and K,, is derived from that value of [S] giving 











TABLE 13.5 
Keat Km Keat/Km 
Enzyme Substrate (sec!) (M) (M~" sec™') 
Acetylcholinesterase Acetylcholine 1.4 x 10t 9x10% 1.6 X 108 
Carbonic CO, 1 x 10° 0.012 8.3 X 107 
anhydrase HCO,” 4x 10° 0.026 1.5 X 107 
Catalase H,O, 4x 107 1.1 4x107 
Crotonase Crotonyl-CoA 5.7 X 108 2x 10% 2.8 X 108 
Fumarase Fumarate 800 5 x 10~° 1.6 X 108 
Malate 900 2.5 X 10-5 3.6 X 107 
Triosephosphate Glyceraldehyde- 4.3 X 108 1.8 xX 107 2.4 X 108 
isomerase 3-phosphate* 
B-Lactamase Benzylpenicillin 2 x 10° 2x 10% 1 X 108 








*Km for glyceraldehyde-3-phosphate is calculated on the basis that only 3.8% of the substrate in solution is unhydrated 
and therefore reactive with the enzyme. 
Adapted from Fersht, A., 1985. Enzyme Structure and Mechanism, 2nd ed. New York: W. H. Freeman. 
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v = Vnax/2. However, several rearrangements of the Michaelis-Menten equation 
transform it into a straight-line equation. The best known of these is the 
Lineweaver—Burk double-reciprocal plot: 

Taking the reciprocal of both sides of the Michaelis-Menten equation, Equation 


13.23, yields the equality 
l Kn 1 1 
4 7 ( Vinax \ [S] ü Vinax (13.29) 


This conforms to y= mx + b (the equation for a straight line), where y = 1/v; m, 
the slope, is Km/ Vmax; x = 1/[S]; and b= 1/Vmax. Plotting 1/v versus 1/[S] gives a 
straight line whose x-intercept is —1/K,,, whose y-intercept is 1/ Vmax and whose 
slope is Km/ Vmax (Figure 13.9). 
The Hanes—Woolf plot is another rearrangement of the Michaelis-Menten equa- 
tion that yields a straight line: 
Multiplying both sides of Equation 13.29 by [S] gives 


K 1 S Kn S 
a -isi ( . \ z) [S] _ 1s] ooh 














Vinas [ S Vn ax Vaas Via 
and 
IS] l [S] + Kn 13.31 
U E ( a Vmax ( = ) 


Graphing [S]/v versus [S] yields a straight line where the slope is 1/Vinax, the 
y-intercept is Km/ Vmax, and the x-intercept is —K,,, as shown in Figure 13.10. The 
Hanes—Woolf plot has the advantage of not overemphasizing the data obtained at 
low [S], a fault inherent in the Lineweaver—Burk plot. The common advantage of 
these plots is that they allow both K,, and Vmax to be accurately estimated by extrap- 
olation of straight lines rather than asymptotes. Computer fitting of v versus [S] data 
to the Michaelis-Menten equation is more commonly done than graphical plotting. 


Nonlinear Lineweaver—Burk or Hanes—Woolf Plots Are a Property 
of Regulatory Enzymes 


If the kinetics of the reaction disobey the Michaelis-Menten equation, the violation is 
revealed by a departure from linearity in these straight-line graphs. We shall see in the 
next chapter that such deviations from linearity are characteristic of the kinetics of 
regulatory enzymes known as allosteric enzymes. Such regulatory enzymes are very 
important in the overall control of metabolic pathways. 


CENGAGENOW” ACTIVE FIGURE 13.9 The 
Lineweaver—Burk double-reciprocal plot. Test yourself 
on the concepts in this figure at www.cengage.com/ 
login. 
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A DEEPER LOOK 


An Example of the Effect of Amino Acid Substitutions on Km and keat: Wild-Type 
and Mutant Forms of Human Sulfite Oxidase 


Mammalian sulfite oxidase is the last enzyme in the pathway for 
degradation of sulfur-containing amino acids. Sulfite oxidase 
(SO) catalyzes the oxidation of sulfite (SO3?7) to sulfate (SO,?7), 
using the heme-containing protein, cytochrome c, as electron 
acceptor: 


SO? + 2 cytochrome Goxidized + H:O = 
SO + 2 cytochrome Geducea + 2 H* 


Isolated sulfite oxidase deficiency is a rare and often fatal genetic dis- 
order in humans. The disease is characterized by severe neurolog- 
ical abnormalities, revealed as convulsions shortly after birth. 
R. M. Garrett and K. V. Rajagopalan at Duke University Medical 
Center have isolated the human cDNA for sulfite oxidase from the 
cells of normal (wild-type) and SO-deficient individuals. Expression 
of these SO cDNAs in transformed Escherichia coli cells allowed the 
isolation and kinetic analysis of wild-type and mutant forms of SO, 
including one (designated R160Q) in which the Arg at position 
160 in the polypeptide chain is replaced by Gln. A genetically en- 
gineered version of SO (designated R160K) in which Lys replaces 





ants for WARE RTayar ny, 


Kn sulfite (uM) Keat/Km (1 0° MT! sec™') 


kex (Sec!) 





Enzyme 

Wild-type 17 18 T1 
R160Q 1900 3 0.0016 
R160K 360 5.5 0.015 








Replacing R' in sulfite oxidase by Q increases K,,, decreases 
kat; and markedly diminishes the catalytic efficiency (kea/ Km) of 
the enzyme. The R160K mutant enzyme has properties intermedi- 
ate between wild-type and the R160Q mutant form. The substrate, 
SO;?-, is strongly anionic, and R!® is one of several Arg residues 
situated within the SO substrate-binding site. Positively charged 
side chains in the substrate-binding site facilitate SO;°~ binding 
and catalysis, with Arg being optimal in this role. 


Arg'® was also studied. 











CENGAGENOW” ANIMATED FIGURE 13.10 A Hanes- 
Woolf plot of [S]/v versus [S]. See this figure animated 
at www.cengage.com/login. 
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Enzymatic Activity Is Strongly Influenced by pH 


Enzyme-substrate recognition and the catalytic events that ensue are greatly de- 
pendent on pH. An enzyme possesses an array of ionizable side chains and pros- 
thetic groups that not only determine its secondary and tertiary structure but may 
also be intimately involved in its active site. Furthermore, the substrate itself often 
has ionizing groups, and one or another of the ionic forms may preferentially in- 
teract with the enzyme. Enzymes in general are active only over a limited pH range, 
and most have a particular pH at which their catalytic activity is optimal. These ef- 
fects of pH may be due to effects on K,, or Vinax Or both. Figure 13.11 illustrates the 
relative activity of four enzymes as a function of pH. Trypsin, an intestinal protease, 
has a slightly alkaline pH optimum, whereas pepsin, a gastric protease, acts in the 
acidic confines of the stomach and has a pH optimum near 2. Papain, a protease 
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found in papaya, is relatively insensitive to pHs between 4 and 8. Cholinesterase ac- 
tivity is pH-sensitive below pH 7 but not between pH 7 and 10. The cholinesterase 
activity-pH profile suggests that an ionizable group with a pK, near 6 is essential to its 
activity. Might this group be a histidine side chain within its active site? Although the 
pH optimum of an enzyme often reflects the pH of its normal environment, the op- 
timum may not be precisely the same. This difference suggests that the pH-activity re- 
sponse of an enzyme may be a factor in the intracellular regulation of its activity. 


The Response of Enzymatic Activity to Temperature Is Complex 


Like most chemical reactions, the rates of enzyme-catalyzed reactions generally in- 
crease with increasing temperature. However, at temperatures above 50° to 60°C, 
enzymes typically show a decline in activity (Figure 13.12). Two effects are operat- 
ing here: (1) the characteristic increase in reaction rate with temperature and 
(2) thermal denaturation of protein structure at higher temperatures. Most enzy- 
matic reactions double in rate for every 10°C rise in temperature (that is, Qio = 2, 
where Qj is defined as the ratio of activities at two temperatures 10° apart) as long as the 
enzyme is stable and fully active. Some enzymes, those catalyzing reactions having 
very high activation energies, show proportionally greater Qjo values. The increas- 
ing rate with increasing temperature is ultimately offset by the instability of higher 
orders of protein structure at elevated temperatures, where the enzyme is inacti- 
vated. Not all enzymes are quite so thermally labile. For example, the enzymes of 
thermophilic prokaryotes (thermophilic = “heat-loving”) found in geothermal springs 
retain full activity at temperatures in excess of 85°C. 


1345 What Can Be Learned from the Inhibition 
of Enzyme Activity? 


If the velocity of an enzymatic reaction is decreased or inhibited by some agent, the 
kinetics of the reaction obviously have been perturbed. Systematic perturbations are 
a basic tool of experimental scientists; much can be learned about the normal work- 
ings of any system by inducing changes in it and then observing the effects of the 
change. The study of enzyme inhibition has contributed significantly to our under- 
standing of enzymes. 


Enzymes May Be Inhibited Reversibly or Irreversibly 


Enzyme inhibitors are classified in several ways. The inhibitor may interact either re- 
versibly or irreversibly with the enzyme. Reversible inhibitors interact with the enzyme 
through noncovalent association/dissociation reactions. In contrast, irreversible 


FIGURE 13.11 The pH activity profiles of 
four different enzymes. 
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FIGURE 13.12 The effect of temperature on enzyme 
activity. 
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inhibitors usually cause stable, covalent alterations in the enzyme. That is, the conse- 
quence of irreversible inhibition is a decrease in the concentration of active enzyme. 
The kinetics observed are consistent with this interpretation, as we shall see later. 


Reversible Inhibitors May Bind at the Active Site or at Some Other Site 


Reversible inhibitors fall into three major categories: competitive, noncompetitive, 
and uncompetitive. Competitive inhibitors are characterized by the fact that the sub- 
strate and inhibitor compete for the same binding site on the enzyme, the so-called 
active site or substrate-binding site. Thus, increasing the concentration of S favors 
the likelihood of S binding to the enzyme instead of the inhibitor, I. That is, high [S] 
can overcome the effects of I. The effects of the other major types, noncompetitive 
and uncompetitive inhibition, cannot be overcome by increasing [S]. The three 
types can be distinguished by the particular patterns obtained when the kinetic data 
are analyzed in linear plots, such as double-reciprocal (Lineweaver—Burk) plots. A 
general formulation for common inhibitor interactions in our simple enzyme kinetic 
model would include 


E + I= FI and/or I + ES=—IES (13.32) 


Competitive Inhibition Consider the following system: 


hy ko ks 
E + S=—ES—>E + P E+ (13.33) 
=] = 


where an inhibitor, I, binds reversibly to the enzyme at the same site as S. S-binding and 
I-binding are mutually exclusive, competitive processes. Formation of the ternary com- 
plex, IES, where both S and I are bound, is physically impossible. This condition leads 
us to anticipate that S and I must share a high degree of structural similarity because 
they bind at the same site on the enzyme. Also notice that, in our model, EI does not 
react to give rise to E + P. That is, I is not changed by interaction with E. The rate of 
the product-forming reaction is v = ko[ES]. 

It is revealing to compare the equation for the uninhibited case, Equa- 
tion 13.23 (the Michaelis-Menten equation) with Equation 13.43 for the rate of 
the enzymatic reaction in the presence of a fixed concentration of the competi- 
tive inhibitor, [I] 


—_VonaxlSI 
~ Kat [S] 
Vmax [S] 
v= [s] + K,(1 a 
K 


I 


v 





(see also Table 13.6). The K,, term in the denominator in the inhibited case is in- 
creased by the factor (1 + [I]/ Kj); thus, vis less in the presence of the inhibitor, as 
expected. Clearly, in the absence of I, the two equations are identical. Figure 13.13 
shows a Lineweaver-Burk plot of competitive inhibition. Several features of com- 
petitive inhibition are evident. First, at a given [I], v decreases (1/v increases). 





TABLE 13.6 ff 














Inhibition Type Rate Equation Apparent Km Apparent Vinax 
None v= VinaxLS]/ (Kn + [S]) Kn Vinax 

Competitive v= VinaxLS]/([S] + Kp + (1/51) K,,(1 + [1] /5i) Vinax 
Noncompetitive v= (VinaxLS]/ Q + M/K))/ (Kin + [S]) Kn Vinax/ (1 + [I] / 7) 
Mixed v= Vinax[S]/((1 + [H/ 51) Kn + (1 + O]/Kr[S])) Ka + [1] /Ky)/(1 + M/K’) Vinax/ (1 + [1] /1’) 
Uncompetitive v= Vinax[S]/ (Kn + [S$]. + I]/Kr')) Ky,/ Q + [/ Ki’) Vinax/ (1 + [1] /K1’) 














K, is defined as the enzyme: inhibitor dissociation constant K, = [E][I]/[El]; K’ is defined as the enzyme-substrate complex: inhibitor dissociation constant K,’ = [ES][I/[IES]. 
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Kı 


CENGAGENOW” ACTIVE FIGURE 13.13 Lineweaver-Burk plot of competitive inhibition, showing lines for no |, 
[], and 2[I]. Note that when [S] is infinitely large (1/[S] = 0), Vmax is the same, whether | is present or not. Test 


yourself on the concepts in this figure at www.cengage.com/login. 


When [S] becomes infinite, v = Vnax and is unaffected by I because all of the en- 
zyme is in the ES form. Note that the value of the — x-intercept decreases as [I] in- 
creases. This — x-intercept is often termed the apparent K,, (or Knapp) because it is 
the K„ apparent under these conditions. The diagnostic criterion for competitive 
inhibition is that Vmax is unaffected by I; that is, all lines share a common y-intercept. 
This criterion is also the best experimental indication of binding at the same site by 


two substances. Competitive inhibitors resemble S structurally. 


Succinate Dehydrogenase—A Classic Example of Competitive Inhibition 


The 


enzyme succinate dehydrogenase (SDH) is competitively inhibited by malonate. Figure 


A DEEPER LOOK 





The Equations of Competitive Inhibition 


Given the relationships between E, S, and I described previously 
and recalling the steady-state assumption that d[ES]/dt = 0, from 
Equations (13.14) and (13.16) we can write 
_ WIES] _ [EJS] 
(ko F hk.) Kn 





(18.34) 


Assuming that E + I= EI reaches rapid equilibrium, the rate of 
EI formation, v; = ks[E] [I], and the rate of disappearance of EI, 
ui = k-s[EI], are equal. So, 


ks[E] [I] = k-s[EV] (13.35) 


Therefore, 


ks 
[EI] = 7 [E] [1] 


hes (13.36) 


If we define K; as k_3/k3, an enzyme-inhibitor dissociation con- 
stant, then 


[E] H] 





[EI] = K, (13.37) 
knowing [E7] = [E] + [ES] + [EI]. Then 
[E][S] [E] M] 
[Er] = [E] + (13.38) 


K m ° Kı 





Solving for [E] gives 


KK, [Eq 
~ (KK, + Ki[S] + Knll]) 





[E] (13.39) 


Because the rate of product formation is given by v = k [ES], from 
Equation 13.34 we have 











ko[E][S] 
pa (13.40) 
So, 
7 (kg Ki [E7] [S]) (13.41 
= (KK, + KS] + Kall) = 
Because Vmax = ko[E7], 
Vmax LS] 
TE KI (13.42) 
+ [S] + = 
K,, + [S] K, 
or 
Vaal S] 
(13.43) 
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FIGURE 13.14 Structures of succinate, the substrate of 
succinate dehydrogenase (SDH), and malonate, the 
competitive inhibitor. Fumarate (the product of SDH 
action on succinate) is also shown. 
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13.14 shows the structures of succinate and malonate. The structural similarity be- 
tween them is obvious and is the basis of malonate’s ability to mimic succinate and 
bind at the active site of SDH. However, unlike succinate, which is oxidized by SDH 
to form fumarate, malonate cannot lose two hydrogens; consequently, it is unreactive. 


Noncompetitive Inhibition Noncompetitive inhibitors interact with both E and 
ES (or with S and ES, but this is a rare and specialized case). Obviously, then, the 
inhibitor is not binding to the same site as S, and the inhibition cannot be overcome 
by raising [S]. There are two types of noncompetitive inhibition: pure and mixed. 


Pure Noncompetitive Inhibition In this situation, the binding of I by E has no 
effect on the binding of S by E. That is, S and I bind at different sites on E, and bind- 
ing of I does not affect binding of S. Consider the system 


Kı Ky’ 
E + I= EI ES + I—=IES (13.44) 


Pure noncompetitive inhibition occurs if Kı = Kj’. This situation is relatively uncom- 
mon; the Lineweaver—Burk plot for such an instance is given in Figure 13.15. Note 
that K,, is unchanged by I (the x-intercept remains the same, with or without I). Note 
also that the apparent Vmax decreases. A similar pattern is seen if the amount of en- 
zyme in the experiment is decreased. Thus, it is as if I lowered [E]. 


Mixed Noncompetitive Inhibition In this situation, the binding of I by E influences 
the binding of S by E. Either the binding sites for I and S are near one another or con- 
formational changes in E caused by I affect S binding. In this case, Ky and jy’, as de- 
fined previously, are not equal. Both the apparent K,,,and the apparent Vmax are altered 








+I 
Eee 
< K 
av Slope= —" (1 + m 
v We Ky 
E EI =l 
1 m 
Wea Kı Slope = I 
N V 
1 
7 
p. 1 
=. 7 
1 rh a = 1 
= a = 
E A7 v 
ES IES Re moz 
0 1 


IS] 


CENGAGENOW” ACTIVE FIGURE 13.15 Lineweaver-Burk plot of pure noncompetitive inhibition. Note that | 
does not alter Km but that it decreases Vmax. Test yourself on the concepts in this figure at www.cengage 
.com/login. 
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CENGAGENOW” ACTIVE FIGURE 13.16 Lineweaver-Burk plot of mixed noncompetitive inhibition. Note that 
both intercepts and the slope change in the presence of I. (a) When K; is less than K’; (b) when K; is greater 
than K’. Test yourself on the concepts in this figure at www.cengage.com/login. 


by the presence of I, and K„/Vmax is not constant (Figure 13.16). This inhibitory pat- 
tern is commonly encountered. A reasonable explanation is that the inhibitor is bind- 
ing at a site distinct from the active site yet is influencing the binding of S at the active 
site. Presumably, these effects are transmitted via alterations in the protein’s confor- 
mation. Table 13.6 includes the rate equations and apparent K„ and Vinax values for 
both types of noncompetitive inhibition. 


Uncompetitive Inhibition Completing the set of inhibitory possibilities is un- 
competitive inhibition. Unlike competitive inhibition (where I combines only with 
E) or noncompetitive inhibition (where I combines with E and ES), in uncompeti- 
tive inhibition, I combines only with ES. 


ey 
ES + I= IES (13.45) 





Because IES does not lead to product formation, the observed rate constant for 
product formation, ks, is uniquely affected. In simple Michaelis-Menten kinetics, 
ky is the only rate constant that is part of both V,,,, and Kn. The pattern obtained 
in Lineweaver—Burk plots is a set of parallel lines (Figure 13.17). A clinically im- 
portant example is the action of lithium in alleviating manic depression; Li* ions 
are uncompetitive inhibitors of myo-inositol monophosphatase. Some pesticides 
are also uncompetitive inhibitors, such as Roundup, an uncompetitive inhibitor of 
3-enolpyruvylshikimate-5-P synthase, an enzyme essential to aromatic amino acid 
biosynthesis (see Chapter 25). 


Enzymes Also Can Be Inhibited in an Irreversible Manner 


If the inhibitor combines irreversibly with the enzyme—for example, by covalent at- 
tachment—the kinetic pattern seen is like that of noncompetitive inhibition, be- 
cause the net effect is a loss of active enzyme. Usually, this type of inhibition can be 
distinguished from the noncompetitive, reversible inhibition case because the re- 
action of I with E (and/or ES) is not instantaneous. Instead, there is a time-dependent 
decrease in enzymatic activity as E + I—EI proceeds, and the rate of this inactivation 
can be followed. Also, unlike reversible inhibitions, dilution or dialysis of the en- 
zyme ‘inhibitor solution does not dissociate the EI complex and restore enzyme 
activity. 


Suicide Substrates—Mechanism-Based Enzyme Inactivators Suicide sub- 
strates are inhibitory substrate analogs designed so that, via normal catalytic ac- 
tion of the enzyme, a very reactive group is generated. This reactive group then 
forms a covalent bond with a nearby functional group within the active site of the 
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FIGURE 13.17 Lineweaver—Burk plot of uncompetitive 
inhibition. Note that both intercepts change but the 
slope (Kn/Vmax) remains constant in the presence of |. 


FIGURE 13.18 Penicillin is an irreversible inhibitor of the 
enzyme glycopeptide transpeptidase, also known as 
glycoprotein peptidase, which catalyzes an essential step 
in bacterial cell wall synthesis. 
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enzyme, thereby causing irreversible inhibition. Suicide substrates, also called 
Trojan horse substrates, are a type of affinity label. As substrate analogs, they bind 
with specificity and high affinity to the enzyme active site; in their reactive form, 
they become covalently bound to the enzyme. This covalent link effectively labels 
a particular functional group within the active site, identifying the group as a key 
player in the enzyme’s catalytic cycle. 





Penicillin 
OH 
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— >> 
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Penicilloyl-enzyme complex 
(enzymatically inactive) 


13.5 What Is the Kinetic Behavior of Enzymes Catalyzing Bimolecular Reactions? 


Penicillin—A Suicide Substrate Several drugs in current medical use are 
mechanism-based enzyme inactivators. For example, the antibiotic penicillin exerts 
its effects by covalently reacting with an essential serine residue in the active site of 
glycopeptide transpeptidase, an enzyme that acts to crosslink the peptidoglycan chains 
during synthesis of bacterial cell walls (Figure 13.18). Penicillin consists of a thia- 
zolidine ring fused to a 6-lactam ring to which a variable R group is attached. A re- 
active peptide bond in the -lactam ring covalently attaches to a serine residue in 
the active site of the glycopeptide transpeptidase. (The conformation of penicillin 
around its reactive peptide bond resembles the transition state of the normal glyco- 
peptide transpeptidase substrate.) The penicillinoyl-enzyme complex is catalyti- 
cally inactive. Once cell wall synthesis is blocked, the bacterial cells are very sus- 
ceptible to rupture by osmotic lysis and bacterial growth is halted. 


1325: What Is the Kinetic Behavior of Enzymes Catalyzing 
Bimolecular Reactions? 


Thus far, we have considered only the simple case of enzymes that act upon a sin- 
gle substrate, S. This situation is not common. Usually, enzymes catalyze reac- 
tions in which two (or even more) substrates take part. 

Consider the case of an enzyme catalyzing a reaction involving two substrates, A 
and B, and yielding the products P and Q: 


enzyme 





A+B==P+Q (13.46) 


Such a reaction is termed a bisubstrate reaction. In general, bisubstrate reactions 
proceed by one of two possible routes: 





1. Both A and B are bound to the enzyme and then reaction occurs to give P + Q: 
E + A + B—>AEB— > PEQ—>E+P+Q (13.47) 
Reactions of this type are defined as sequential or single-displacement reactions. 
They can be either of two distinct classes: 
a. random, where either A or B may bind to the enzyme first, followed by the 
other substrate, or 
b. ordered, where A, designated the leading substrate, must bind to E first before 
B can be bound. 
Both classes of single-displacement reactions are characterized by lines that in- 
tersect to the left of the 1/v axis in Lineweaver—Burk plots where the rates ob- 
served with different fixed concentrations of one substrate (B) are graphed ver- 
sus a series of concentrations of A (Figure 13.19). 
2. The other general possibility is that one substrate, A, binds to the enzyme and 
reacts with it to yield a chemically modified form of the enzyme (E’) plus the 
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FIGURE 13.19 Single-displacement bisubstrate 
mechanism. 
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HUMAN BIOCHEMISTRY 


Viagra—An Unexpected Outcome in a Program of Drug Design 


Prior to the accumulation of detailed biochemical information 
on metabolism, enzymes, and receptors, drugs were fortuitous 
discoveries made by observant scientists; the discovery of peni- 
cillin as a bacteria-killing substance by Fleming is an example. 
Today, drug design is the rational application of scientific knowl- 
edge and principles to the development of pharmacologically ac- 
tive agents. A particular target for therapeutic intervention is 
identified (such as an enzyme or receptor involved in illness), 
and chemical analogs of its substrate or ligand are synthesized in 
hopes of finding an inhibitor (or activator) that will serve as a 
drug to treat the illness. Sometimes the outcome is unantici- 
pated, as the story of Viagra (sildenafil citrate) reveals. 

When the smooth muscle cells of blood vessels relax, blood flow 
increases and blood pressure drops. Such relaxation is the result of 
decreases in intracellular [Ca?*] triggered by increases in intracel- 
lular [CGMP] (which in turn is triggered by nitric oxide, NO; see 
Chapter 32). Cyclic GMP (cGMP) is hydrolyzed by phosphodiesterases 
to form 5’-GMP, and the muscles contract again. Scientists at Pfizer 





cGMP 


CHCHoO HN 


reasoned that, if phosphodiesterase inhibitors could be found, they 
might be useful drugs to treat angina (chest pain due to inadequate 
blood flow to heart muscle) or hypertension (high blood pressure). 
The phosphodiesterase (PDE) prevalent in vascular muscle is 
PDE 5, one of at least nine different substypes of PDE in human 
cells. The search was on for substances that inhibit PDE 5, but not 
the other prominent PDE types, and Viagra was found. Disappoint- 
ingly, Viagra showed no significant benefits for angina or hyperten- 
sion, but some men in clinical trials reported penile erection. Ap- 
parently, Viagra led to an increase in [CGMP] in penile vascular 
tissue, allowing vascular muscle relaxation, improved blood flow, 
and erection. A drug was born. 

In a more focused way, detailed structural data on enzymes, re- 
ceptors, and the ligands that bind to them has led to rational drug 
design, in which computer modeling of enzyme-ligand interactions re- 
places much of the initial chemical synthesis and clinical pre- 
screening of potential therapeutic agents, saving much time and 
effort in drug development. 
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Viagra cGMP (left) and Viagra (right). 











product, P. The second substrate, B, then reacts with E’, regenerating E and 
forming the other product, Q. 


Se 
P B 


(13.48) 


Reactions that fit this model are called ping-pong or double-displacement reactions. 
Two distinctive features of this mechanism are the obligatory formation of a modi- 
fied enzyme intermediate, E’, and the pattern of parallel lines obtained in double- 
reciprocal plots of the rates observed with different fixed concentrations of one sub- 
strate (B) versus a series of concentrations of A (see Figure 13.22). 


The Conversion of AEB to PEQ Is the Rate-Limiting Step in Random, 
Single-Displacement Reactions 


In this type of sequential reaction, all possible binary enzyme-substrate complexes 
(AE, EB, PE, EQ) are formed rapidly and reversibly when the enzyme is added to a 
reaction mixture containing A, B, P, and Q: 


T 
ae 


EP => P +E 
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AEB == PE 


e) 
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QE == E + Q (13.49) 
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The rate-limiting step is the reaction AEB—>PEQ. It doesn’t matter whether A or 
B binds first to E, or whether Q or P is released first from QEP. Sometimes, re- 
actions that follow this random order of addition of substrates to E can be dis- 
tinguished from reactions obeying an ordered, single-displacement mechanism. 
If A has no influence on the binding constant for B (and vice versa) and the 
mechanism is purely random, the lines in a Lineweaver—Burk plot intersect at the 
1/[A] axis (Figure 13.20). 


Creatine Kinase Acts by a Random, Single-Displacement Mechanism An exam- 
ple of a random, single-displacement mechanism is seen in the enzyme creatine ki- 
nase, a phosphoryl transfer enzyme that uses ATP as a phosphoryl donor to form 
creatine phosphate (CrP) from creatine (Cr). Creatine-P is an important reservoir 
of phosphate-bond energy in muscle cells (Figure 13.21). 


ADP:E == ADP +E 


Vi 


ATP:E:Cr == ADP:E:CrP 


\ 


E:CrP == E + CrP 


ATP + E == ATP:E 


NZ 


E + Cr == E:Cr 


The overall direction of the reaction will be determined by the relative concentra- 
tions of ATP, ADP, Cr, and CrP and the equilibrium constant for the reaction. The 
enzyme can be considered to have two sites for substrate (or product) binding: an 
adenine nucleotide site, where ATP or ADP binds, and a creatine site, where Cr or 
CrP is bound. In such a mechanism, ATP and ADP compete for binding at their 
unique site while Cr and CrP compete at the specific Cr/CrP-binding site. Note that 
no modified enzyme form (E’), such as an E-PO, intermediate, appears here. The 
reaction is characterized by rapid and reversible binary ES complex formation, fol- 
lowed by addition of the remaining substrate, and the rate-determining reaction 
taking place within the ternary complex. 


In an Ordered, Single-Displacement Reaction, the Leading Substrate 
Must Bind First 


In this case, the leading substrate, A (also called the obligatory or compulsory 
substrate), must bind first. Then the second substrate, B, binds. Strictly speaking, 
B cannot bind to free enzyme in the absence of A. Reaction between A and B 
occurs in the ternary complex and is usually followed by an ordered release of 
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FIGURE 13.20 Random, single-displacement bisubstrate 
mechanism where A does not affect B binding, and vice 
versa. 


Creatine 


Creatine-P 


FIGURE 13.21 The structures of creatine and creatine 
phosphate, guanidinium compounds that are important 
in muscle energy metabolism. 
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the products of the reaction, P and Q. In the following schemes, P is the product 
of A and is released last. One representation, suggested by W. W. Cleland, 
follows: 


A Q P 


| 


E AE AEB =— PEQ PE E (13.50) 








Another way of portraying this mechanism is as follows: 
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Note that A and P are competitive for binding to the free enzyme, E, but not A and 
B (or P and B). 


NAD+-Dependent Dehydrogenases Show Ordered Single-Displacement Mecha- 
nisms Nicotinamide adenine dinucleotide (NAD*)-dependent dehydrogenases are enzymes 
that typically behave according to the kinetic pattern just described. A general reac- 
tion of these dehydrogenases is 


NAD* + BHyg==NADH + H* +B 


The leading substrate (A) is nicotinamide adenine dinucleotide (NAD+), and 
NAD* and NADH (product P) compete for a common site on E. A specific exam- 
ple is offered by alcohol dehydrogenase (ADH): 


NAD+ + CH;CHOH==NADH + H* + CH;CHO 
(A) ethanol (P) acetaldehyde 
(B) (Q) 


We can verify that this ordered mechanism is not random by demonstrating that no 
B (ethanol) is bound to E in the absence of A (NAD*). 


Double-Displacement (Ping-Pong) Reactions Proceed Via Formation of 
a Covalently Modified Enzyme Intermediate 


Double-displacement reactions are characterized by a pattern of parallel lines 
when 1/v is plotted as a function of 1/[A] at different concentrations of B, the 
second substrate (Figure 13.22). Reactions conforming to this kinetic pattern are 
characterized by the fact that the product of the enzyme’s reaction with A (called 
P in the following schemes) is released prior to reaction of the enzyme with the 
second substrate, B. As a result of this process, the enzyme, E, is converted to a 
modified form, E’, which then reacts with B to give the second product, Q, and 
regenerate the unmodified enzyme form, E: 


A P B Q 


| | | 
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Note that these schemes predict that A and Q compete for the free enzyme form, 
E, while B and P compete for the modified enzyme form, E’. A and Q do not bind 
to E’, nor do B and P combine with E. 


w 





E'B 





Aminotransferases Show Double-Displacement Catalytic Mechanisms One 
class of enzymes that follow a ping-pong—type mechanism are aminotransferases 
(previously known as transaminases). These enzymes catalyze the transfer of an 
amino group from an amino acid to an a-keto acid. The products are a new 
amino acid and the keto acid corresponding to the carbon skeleton of the amino 
donor: 


amino acid, + keto acidg— keto acid, + amino acids 


A specific example would be glutamate: aspartate aminotransferase. Figure 13.23 de- 
picts the scheme for this mechanism. Note that glutamate and aspartate are com- 
petitive for E and that oxaloacetate and a-ketoglutarate compete for E’. In gluta- 
mate : aspartate aminotransferase, an enzyme-bound coenzyme, pyridoxal phosphate 
(a vitamin Bg derivative), serves as the amino group acceptor/donor in the enzy- 
matic reaction. The unmodified enzyme, E, has the coenzyme in the aldehydic 
pyridoxal form, whereas in the modified enzyme, E’, the coenzyme is actually pyri- 
doxamine phosphate (Figure 13.23). Not all enzymes displaying ping-pong—type 
mechanisms require coenzymes as carriers for the chemical substituent trans- 
ferred in the reaction. 
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FIGURE 13.22 Double-displacement (ping-pong) bisub- 
strate mechanisms are characterized by parallel lines. 
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FIGURE 13.23 Glutamate : aspartate aminotransferase, an 
enzyme conforming to a double-displacement bisub- 


strate mechanism. 
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Exchange Reactions Are One Way to Diagnose Bisubstrate Mechanisms 


Kineticists rely on a number of diagnostic tests for the assignment of a reaction 
mechanism to a specific enzyme. One is the graphic analysis of the kinetic patterns 
observed. It is usually easy to distinguish between single- and double-displacement 
reactions in this manner, and examining competitive effects between substrates aids 
in assigning reactions to random versus ordered patterns of S binding. A second di- 
agnostic test is to determine whether the enzyme catalyzes an exchange reaction. 
Consider as an example the two enzymes sucrose phosphorylase and maltose phosphory- 
lase. Both catalyze the phosphorolysis of a disaccharide and both yield glucose-1- 
phosphate and a free hexose: 


Sucrose + P; = —glucose-1-phosphate + fructose 


Maltose + P; = —glucose-l-phosphate + glucose 





Interestingly, in the absence of sucrose and fructose, sucrose phosphorylase will 
catalyze the exchange of inorganic phosphate, P;, into glucose-1-phosphate. This re- 
action can be followed by using **P; as a radioactive tracer and observing the incor- 
poration of **P into glucose-1-phosphate: 





322P, + G-1-P==P, + G-1-"P 





Maltose phosphorylase cannot carry out a similar reaction. The **P exchange reac- 
tion of sucrose phosphorylase is accounted for by a double-displacement mecha- 
nism where E’ is E-glucose: 


Sucrose + E=—E-glucose + fructose 
E-glucose + P| =—=E + glucose-l-phosphate 


Thus, in the presence of just **P; and glucose-1-phosphate, sucrose phosphorylase 
still catalyzes the second reaction and radioactive P; is incorporated into glucose-1- 
phosphate over time. 

Maltose phosphorylase proceeds via a single-displacement reaction that neces- 
sarily requires the formation of a ternary maltose:E:P; (or glucose: E:glucose-1- 
phosphate) complex for any reaction to occur. Exchange reactions are a character- 


istic of enzymes that obey double-displacement mechanisms at some point in their 
catalysis. 


Multisubstrate Reactions Can Also Occur in Cells 


Thus far, we have considered enzyme-catalyzed reactions involving one or two 
substrates. How are the kinetics described in those cases in which more than two 
substrates participate in the reaction? An example might be the glycolytic enzyme 
glyceraldehyde-3-phosphate dehydrogenase (see Chapter 18): 





NAD* + glyceraldehyde-3-P + P; NADH + H+ + 1,3-bisphosphoglycerate 





Many other multisubstrate examples abound in metabolism. In effect, these situations 
are managed by realizing that the interaction of the enzyme with its many substrates 
can be treated as a series of unisubstrate or bisubstrate steps in a multistep reaction 
pathway. Thus, the complex mechanism of a multisubstrate reaction is resolved into 
a sequence of steps, each of which obeys the single- and double-displacement patterns 
just discussed. 


13261 How Can Enzymes Be So Specific? 


The extraordinary ability of an enzyme to catalyze only one particular reaction is a 
quality known as specificity. Specificity means an enzyme acts only on a specific sub- 
stance, its substrate, invariably transforming it into a specific product. That is, an en- 
zyme binds only certain compounds, and then, only a specific reaction ensues. 
Some enzymes show absolute specificity, catalyzing the transformation of only one 
specific substrate to yield a unique product. Other enzymes carry out a particular 
reaction but act on a class of compounds. For example, hexokinase (ATP : hexose-6- 
phosphotransferase) will carry out the ATP-dependent phosphorylation of a num- 
ber of hexoses at the 6-position, including glucose. Specificity studies on enzymes 
entail an examination of the rates of the enzymatic reaction obtained with various 
structural analogs of the substrate. By determining which functional and structural 
groups within the substrate affect binding or catalysis, enzymologists can map the 
properties of the active site, analyzing questions such as: Can the active site accom- 
modate sterically bulky groups? Are ionic interactions between E and S important? 
Are H bonds formed? 


The “Lock and Key” Hypothesis Was the First Explanation for Specificity 


Pioneering enzyme specificity studies at the turn of the 20th century by the great or- 
ganic chemist Emil Fischer led to the notion of an enzyme resembling a “lock” and 
its particular substrate the “key.” This analogy captures the essence of the specificity 
that exists between an enzyme and its substrate, but enzymes are not rigid templates 
like locks. 


The “Induced Fit” Hypothesis Provides a More Accurate Description 

of Specificity 

Enzymes are highly flexible, conformationally dynamic molecules, and many of 
their remarkable properties, including substrate binding and catalysis, are due to 
their structural pliancy. Realization of the conformational flexibility of proteins led 
Daniel Koshland to hypothesize that the binding of a substrate by an enzyme is an 
interactive process. That is, the shape of the enzyme’s active site is actually modified 
upon binding S, in a process of dynamic recognition between enzyme and substrate 
aptly called induced fit. In essence, substrate binding alters the conformation of the 
protein, so that the protein and the substrate “fit” each other more precisely. The 
process is truly interactive in that the conformation of the substrate also changes as 
it adapts to the conformation of the enzyme. 


13.6 How Can Enzymes Be So Specific? 409 








The “Induced Fit” Hypothesis 
Provides a More Accurate 
Description of Specificity 


New ideas do not always gain immediate 
acceptance: “Although we did many ex- 
periments that in my opinion could only 
be explained by the induced-fit theory, 
gaining acceptance for the theory was still 
an uphill fight. One (journal) referee 
wrote, ‘The Fischer Key-Lock theory has 
lasted 100 years and will not be over- 
turned by speculation from an embryonic 
scientist.” Daniel Koshland, 1996. How to 
get paid for having fun. Annual Review of 
Biochemistry 65:1-13. 
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FIGURE 13.24 A drawing, roughly to scale, of H20, glycerol, glucose, and an idealized hexokinase molecule. 


This idea also helps explain some of the mystery surrounding the enormous cat- 
alytic power of enzymes: In enzyme catalysis, precise orientation of catalytic residues 
comprising the active site is necessary for the reaction to occur; substrate binding in- 
duces this precise orientation by the changes it causes in the protein’s conformation. 


“Induced Fit” Favors Formation of the Transition State 


The catalytically active enzyme substrate complex is an interactive structure in 
which the enzyme causes the substrate to adopt a form that mimics the transition 
state of the reaction. Thus, a poor substrate would be one that was less effective in 
directing the formation of an optimally active enzyme: transition state conforma- 
tion. This active conformation of the enzyme molecule is thought to be relatively 
unstable in the absence of substrate, and free enzyme thus reverts to a conforma- 
tionally different state. 


Specificity and Reactivity 


Consider, for example, why hexokinase catalyzes the ATP-dependent phosphor- 
ylation of hexoses but not smaller phosphoryl-group acceptors such as glyc- 
erol, ethanol, or even water. Surely these smaller compounds are not sterically for- 
bidden from approaching the active site of hexokinase (Figure 13.24). Indeed, wa- 
ter should penetrate the active site easily and serve as a highly effective phosphoryl- 
group acceptor. Accordingly, hexokinase should display high ATPase activity. It does 
not. Only the binding of hexoses induces hexokinase to assume its fully active con- 
formation. The hexose-binding site of hexokinase is located between two protein 
domains. Binding of glucose in the active site induces a conformational change in 
hexokinase that causes the two domains to close upon one another, creating the 
catalytic site. 

In Chapter 14, we explore in greater detail the factors that contribute to the re- 
markable catalytic power of enzymes and examine specific examples of enzyme re- 
action mechanisms. 


‘1307/5 Are All Enzymes Proteins? 


RNA Molecules That Are Catalytic Have Been Termed “Ribozymes” 


It was long assumed that all enzymes are proteins. However, several decades ago, in- 
stances of biological catalysis by RNA molecules were discovered. Catalytic RNAs, or 
ribozymes, satisfy several enzymatic criteria: They are substrate specific, they enhance 
the reaction rate, and they emerge from the reaction unchanged. Most ribozymes act 


in RNA processing, cutting the phosphodiester backbone at specific sites and religat- 
ing needed segments to form functional RNA strands while discarding extraneous 
pieces. For example, bacterial RNase P is a ribozyme involved in the formation of ma- 
ture tRNA molecules from longer RNA transcripts. RNase P requires an RNA com- 
ponent as well as a protein subunit for its activity in the cell. In vitro, the protein alone 
is incapable of catalyzing the maturation reaction, but the RNA component by itself 
can carry out the reaction under appropriate conditions. As another example, the in- 
trons within some rRNAs and mRNAs are ribozymes that can catalyze their own exci- 
sion from large RNA transcripts by a process known as self-splicing. For instance, in 
the ciliated protozoan Tetrahymena, formation of mature ribosomal RNA from a pre- 
rRNA precursor involves the removal of an internal RNA segment and the joining of 
the two ends. The excision of this intron and ligation of the exons is catalyzed by the 
intron itself, in the presence of Mg?* and a free molecule of guanosine nucleoside or 
nucleotide (Figure 13.25). In vivo, the intervening sequence RNA probably acts only 
in splicing itself out; in vitro, however, it can act many times, turning over like a true 
enzyme. 


The Ribosome Is aRibozyme A particularly significant case of catalysis by RNA oc- 
curs in protein synthesis. The peptidyl transferase reaction, which is the reaction of 
peptide bond formation during protein synthesis, is catalyzed by the 23S rRNA of the 
50S subunit of ribosomes (see Chapters 10 and 30). The substrates for the peptidyl 
transferase reaction are two tRNA molecules, one bearing the growing peptide 
chain (the peptidyl-tRNAp) and the other bearing the next amino acid to be added 
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FIGURE 13.25 RNA splicing in Tetrahymena rRNA maturation: (a) the guanosine-mediated reaction involved in 
the autocatalytic excision of the Tetrahymena rRNA intron and (b) the overall splicing process. The cyclized in- 
tron is formed via nucleophilic attack of the 3’-OH on the phosphodiester bond that is 15 nucleotides from 
the 5'-GA end of the spliced-out intron. Cyclization frees a linear 15-mer with a 5'-GA end. 
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FIGURE 13.27 The peptidyl transferase reaction. 
Abstraction of an amide proton from the a-amino 
group of the aminoacyl-tRNA; (shown in red) by the 
2'-O of the terminal A of the peptidyl-tRNAo (blue) is 
aided by hydrogen-bonding interactions with neigh- 
boring 23S rRNA nucleotides (green). These interactions 
facilitate nucleophilic attack by the a-amino group of 
the aminoacyl-tRNA, on the carbonyl C of the peptidyl- 
tRNAp and peptide bond formation between the 
incoming amino acid and the growing peptide chain to 
give a one-residue-longer peptide chain attached to the 
tRNAa. (Adapted from Figure 3 in Beringer, M., and Rodnina, 
M.V, 2007.The ribosomal peptidyl transferase. Molecular Cell 
26:311-321,) 


(b) 


G2551 





FIGURE 13.26 (a) The 50S subunit from H. marismortui (pdb id = 1FFK). Ribosomal proteins are shown in blue, 
the 23S rRNA backbone in brown, the 5S rRNA backbone in olive, and a tRNA substrate analog in red. The tRNA 
analog identifies the peptidyl transferase catalytic center of the 50S subunit. (b) The aminoacyl-tRNA, (yellow) 
and the peptidyl-tRNAp (orange) in the peptidyl transferase active site. Bases of the 23S rRNA shown in green 
and labeled according to their position in the 23S rRNA sequence. Interactions between the tRNAs and the 
23S rRNA are indicated by dotted lines. The a-amino group of the aminoacyl-tRNA, (blue) is positioned for the 
attack on the carbonyl-C (green) peptidyl-tRNAp. (Adapted from Figure 2 in Beringer, M., and Rodnina, M.V., 2007. The ri- 
bosomal peptidyl transferase. Molecular Cell 26:31 1-321.) 


(the aminoacyl-tRNA,). Both the peptidyl chain and the amino acid are attached to 
their respective tRNAs via ester bonds to the O atom at the CCA-3' ends of these 
tRNAs (see Figure 11.33). Base-pairing between these C residues in the two tRNAs 
and G residues in the 23S rRNA position the substrates for the reaction to occur 
(Figure 13.26). The two Cs at the peptidyl-tRNAp CCA end pair with G2251 and 
G2252 of the 23S rRNA, and the last C (C75) at the 3’-end of the aminoacyl-tRNA, 
pairs with G2553. The 3'-terminal A of the aminoacyl-tRNA, interacts with G2583, 
and the terminal A of the peptidyl-tRNAp binds to A2450. Addition of the incoming 
amino acid to the peptidyl chain occurs when the a-amino group of the aminoacyl- 
tRNA, makes a nucleophilic attack on the carbonyl C linking the peptidyl chain to 
its t(RNAp. Specific 23S rRNA bases and ribose-OH groups facilitate this nucleophilic 
attack by favoring proton abstraction from the aminoacyl a-amino group (Figure 
13.27). The products of this reaction are a one-residue-longer peptidyl chain at- 
tached to the tRNA, and the “empty” tRNAp. 

The fact that RNA can catalyze such important reactions is experimental sup- 
port for the idea that a primordial world dominated by RNA molecules existed 
before the evolution of DNA and proteins. Sidney Altman and Thomas R. Cech 
shared the 1989 Nobel Prize in Chemistry for their discovery of the catalytic prop- 
erties of RNA. 
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Antibody Molecules Can Have Catalytic Activity 


Antibodies are immunoglobulins, which, of course, are proteins. Catalytic antibodies 
are antibodies with catalytic activity (catalytic antibodies are also called abzymes, a 
word created by combining “Ab,” the abbreviation for antibody, with “enzyme.”) 
Like other antibodies, catalytic antibodies are elicited in an organism in response 
to immunological challenge by a foreign molecule called an antigen (see Chapter 
28 for discussions on the molecular basis of immunology). In this case, however, 
the antigen is purposefully engineered to be an analog of the transition state in a re- 
action. The rationale is that a protein specific for binding the transition state of a 
reaction will promote entry of the normal reactant into the reactive, transition- 
state conformation. Thus, a catalytic antibody facilitates, or catalyzes, a reaction by 
forcing the conformation of its substrate in the direction of its transition state. (A 
prominent explanation for the remarkable catalytic power of conventional en- 
zymes is their great affinity for the transition state in the reactions they catalyze; see 
Chapter 14.) 

One proof of this principle has been to prepare ester analogs by substituting 
a phosphorus atom for the carbon in the ester group (Figure 13.28). The phos- 
phonate compound mimics the natural transition state of ester hydrolysis, and 
antibodies elicited against these analogs act like enzymes in accelerating 
the rate of ester hydrolysis as much as 1000-fold. Abzymes have been developed 
for a number of other classes of reactions, including C—C bond formation via al- 
dol condensation (the reverse of the aldolase reaction [see Figure 13.2, reaction 
4, and Chapter 18]) and the pyridoxal 5'-P-dependent aminotransferase reac- 
tion shown in Figure 13.23. This biotechnology offers the real possibility of cre- 
ating specially tailored enzymes designed to carry out specific catalytic processes. 

Catalytic antibodies apparently occur naturally. Autoimmune diseases are dis- 
eases that arise because an individual begins to produce antibodies against one of 
their own cellular constituents. Multiple sclerosis (MS), one such autoimmune dis- 
ease, is characterized by gradual destruction of the myelin sheath surrounding neu- 
rons throughout the brain and spinal cord. Blood serum obtained from some MS 
patients contains antibodies capable of carrying out the proteolytic destruction of 
myelin basic protein (MBP). That is, these antibodies were MBP-destructive pro- 
teases. Similarly, hemophilia A is a blood-clotting disorder due to lack of the factor 
VIII, an essential protein for formation of a blood clot. Serum from some sufferers 
of hemophilia A contained antibodies with proteolytic activity against factor VIII. 
Thus, some antibodies may be proteases. 
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FIGURE 13.28 (a) The intramolecular hydrolysis of a hydroxy ester to yield as products a 6-lactone and the 
alcohol phenol. Note the cyclic transition state. (b) The cyclic phosphonate ester analog of the cyclic transition 
state. 
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13285 Is It Possible to Design an Enzyme to Catalyze Any 
Desired Reaction? 


Enzymes have evolved to catalyze metabolic reactions with high selectivity, speci- 
ficity, and rate enhancements. Given these remarkable attributes, it would be very 
desirable to have the ability to create designer enzymes individually tailored to cat- 
alyze any imaginable reaction, particularly those that might have practical uses in 
industrial chemistry, the pharmaceutical industry, or environmental remediation. 
To this end, several approaches have been taken to create a desired enzyme de 
novo (de novo: literally “anew”; colloquially “from scratch.” In biochemistry, the syn- 
thesis of some end product from simpler precursors.) Most approaches begin with 
a known enzyme and then engineer it by using in vitro mutagenesis (see Chapter 
12) to replace active-site residues with a new set that might catalyze the desired re- 
action. This strategy has the advantage that the known protein structure provides 
a stable scaffold into which a new catalytic site can be introduced. As pointed out 
in Chapter 6, despite the extremely large number of possible amino acid se- 
quences for a polypeptide chain, a folded protein adopts one of a rather limited 
set of core protein structures. Yet proteins have an extraordinary range of func- 
tional possibilities. So, this approach is rational. A second, more difficult, approach 
attempts the completely new design of a protein with the desired structure and ac- 
tivity. Often, this approach relies on in silico methods, where the folded protein 
structure and the spatial and reactive properties of its putative active site are mod- 
eled, refined, and optimized via computer. Although this approach has fewer lim- 
itations in terms of size and shape of substrates, it brings other complications, such 
as protein folding and stability, to the problem, to say nothing of the difficulties of 
going from the computer model (in silico) to a real enzyme in a cellular environ- 
ment (in vivo). 

Enzymes have shown adaptability over the course of evolution. New enzyme func- 
tions have appeared time and time again, as mutation and selection according to 
Darwinian principles operate on existing enzymes. Some enzyme designers have 
coupled natural evolutionary processes with rational design using in vitro mutage- 
nesis. Expression of mutated versions of the gene encoding the enzyme in bacteria, 
followed by rounds of selection for bacteria producing an enzyme with even better 
catalytic properties, takes advantage of naturally occurring processes to drive fur- 
ther mutation and selection for an optimal enzyme. This dual approach is whimsi- 
cally referred to as semirational design because it relies on the rational substitution of 
certain amino acids with new ones in the active site, followed by directed evolution 
(selection for bacteria expressing more efficient versions of the enzyme). 

An example of active-site engineering is the site-directed mutation of an epox- 
ide hydrolase to change its range of substrate selection so that it now acts on chlo- 
rinated epoxides (Figure 13.29). Degradation of chlorinated epoxides is a major 
problem in the removal of toxic pollutants from water resources. Mutation of a 
bacterial epoxide hydrolase at three active-site residues (F!°, I?!, and C*48) and se- 
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FIGURE 13.29 cis-1,2-Dichloroethylene (DCE) is an industrial solvent that poses hazards to human health; DCE 
occurs as a pollutant in water sources. Bacterial metabolism of DCE to form cis-1,2-dichloroepoxyethane (step 
1) occurs readily, but enzymatic degradation of the epoxide to glyoxal and chloride ions (step 2) is limited. 
Microbial detoxification of DCE in ground water requires enzymes for both steps 1 and 2. Genetic engineering 
of an epoxide hydrolase to create an enzyme capable of using cis-1,2-dichloroepoxyethane as a substrate is a 
practical example of de novo enzyme design. 
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lection in bacteria for enhanced chlorinated epoxide hydrolase activity yielded an 
F108L, I219L, C248I mutant enzyme that catalyzed the conversion of cis-dichloroe- 
poxyethane to Cl” ions and glyoxal with a dramatically increased Vnax/ Kn ratio. 


SUMMARY 


Living systems use enzymes to accelerate and control the rates of vitally 
important biochemical reactions. Enzymes provide kinetic control over 
thermodynamic potentiality: Reactions occur in a timeframe suitable to 
the metabolic requirements of cells. Enzymes are the agents of meta- 
bolic function. 


13.1 What Characteristic Features Define Enzymes? Enzymes can be 
characterized in terms of three prominent features: catalytic power, speci- 
ficity, and regulation. The site on the enzyme where substrate binds and 
catalysis occurs is called the active site. Regulation of enzyme activity is es- 
sential to the integration and regulation of metabolism. 


13.2 Can the Rate of an Enzyme-Catalyzed Reaction Be Defined in a 
Mathematical Way? Enzyme kinetics can determine the maximum re- 
action velocity that the enzyme can attain, its binding affinities for sub- 
strates and inhibitors, and the mechanism by which it accomplishes its 
catalysis. The kinetics of simple chemical reactions provides a foundation 
for exploring enzyme kinetics. Enzymes, like other catalysts, act by lower- 
ing the free energy of activation for a reaction. 


13.3 What Equations Define the Kinetics of Enzyme-Catalyzed Reac- 
tions? A plot of the velocity of an enzyme-catalyzed reaction v versus the 
concentration of the substrate S is called a substrate saturation curve. The 
Michaelis-Menten equation is derived by assuming that E combines with 
S to form ES and then ES reacts to give E + P. Rapid, reversible combina- 
tion of E and S and ES breakdown to yield P reach a steady-state condition 
where [ES] is essentially constant. The Michaelis-Menten equation says 
that the initial rate of an enzyme reaction, v, is determined by two con- 
stants, K,, and Vmax and the initial concentration of substrate. The 
turnover number of an enzyme, kca is a measure of its maximal catalytic 
activity (the number of substrate molecules converted into product per en- 
zyme molecule per unit time when the enzyme is saturated with substrate). 
The ratio k,,,/K,, defines the catalytic efficiency of an enzyme. This ratio, 
Reat/ Km, cannot exceed the diffusion-controlled rate of combination of E 
and S to form ES. 

Several rearrangements of the Michaelis-Menten equation trans- 
form it into a straight-line equation, a better form for experimental de- 
termination of the constants K,, and Vmax and for detection of regula- 
tory properties of enzymes. 


13.4 What Can Be Learned from the Inhibition of Enzyme Activity? 
Inhibition studies on enzymes have contributed significantly to our 
understanding of enzymes. Inhibitors may interact either reversibly or 
irreversibly with an enzyme. Reversible inhibitors bind to the enzyme 
through noncovalent association/dissociation reactions. Irreversible 
inhibitors typically form stable, covalent bonds with the enzyme. Re- 
versible inhibitors may bind at the active site of the enzyme (competi- 
tive inhibition) or at some other site on the enzyme (noncompetitive 
inhibition). Uncompetitive inhibitors bind only to the ES complex. 


13.5 What Is the Kinetic Behavior of Enzymes Catalyzing Bimolecular 
Reactions? Usually, enzymes catalyze reactions in which two (or even 


more) substrates take part, so the reaction is bimolecular. Several pos- 
sibilities arise. In single-displacement reactions, both substrates, A and 
B, are bound before reaction occurs. In double-displacement (or ping- 
pong) reactions, one substrate (A) is bound and reaction occurs to 
yield product P and a modified enzyme form, E’. The second substrate 
(B) then binds to E’ and reaction occurs to yield product Q and E, the 
unmodified form of enzyme. Graphical methods can be used to distin- 
guish these possibilities. Exchange reactions are another way to diag- 
nose bisubstrate mechanisms. 


13.6 How Can Enzymes Be So Specific? Early enzyme specificity 
studies by Emil Fischer led to the hypothesis that an enzyme resembles 
a “lock” and its particular substrate the “key.” However, enzymes are 
not rigid templates like locks. Koshland noted that the conformation 
of an enzyme is dynamic and hypothesized that the interaction of E 
with S is also dynamic. The enzyme’s active site is actually modified 
upon binding S, in a process of dynamic recognition between enzyme 
and substrate called induced fit. Hexokinase provides a good illustra- 
tion of the relationship between substrate binding, induced fit, and 
catalysis. 


13.7 Are All Enzymes Proteins? Not all enzymes are proteins. Cat- 
alytic RNA molecules (“ribozymes”) play important cellular roles 
in RNA processing and protein synthesis, among other things. Catalytic 
RNAs give support to the idea that a primordial world dominated by 
RNA molecules existed before the evolution of DNA and proteins. 
Antibodies that have catalytic activity (“abzymes”) can be elicited in 
an organism in response to immunological challenge with an analog of 
the transition state for a reaction. Such antibodies are catalytic because 
they bind the transition state of a reaction and promote entry of the 
normal substrate into the reactive, transition-state conformation. 


13.8 Is It Possible to Design an Enzyme to Catalyze Any Desired 
Reaction? Several approaches have been taken to create designer 
enzymes individually tailored to catalyze any imaginable reaction. One 
rational approach is to begin with a known enzyme and then engineer 
it using in vitro mutagenesis to replace active-site residues with a new set 
that might catalyze the desired reaction. A second, more difficult ap- 
proach uses computer modeling to design a protein with the desired 
structure and activity. A third approach is to couple natural evolution- 
ary processes with rational design using in vitro mutagenesis. Expression 
of mutated versions of the gene encoding the enzyme in bacteria is fol- 
lowed by selection for bacteria producing an enzyme with even better 
catalytic properties. This dual approach is sometimes called semira- 
tional design, because it relies on the rational substitution of certain 
amino acids with new ones in the active site, followed by directed evo- 
lution. Active-site engineering and site-directed mutation have been 
used to modify an epoxide hydrolase so that it now acts on chlorinated 
epoxides, substances that are serious pollutants in water resources. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 
1. According to the Michaelis-Menten equation, what is the v/ Vmax 
ratio when [S] = 4 K,,? 


2. If Vinax = 100 wmol/mL : sec and K,, = 2 mM, what is the velocity of 
the reaction when [S] = 20 mM? 

3. For a Michaelis-Menten reaction, kj = 7 X 107/M~ sec, k- = 
1 X 10°/sec, and ky = 2 X 10*/sec. What are the values of Ks and 
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K„? Does substrate binding approach equilibrium, or does it be- 
have more like a steady-state system? 

. The following kinetic data were obtained for an enzyme in the ab- 
sence of any inhibitor (1), and in the presence of two different in- 
hibitors (2) and (3) at 5 mM concentration. Assume [E7] is the 
same in each experiment. 


(1) (2) (3) 
[S] v (umol/ v (umol/ v (umol/ 
(mM) mL : sec) mL : sec) mL : sec) 
1 12 4.3 5.5 
2 20 8 9 
4 29 14 13 
8 35 21 16 
12 40 26 18 


Graph these data as Lineweaver-Burk plots and use your graph to find 
answers to a. and b. 

a. Determine Vina, and K,, for the enzyme. 

b. Determine the type of inhibition and the K; for each inhibitor. 

. Using Figure 13.7 as a model, draw curves that would be obtained 
in vversus [S] plots when 

. twice as much enzyme is used. 

. half as much enzyme is used. 

. a competitive inhibitor is added. 

. a pure noncompetitive inhibitor is added. 

. an uncompetitive inhibitor is added. 

For each example, indicate how Vmax and K,, change. 


ohana ap 


. The general rate equation for an ordered, single-displacement reac- 
tion where A is the leading substrate is 

Vinax LA] [B] 
~ (KKP + K,A[B] + KPIA] + [A][B]) 





v 


Write the Lineweaver-Burk (double-reciprocal) equivalent of this 

equation and from it calculate algebraic expressions for the following: 

a. The slope 

b. The y-intercepts 

c. The horizontal and vertical coordinates of the point of intersec- 
tion when 1/v is plotted versus 1/[B] at various fixed concentra- 
tions of A 

. The following graphical patterns obtained from kinetic experiments 

have several possible interpretations depending on the nature of the 

experiment and the variables being plotted. Give at least two possi- 

bilities for each. 
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8. 


10. 


11. 


Liver alcohol dehydrogenase (ADH) is relatively nonspecific and will 
oxidize ethanol or other alcohols, including methanol. Methanol 
oxidation yields formaldehyde, which is quite toxic, causing, among 
other things, blindness. Mistaking it for the cheap wine he usually 
prefers, my dog Clancy ingested about 50 mL of windshield washer 
fluid (a solution 50% in methanol). Knowing that methanol would 
be excreted eventually by Clancy’s kidneys if its oxidation could be 
blocked, and realizing that, in terms of methanol oxidation by ADH, 
ethanol would act as a competitive inhibitor, I decided to offer 
Clancy some wine. How much of Clancy’s favorite vintage (12% 
ethanol) must he consume in order to lower the activity of his ADH 
on methanol to 5% of its normal value if the K,, values of canine 
ADH for ethanol and methanol are | millimolar and 10 millimolar, 
respectively? (The K; for ethanol in its role as competitive inhibitor 
of methanol oxidation by ADH is the same as its K,,.) Both the 
methanol and ethanol will quickly distribute throughout Clancy’s 
body fluids, which amount to about 15 L. Assume the densities of 
50% methanol and the wine are both 0.9 g/mL. 


. Measurement of the rate constants for a simple enzymatic reaction 


obeying Michaelis—Menten kinetics gave the following results: 

kı = 2 X 108 M7! sec! 

kı = 1 X 10° sec™! 

kə = 5 X 10° sec"! 

a. What is Ks, the dissociation constant for the enzyme-substrate 
complex? 

. What is Km, the Michaelis constant for this enzyme? 

. What is ke (the turnover number) for this enzyme? 

. What is the catalytic efficiency (kat/ Km) for this enzyme? 

. Does this enzyme approach “kinetic perfection”? (That is, does 
ke /K,, approach the diffusion-controlled rate of enzyme associa- 
tion with substrate?) 

f. Ifa kinetic measurement was made using 2 nanomoles of enzyme 
per mL and saturating amounts of substrate, what would Vmax 
equal? 

. Again, using 2 nanomoles of enzyme per mL of reaction mixture, 
what concentration of substrate would give v = 0.75 Vinax? 

h. Ifa kinetic measurement was made using 4 nanomoles of enzyme 

per mL and saturating amounts of substrate, what would Vmax 
equal? What would K,, equal under these conditions? 


conne 
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Triose phosphate isomerase catalyzes the conversion of glyceralde- 
hyde-3-phosphate to dihydroxyacetone phosphate. 





Glyceraldehyde-3-P == dihydroxyacetone-P 


The K,, of this enzyme for its substrate glyceraldehyde-3-phosphate 
is 1.8 X 107” M. When [glyceraldehydes-3-phosphate] = 30 uM, the 
rate of the reaction, v, was 82.5 umol mL! sec™!. 

a. What is Vinax for this enzyme? 

b. Assuming 3 nanomoles per mL of enzyme was used in this 
experiment ([E, a1] = 3 nanomol/mL), what is kea for this 
enzyme? 

c. What is the catalytic efficiency (ka/K,,) for triose phosphate 
isomerase? 

d. Does the value of ka/Km reveal whether triose phosphate iso- 
merase approaches “catalytic perfection”? 

e. What determines the ultimate speed limit of an enzyme-catalyzed 
reaction? That is, what is it that imposes the physical limit on 
kinetic perfection? 

The citric acid cycle enzyme fumarase catalyzes the conversion of 

fumarate to form malate. 





Fumarate + HO === malate 


The turnover number, keas for fumarase is 800/sec. The K„ of 

fumarase for its substrate fumarate is 5 wM. 

a. In an experiment using 2 nanomole/L of fumarase, what is 
Vmax? 

b. The cellular concentration of fumarate is 47.5 uM. What is vwhen 
[fumarate] = 47.5 uM? 

c. What is the catalytic efficiency of fumarase? 

d. Does fumarase approach “catalytic perfection”? 


12. Carbonic anhydrase catalyzes the hydration of CO»: 
CO, + HO == HCO; 


The K,, of carbonic anhydrase for CO, is 12 mM. Carbonic anhy- 
drase gave an initial velocity v = 4.5 umol HCO; formed/mL : sec 
when [CO.] = 36 mM. 
a. What is Vinax for this enzyme? 
b. Assuming 5 pmol/mL (5 X 107"? moles/mL) of enzyme were 
used in this experiment, what is k.,, for this enzyme? 

c. What is the catalytic efficiency of this enzyme? 
d. Does carbonic anhydrase approach “catalytic perfection”? 

13. Acetylcholinesterase catalyzes the hydrolysis of the neurotransmit- 
ter acetylcholine: 


Acetylcholine + H:O—> acetate + choline 


The K,, of acetylcholinesterase for its substrate acetylcholine is 
9 xX 10™ M. In a reaction mixture containing 5 nanomoles/mL of 
acetylcholinesterase and 150 uM acetylcholine, a velocity v = 
40 umol/mL - sec was observed for the acetylcholinesterase reaction. 
a. Calculate Vmax for this amount of enzyme. 
b. Calculate ke for acetylcholinesterase. 
c. Calculate the catalytic efficiency (Kea/K») for acetylcholinesterase. 
d. Does acetylcholinesterase approach “catalytic perfection”? 

14. The enzyme catalase catalyzes the decomposition of hydrogen per- 
oxide: 

2 HO; = —2 HO + Og 

The turnover number (kea) for catalase is 40,000,000 sec”!. The K,, 
of catalase for its substrate HO» is 0.11 M. 
a. In an experiment using 3 nanomole/L of catalase, what is Vmax? 
b. What is v when [H09] = 0.75 M? 
c. What is the catalytic efficiency of fumarase? 
d. Does catalase approach “catalytic perfection”? 
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15. Equation 13.9 presents the simple Michaelis-Menten situation where 
the reaction is considered to be irreversible ([P] is negligible). Many 
enzymatic reactions are reversible, and P does accumulate. 

a. Derive an equation for v, the rate of the enzyme-catalyzed reaction 
S—P in terms of a modified Michaelis-Menten model that incor- 
porates the reverse reaction that will occur in the presence of 
product, P. 

b. Solve this modified Michaelis-Menten equation for the special sit- 
uation when v = 0 (that is, S=—P is at equilibrium, or in other 
words, Ke4 = [P]/[S]). 

(J. B. S. Haldane first described this reversible Michaelis-Menten 

modification, and his expression for Keq in terms of the modified 

M-M equation is known as the Haldane relationship.) 


Preparing for the MCAT Exam 
16. Enzyme A follows simple Michaelis-Menten kinetics. 

a. The K,,of enzyme A for its substrate S is K,°=1 mM. Enzyme A 
also acts on substrate T and its K,'=10 mM. Is S or T the preferred 
substrate for enzyme A? 

b. The rate constant ky with substrate S is 2 X 10* sec™!; with sub- 
strate T, ko = 4 X 10°sec™!. Does enzyme A use substrate S or sub- 
strate T with greater catalytic efficiency? 

17. Use Figure 13.12 to answer the following questions. 

a. Is the enzyme whose temperature versus activity profile is shown 
in Figure 13.12 likely to be from an animal or a plant? Why? 

b. What do you think the temperature versus activity profile for an 
enzyme from a thermophilic prokaryote growing in an 80°F pool 
of water would resemble? 
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Mechanisms of Enzyme 
Action 





ESSENTIAL QUESTION 


Although the catalytic properties of enzymes may seem almost magical, it is simply 
chemistry—the breaking and making of bonds—that gives enzymes their prowess. 
This chapter will explore the unique features of this chemistry. The mechanisms of 
thousands of enzymes have been studied in at least some detail. In this chapter, it will 
be possible to examine only a few of these. 


What are the universal chemical principles that influence the mechanisms of 
enzymes and allow us to understand their enormous catalytic power? 


14l What Are the Magnitudes of Enzyme-Induced 
Rate Accelerations? 


Enzymes are powerful catalysts. Enzyme-catalyzed reactions are typically 107 to 10! 
times faster than their uncatalyzed counterparts (Table 14.1). The most impressive 
reaction acceleration known is that of fructose-1,6-bisphosphatase, an enzyme found 
in liver and kidneys that is involved in the synthesis of glucose (see Chapter 22). 

These large rate accelerations correspond to substantial changes in the free energy 
of activation for the reaction in question. The urease reaction, for example, 


| 
H,sN—C—NH, + 2 HO + H* — > 2NH,* + HCO,” 


shows an energy of activation 84 kJ/mol smaller than that for the corresponding un- 
catalyzed reaction. To fully understand any enzyme reaction, it is important to ac- 
count for the rate acceleration in terms of the structure of the enzyme and its mech- 
anism of action. 

In all chemical reactions, the reacting atoms or molecules pass through a state 
that is intermediate in structure between the reactant(s) and the product(s). Con- 
sider the transfer of a proton from a water molecule to a chloride anion: 


= H—O®* ..-H---cl> = HO +H—-Cl 
Products 


H—O—H +C 


Reactants Transition state 


In the middle structure, the proton undergoing transfer is shared equally by the hy- 
droxyl and chloride anions. This structure represents, as nearly as possible, the tran- 
sition between the reactants and products, and it is known as the transition state.! 
All transition states contain at least one partially formed bond. 

Linus Pauling was the first to suggest (in 1946) that the active sites of enzymes bind 
the transition state more readily than the substrate and that, by doing so, they stabi- 
lize the transition state and lower the activation energy of the reaction. Many subse- 
quent studies have shown that this idea is essentially correct, but it is just the begin- 
ning in understanding what enzymes do. Reaction rates can also be accelerated by 
destabilizing (raising the energy of) the enzyme-substrate complex. Chemical groups 
arrayed across the active site actually guide the entering substrate toward the forma- 
tion of the transition state. Thus, the enzyme active site is said to be “preorganized.” 


lIt is important to distinguish transition states from intermediates. A transition state is envisioned as 
an extreme distortion of a bond, and thus the lifetime of a typical transition state is viewed as being on 
the order of the lifetime of a bond vibration, typically 107! sec. Intermediates, on the other hand, are 
longer lived, with lifetimes in the range of 107!° to 107° sec. 


David W. Grisham 





Like the workings of machines, the details of enzyme 
mechanisms are at once complex and simple. 





No single thing abides but all things flow. 

Fragment to fragment clings and thus they grow 

Until we know them by name. 

Then by degrees they change and are no more the 
things we know. 


Lucretius (ca. 94 B.C.—50 B.C.) 
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TABLE 14.1 











Uncatalyzed Catalyzed 
Rate, vu Rate, ve 
Reaction Enzyme (sec”') (sec”') Vel Vy 
Fructose-1,6-bisP —— fructose-6-P + P; Fructose-1,6-bisphosphatase 2x 107% 21 1.05 x 10?! 
(Glucose) , + HO —> (glucose) „-2 + maltose B-amylase 1.9 x 107" 1.4 x 108 7.2 X 10!7 
DNA, RNA cleavage Staphylococcal nuclease TX 10716 95 1.4 x 10!” 
CH,—O—PO,” + HO —> CH;OH + HPO,” Alkaline phosphatase 1 x 107” 14 1.4 x 1016 
HN I NH, + 2H,O + Ht —> 2 NH,” + HCO,” Urease 3 x 10-19 3 x 10 1x10 
O 
RC- O— CHCH; + HO —> RCOOH + HOCH,CH; Chymotrypsin 1 X 107” 1 x 10° 1x10" 
Glucose + ATP — > Glucose-6-P + ADP Hexokinase <1 x 10-8 1.3 x 1073 >1.3 X 10! 
O 
CHCH OH + NAD* —> cu, du + NADH + H* Alcohol dehydrogenase <6x10°% 27X10 >4.5 x 10° 
CO, + HO —> HCO,- + H* Carbonic anhydrase 10-2 105 1 x 107 
Creatine + ATP —> Cr-P + ADP Creatine kinase <3 x 107° 4x107% >1.33 x 104 











Adapted from Koshland, D., 1956. Molecular geometry in enzyme action. Journal of Cellular Comparative Physiology, Supp. 1, 47:217; and Wolfenden, R., 2006. Degrees of difficulty of water- 
consuming reactions in the absence of enzymes. Chemical Reviews 106:3379-3396. 


Enzymes are dynamic, and fluctuations in the active-site structure are presumed to or- 
ganize the initial enzyme-substrate complex into a reactive conformation and on to 
the transition state. Along the way, electrostatic and hydrophobic interactions be- 
tween the enzyme and the substrate mediate and direct these changes that make the 
reaction possible. Often, catalytic groups provided by the enzyme participate directly 
in proton transfers and other bond-making and bond-breaking events. 

This chapter describes and elaborates on each of these contributions to the cat- 
alytic prowess of enzymes and then illustrates the lessons learned by looking closely 
at the mechanisms of three well-understood enzymes. 


14:25 What Role Does Transition-State Stabilization Play 
in Enzyme Catalysis? 


Chemical reactions in which a substrate (S) is converted to a product (P) can be pic- 
tured as involving a transition state (which we henceforth denote as Xt), a species in- 
termediate in structure between S and P (Figure 14.1). As seen in Chapter 13, the 


(b) 


Free energy, G 


FIGURE 14.1 Enzymes catalyze reactions 
by lowering the activation energy. Here 
the free energy of activation for (a) the 
uncatalyzed reaction, AG,*, is larger than 
that for (b) the enzyme-catalyzed 

reaction, AG. S ——> RK mH E+S <> ES = Ext = E+P 





Reaction coordinate ——> 
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catalytic role of an enzyme is to reduce the energy barrier between substrate and 
transition state. This is accomplished through the formation of an enzyme-substrate 
complex (ES). This complex is converted to product by passing through a transition 
state, EX? (Figure 14.1). As shown, the energy of EX? is clearly lower than that of Xt. 
One might be tempted to conclude that this decrease in energy explains the rate en- 
hancement achieved by the enzyme, but there is more to the story. 

The energy barrier for the uncatalyzed reaction (Figure 14.1) is of course the 
difference in energies of the S and X? states. Similarly, the energy barrier to be sur- 
mounted in the enzyme-catalyzed reaction, assuming that E is saturated with S, is the 
energy difference between ES and EX?. Reaction rate acceleration by an enzyme means sim- 
ply that the energy barrier between ES and EX? is less than the energy barrier between S and X?. 
In terms of the free energies of activation, A G4 < AG,}. 

There are important consequences for this statement. The enzyme must stabilize 
the transition-state complex, EX!, more than it stabilizes the substrate complex, ES. 
Put another way, enzymes bind the transition-state structure more tightly than the 
substrate (or the product). The dissociation constant for the enzyme-substrate 
complex is 


[E][S] 
Ks = — 14.1 
s [ES] (14.1) 
and the corresponding dissociation constant for the transition-state complex is 
ae [E][X*] T 


Enzyme catalysis requires that Kr < Ks. According to transition-state theory (see refer- 
ences at the end of this chapter), the rate constants for the enzyme-catalyzed (ke) 
and uncatalyzed (kų) reactions can be related to Ks and Kr by 


ke/ ku = Ks/Kr (14.3) 


Thus, the enzymatic rate enhancement is approximately equal to the ratio of the 
dissociation constants of the enzyme-substrate and enzyme-transition-state com- 
plexes, at least when E is saturated with S. 


14:35 How Does Destabilization of ES Affect Enzyme 
Catalysis? 


How is it that X? is stabilized more than S at the enzyme active site? To understand 
this, we must dissect and analyze the formation of the enzyme-substrate complex, 
ES. There are a number of important contributions to the free energy difference 
between the uncomplexed enzyme and substrate (E + S) and the ES complex 
(Figure 14.2). The favorable interactions between the substrate and amino acid 
residues on the enzyme account for the intrinsic binding energy, A G. The intrinsic 
binding energy ensures the favorable formation of the ES complex, but if uncom- 
pensated, it makes the activation energy for the enzyme-catalyzed reaction unnec- 
essarily large and wastes some of the catalytic power of the enzyme. 

Compare the two cases in Figure 14.3. Because the enzymatic reaction rate is de- 
termined by the difference in energies between ES and EX}, the smaller this differ- 
ence, the faster the enzyme-catalyzed reaction. Tight binding of the substrate deep- 
ens the energy well of the ES complex and actually lowers the rate of the reaction. 

The message of Figure 14.3 is that raising the energy of ES will increase the 
enzyme-catalyzed reaction rate. This is accomplished in two ways: (1) loss of 
entropy due to the binding of S to E and (2) destabilization of ES by strain, distor- 
tion, desolvation, or other similar effects. The entropy loss arises from the formation 
of the ES complex (Figure 14.4), a highly organized (low-entropy) entity compared to 
E + Sin solution (a disordered, high-entropy situation). Because A Sis negative for this 
process, the term — TA Sis a positive quantity, and the intrinsic binding energy of ES is com- 
pensated to some extent by the entropy loss that attends the formation of the complex. 





Reaction coordinate 


FIGURE 14.2 The intrinsic binding energy of the 
enzyme-substrate (ES) complex (AG,) is compensated 
to some extent by entropy loss due to the binding of 
E and S (TAS) and by destabilization of ES (AGa) by 
strain, distortion, desolvation, and similar effects. If AG, 
were not compensated by TAS and AGa, the formation 
of ES would follow the dashed line. 
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FIGURE 14.3 (a) Catalysis does not occur if the ES com- 
plex and the transition state for the reaction are stabi- 
lized to equal extents. (b) Catalysis will occur if the tran- 
sition state is stabilized to a greater extent than the ES 
complex (right). Entropy loss and destabilization of the 
ES complex AGa ensure that this will be the case. 


CENGAGENOW" ACTIVE FIGURE 14.4 (a) Formation of 
the ES complex results in entropy loss. Before binding, 
E and S are free to undergo translational and rotational 
motion. The ES complex is a more highly ordered, low- 
entropy complex. (b) Substrates typically lose waters of 
hydration in the formation of the ES complex. Desolva- 
tion raises the energy of the ES complex, making it 
more reactive. (c) Electrostatic destabilization of a sub- 
strate may arise from juxtaposition of like charges in the 
active site. If such charge repulsion is relieved in the 
course of the reaction, electrostatic destabilization can 
result in a rate increase. Test yourself on the concepts 
in this figure at www.cengage.com/login. 


(b) 





No destabilization, Destabilization of ES 
thus no catalysis facilitates catalysis 


Destabilization of the ES complex can involve structural strain, desolvation, or 
electrostatic effects. Destabilization by strain or distortion is usually just a conse- 
quence of the fact (noted previously) that the enzyme is designed to bind the transition 
state more strongly than the substrate. When the substrate binds, the imperfect nature 
of the “fit” results in distortion or strain in the substrate, the enzyme, or both. 


(a) 






Substrate (and enzyme) are free The highly ordered, low-entropy complex 


to undergo translational motion. 
A disordered, high-entropy situation 


(b) 


er 


Solvation shell 


(c) 


Substrate 





Electrostatic destabilization 
in ES complex 
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Solvation of charged groups on a substrate in solution releases energy, making 
the charged substrate more stable. When a substrate with charged groups moves 
from water into an enzyme active site (Figure 14.4), the charged groups are often 
desolvated to some extent, becoming less stable and therefore more reactive. 

Similarly, when a substrate enters the active site, charged groups may be forced 
to interact (unfavorably) with charges of like sign, resulting in electrostatic 
destabilization (Figure 14.4). The reaction pathway acts in part to remove this 
stress. If the charge on the substrate is diminished or lost in the course of 
reaction, electrostatic destabilization can result in rate acceleration. 

Whether by strain, desolvation, or electrostatic effects, destabilization raises the 
energy of the ES complex, and this increase is summed in the term A Ga, the free 
energy of destabilization (Figures 14.2 and 14.3). 


14:4: How Tightly Do Transition-State Analogs Bind 
to the Active Site? 


Although not apparent at first, there are other important implications of Equation 
14.3. It is important to consider the magnitudes of Ks and Ky. The ratio ke/ku 
may even exceed 10!°, as noted previously. Given a ratio of 10!° and a typical Ks of 
10~* M, the value of Ky should be 107° M. The value of Ky for fructose-1,6-bisphos- 
phatase (see Table 14.1) is an astounding 7 X 10~°° M! This is the dissociation con- 
stant for the transition state from the enzyme, and this very low value corresponds to 
very tight binding of the transition state by the enzyme. 

It is unlikely that such tight binding in an enzyme transition state will ever be de- 
termined in a direct equilibrium measurement, however, because the transition 
state itself is a “moving target.” It exists only for about 107! to 107} sec, less than 
the time required for a bond vibration. On the other hand, the nature of the elu- 
sive transition state can be explored using transition-state analogs, stable molecules 
that are chemically and structurally similar to the transition state. Such molecules 
should bind more strongly than a substrate and more strongly than competitive in- 
hibitors that bear no significant similarity to the transition state. Hundreds of ex- 
amples of such behavior have been reported. For example, Robert Abeles studied a 
series of inhibitors of proline racemase (Figure 14.5) and found that pyrrole-2- 
carboxylate bound to the enzyme 160 times more tightly than L-proline, the normal 
substrate. This analog binds so tightly because it is planar and is similar in structure 
to the planar transition state for the racemization of proline. Two other examples 


Proline racemase reaction 


Ht Ht 
ay hes 
ii | 1 
H H H 
t-Proline Planar transition p-Proline 


state 


N 
| | 
H H 
Pyrrole-2-carboxylate A-1-Pyrroline-2-carboxylate 


FIGURE 14.5 The proline racemase reaction. Pyrrole-2-carboxylate and A-1-pyrroline-2-carboxylate mimic the 
planar transition state of the reaction. 
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A DEEPER LOOK 
Transition-State Analogs Make Our World Better 


Enzymes (human, plant, and bacterial) are often targets for drugs peutic agents that improve the lives of millions and millions of 
and other beneficial agents. Transition-state analogs (TSAs), with people. A few applications of transition-state analogs for human 
very high affinities for their enzyme-binding sites, often make health and for agriculture are shown here. 

ideal enzyme inhibitors, and TSAs have become ubiquitous thera- 


Enalapril and Aliskiren Lower Blood Pressure H O 
High blood pressure is a significant risk factor CH,CH,OOC Nn JC 
for cardiovascular disease, and drugs that lower H N 
blood pressure reduce the risk of heart attacks, CH, 
heart failure, strokes, and kidney disease. Blood 

pressure is partly regulated by aldosterone, a HOOC 


steroid synthesized in the adrenal cortex and re- 
leased in response to angiotensin II, a peptide 
produced from angiotensinogen in two prote- 
olytic steps by renin (an aspartic protease) and 
angiotensin-converting enzyme (ACE). Vasotec Enalapril 
(enalapril) manufactured by Merck and Biovail 
is an ACE inhibitor. Novartis and Speedel have 
developed Tekturna (aliskiren) as a renin in- 
hibitor. Both are TSAs. 





Courtesy of James Gathany/CDC 





Aliskiren 


Statins Lower Serum Cholesterol 


Statins such as Lipitor are powerful cholesterol- 
lowering drugs, because they are transition-state 
analog inhibitors of HMG-CoA reductase, a key 
enzyme in the biosynthetic pathway for choles- 
terol (discussed in Chapter 24). 





© AP Photo/Amy Sancetta, File 


Atorvastatin (Lipitor) 








Courtesy of the Otis Historical Archives/National Museum of 


Health and Medicine 


Protease Inhibitors Are AIDS Drugs 

Crixivan (indinavir) by Merck, Invirase (saquinavir) by Roche, and 
similar “protease inhibitor” drugs are transition-state analogs for 
the HIV-1 protease, discussed on pages 440-443. 


XQ u o o NH 
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ba OH 
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Saquinavir 


Tamiflu Is a Viral Neuraminidase Inhibitor 

Influenza is a serious respiratory illness that affects 5% to 15% of 
the earth’s population each year and results in 250,000 to 500,000 
deaths annually, mostly among children and the elderly. Protec- 
tion from influenza by vaccines is limited by the antigenic varia- 
tion of the influenza virus. Neuraminidase is a major glycoprotein 
on the influenza virus membrane envelope that is essential for vi- 
ral replication and infectivity. Tamiflu is a neuraminidase inhibitor 
and antiviral agent based on the transition state of the neura- 
minidase reaction. 
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Tamiflu 





A The 1918 flu pandemic killed more than 20 million people worldwide. 
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Juvenile Hormone Esterase Is a Pesticide Target 


Insects have significant effects on human health, being the trans- 
mitting agents (vectors) for diseases such as malaria, West Nile 
virus, and viral encephalitis, all carried by mosquitoes, and Lyme 
disease and Rocky Mountain spotted fever, carried by ticks. One 
strategy for controlling insect populations is to alter the actions of 
juvenile hormone, a terpene-based substance that regulates insect 
life cycle processes. Levels of juvenile hormone are controlled by 
juvenile hormone esterase (JHE), and inhibition of JHE is toxic to 
insects. OTFP (figure) is a potent transition state analog inhibitor 
of JHE. 


3-Octylthio-1,1,1-trifluoropropan-2-one (OTFP) 


© Darwin Dale/Photo Researchers, Inc. 








Courtesy of James Gathany/CDC 


How Many Other Drug Targets Might There Be? 


If the human genome contains approximately 20,000 genes, how 
many of these might be targets for drug therapy? Andrew Hopkins 
has proposed the term “druggable genome” to conceptualize the 
subset of human genes that might express proteins able to bind 
druglike molecules. The DrugBank database (http://redpoll 
-_pharmacy.ualberta.ca/drugbank) contains more than a thousand 
FDA-approved small molecule drugs. More than 300 of these are 
directed specifically to enzymes. More than 3000 experimental 
drugs are presently under study and testing. It is easy to imagine 
that thousands more drugs will eventually be developed, with 
many of these designed as transition-state analogs for enzyme 
reactions. 
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(a) Yeast aldolase reaction 
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Phosphoglycolohydroxamate 


(b) Calf intestinal adenosine deaminase reaction 
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FIGURE 14.6 (a) Phosphoglycolohydroxamate is an 
analog of the enediolate transition state of the yeast 
aldolase reaction. (b) Purine riboside, a potent inhibitor 
of the calf intestinal adenosine deaminase reaction, 
binds to adenosine deaminase as the 1,6-hydrate. The 





Hydrated form of 


hydrated form of purine riboside is an analog of the purine ribonucleoside 
proposed transition state for the reaction. CE 





of transition-state analogs are shown in Figure 14.6. Phosphoglycolohydroxamate binds 
40,000 times more tightly to yeast aldolase than the substrate dihydroxyacetone 
phosphate. Even more remarkable, the 1,6-hydrate of purine ribonucleoside has been 
estimated to bind to adenosine deaminase with a Kı of 3 X 107 M! 

It should be noted that transition-state analogs are only approximations of the 
transition state itself and will never bind as tightly as would be expected for the true 
transition state. These analogs are, after all, stable molecules and cannot be ex- 
pected to resemble a true transition state too closely. 


14:5» What Are the Mechanisms of Catalysis? 


Enzymes Facilitate Formation of Near-Attack Conformations 


Exquisite and beautiful details of enzyme active-site structure and dynamics have 
emerged from X-ray crystal structures of enzymes and computer simulations of 
molecular conformation and motion at the active site. Importantly, these studies have 
shown that the reacting atoms and catalytic groups are precisely positioned for their 
roles. This “preorganization” of the active site allows it to select and stabilize confor- 
mations of the substrate(s) in which the reacting atoms are in van der Waals contact and 
at an angle resembling the bond to be formed in the transition state. Thomas Bruice has 





A DEEPER LOOK 
How to Read and Write Mechanisms 


The custom among chemists and biochemists of writing chemical 
reaction mechanisms with electron dots and curved arrows began 
with two of the greatest chemists of the 20th century. Gilbert New- 
ton Lewis was the first to suggest that a covalent bond consists of a 
shared pair of electrons, and Sir Robert Robinson was the first to 
use curved arrows to illustrate a mechanism in a paper in the Jour- 
nal of the Chemical Society in 1922. 

Learning to read and write reaction mechanisms should begin 
with a review of Lewis dot structures in any good introductory 
chemistry text. It is also important to understand valence electrons 
and “formal charge.” The formal charge of an atom is calculated 
as the number of valence electrons minus the “electrons owned” 
by an atom. More properly 


Formal charge = group number — 
nonbonding electrons — (1/2 shared electrons) 


Students of mechanisms should also appreciate electronegativity— 
the tendency of an atom to attract electrons. Electronegativities of 
the atoms important in biochemistry go in the order 


F>O>N>COHH 


Thus, in a C-N bond, the N should be viewed as more electron- 
rich and C as electron-deficient. An electron-rich atom is termed 
nucleophilic and will have a tendency to react with an electron- 
deficient (electrophilic) atom. 

In written mechanisms, a curved arrow shows the movement of 
an electron pair from its original position to a new one. The tail of 
the arrow shows where the electron pair comes from, and the head 
of the arrow shows where the electron pair is going. Thus, the 
arrow represents the actual movement of a pair of electrons from 
a filled orbital into an empty one. By convention, an arrow with a 
full arrowhead ~~ represents movement of an electron pair, 
whereas a half arrowhead ~N represents a single electron (for ex- 
ample, in a free radical reaction). For a bond-breaking event, the 
arrow begins in the middle of the bond, and the arrowhead points 
at the atom that will accept the electrons: 


ALB 


—> A+B 

For a bond-making event, the arrow begins at the source of the 
electrons (for example, a nonbonded pair), and the arrowhead 
points to the atom where the new bond will be formed: 


At pe —> A—B 


It has been estimated that 75% of the steps in enzyme reaction 
mechanisms are proton (H+) transfers. If the proton is donated or 
accepted by a group on the enzyme, it is often convenient (and 
traditional) to represent that group as B, for “base,” even if B is 
protonated and behaving as an acid: 


BOND —> B+ HIND 
~ ~ 
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It is important to appreciate that a proton transfer can change a 
nucleophile into an electrophile, and vice versa. Thus, it is neces- 
sary to consider (1) the protonation states of substrate and 
active-site residues and (2) how pK, values can change in the en- 
vironment of the active site. For example, an active-site histidine, 
which might normally be protonated, can be deprotonated by an- 
other group and then act as a base, accepting a proton from the 
substrate: 


| a HtB— 
HN Non — > HN N: 
Zot \Z ` 
(3 
H—O0, 
CHo 
| Ser 
H*B— 
HN, ge 
TO 
“GH; 
Ser 


Water can often act as an acid or base at the active site through 
proton transfer with an assisting active-site residue: 


5 i m ia 


| 
—> HN NE + ne Ok 
OH 


These concepts provide a sense of what is reasonable and what 
makes good chemical sense in a reaction. Practice and experience 
are essential to building skills for reading and writing enzyme 
mechanisms. Excellent Web sites are available where such skills 
can be built (http://www.abdn.ac.uk/curly-arrows). 





termed such arrangements near-attack conformations (NACs), and he has proposed 
that NACs are the precursors to transition states of reactions (Figure 14.7). In the ab- 
sence of an enzyme, potential reactant molecules adopt a NAC only about 0.0001% 
of the time. On the other hand, NACs have been shown to form in enzyme active sites 
from 1% to 70% of the time. 
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FIGURE 14.7 (a) For reactions involving bonding be- 
tween O,N,C,and S atoms, NACs are characterized as 
having reacting atoms within 3.2 A and an approach 
angle of +15° of the bonding angle in the transition 
state. (b) In an enzyme active site, the enzyme- 
substrate complex and the NAC are separated by a 
small energy barrier, and NACs form readily. In the ab- 
sence of the enzyme, the energy gap between the sub- 
strate and the NAC is much greater and NACs are rarely 
formed. The energy separation between the NAC and 
the transition state is approximately the same in the 
presence and absence of the enzyme. (Adapted from 
Bruice, T., 2002. A view at the millennium: The efficiency 
of enzymatic catalysis. Accounts of Chemical Research 
35:139-148.) 





FIGURE 14.8 The complex of horse liver ADH with ben- 
zyl alcohol illustrates the approach to a near-attack con- 
formation. Computer simulations by Bruice and co-work- 
ers show that the side-chain oxygen of Ser*® approaches 
within 1.8A of the hydroxyl hydrogen of the substrate, 
benzyl alcohol, and that the pro-R hydrogen of benzyl 
alcohol lies 2.75 A from the C-4 carbon of the nicoti- 
namide ring. The reaction mechanism involves hydroxyl 
proton abstraction by Ser*® and hydride transfer from 
the substrate to C-4 of the NAD* nicotinamide ring 

(pdb id = 1HLD). 


(b) 





Free energy, G 





Reaction coordinate 


The alcohol dehydrogenase (ADH) reaction provides a good example of a NAC 
on the pathway to the reaction transition state (Figure 14.8). The ADH reaction con- 
verts a primary alcohol to an aldehyde (through an ordered, single-displacement 
mechanism, see page 406). The reaction proceeds by a proton transfer to water fol- 
lowed by a hydride transfer to NAD*. In the enzyme active site, Ser’ accepts the pro- 
ton from the alcohol substrate, the resulting negative charge is stabilized by a zinc 
ion, and the substrate pro-R hydrogen is poised above the NAD* ring prior to hydride 
transfer (Figure 14.8). Computer simulations of the enzyme-substrate complex in- 
volving the deprotonated alcohol show that this intermediate exists as a NAC 60% of 
the time. The kinetic advantage of such an enzymatic reaction, compared to its 
nonezymatic counterpart, is the ease of formation of the NAC and the favorable free energy 
difference between the NAC and the transition state (Figure 14.7). 


Protein Motions Are Essential to Enzyme Catalysis Proteins are constantly moving. 
As noted in Chapter 6 (Table 6.2), bonds vibrate, side chains bend and rotate, back- 
bone loops wiggle and sway, and whole domains move with respect to each other. En- 
zymes depend on such motions to provoke and direct catalytic events. Protein mo- 
tions may support catalysis in several ways: Active site conformation changes can 


assist substrate binding 

bring catalytic groups into position around a substrate 
induce formation of a NAC 

assist in bond making and bond breaking 

facilitate conversion of substrate to product 
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(a) @ 


FIGURE 14.9 (a) Human cyclophilin A is a prolyl 
isomerase, which catalyzes the interconversion be- 
tween trans and cis conformations of proline in pep- 
tides. (b) The active site of cyclophilin with a bound 
peptide containing proline in cis and trans conforma- 
tions (pdb id = 1RMH). 





A good example of protein motions facilitating catalysis is human cyclophilin A, 
which catalyzes the interconversion between cis and trans conformations of proline 
in peptides (Figure 14.9). NMR studies of cyclophilin A have provided direct mea- 
surements of the active-site motions occurring in this enzyme. Certain active-site 
residues (Lys**, Leu, Ser®, Ala!®!, GiIn!, Ala!, and Gly!) of the enzyme un- 
dergo conformation changes during substrate binding, whereas Arg®? is involved di- 
rectly in the cis-to-trans interconversion itself (Figure 14.10). 

The protein motions that assist catalysis may be quite complex. Stephen 
Benkovic and Sharon Hammes-Schiffer have characterized an extensive network of 
coupled protein motions in dihydrofolate reductase. This network extends from the 
active site to the surface of the protein, and the motions in this network span time 
scales of femtoseconds (107! sec) to milliseconds. Such extensive networks of mo- 
tion make it likely that the entire folded structure of the protein may be involved in 
catalysis at the active site. 


45 


35 


FIGURE 14.10 Catalysis in enzyme active sites depends 
on motion of active-site residues. NMR studies by 
Dorothee Kern and her co-workers show that several 
cyclophilin active-site residues, including Arg® (red dot) 


15 | and Lys®, Leu’, Ser??, Ala!®!, Gln! Ala!, and Gly! 
4 


25 


Relative motion 


(green dots), undergo greater motion during catalysis 
than residues elsewhere in the protein. (Adapted from 


J} ] -]- Eisenmesser, E., et al., 2002. Enzyme dynamics during 
0 60 80 100 120 catalysis. Science 295: 1520-1523.) 


Residue 
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FIGURE 14.11 Examples of covalent bond formation 
between enzyme and substrate. In each case, a nucleo- 
philic center (X:) on an enzyme attacks an electrophilic 
center on a substrate. 


Covalent Catalysis 


Some enzyme reactions derive much of their rate acceleration from the formation of 
covalent bonds between enzyme and substrate. Consider the reaction: 


BX + Y—>BY+ X 


and an enzymatic version of this reaction involving formation of a covalent 
intermediate: 


BX + Enz—>E:B + X + Y—>Enz + BY 


If the enzyme-catalyzed reaction is to be faster than the uncatalyzed case, the ac- 
ceptor group on the enzyme must be a better attacking group than Y and a better 
leaving group than X. Note that most enzymes that carry out covalent catalysis have 
ping-pong kinetic mechanisms. 

The side chains of amino acids in proteins offer a variety of nucleophilic centers 
for catalysis, including amines, carboxylates, aryl and alkyl hydroxyls, imidazoles, 
and thiol groups. These groups readily attack electrophilic centers of substrates, 
forming covalently bonded enzyme-substrate intermediates. Typical electrophilic 
centers in substrates include phosphoryl groups, acyl groups, and glycosyl groups 
(Figure 14.11). The covalent intermediates thus formed can be attacked in a sub- 
sequent step by a water molecule or a second substrate, giving the desired product. 
Covalent electrophilic catalysis is also observed, but it usually involves coenzyme 
adducts that generate electrophilic centers. Hundreds of enzymes are now known 
to form covalent intermediates during catalysis. Several examples of covalent catal- 
ysis will be discussed in detail in later chapters, as noted in Table 14.2. 


General Acid-Base Catalysis 


Nearly all enzyme reactions involve some degree of acid or base catalysis. There are 
two types of acid-base catalysis: (1) specific acid-base catalysis, in which the reac- 
tion is accelerated by H* or OH- diffusing in from the solution, and (2) general 
acid-base catalysis, in which H* or OH7™ is created in the transition state by another 
molecule or group, which is termed the general acid or general base, respectively. 
By definition, general acid-base catalysis is catalysis in which a proton is transferred in the 
transition state. Consider the hydrolysis of p-nitrophenylacetate by specific base catal- 
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TABLE 14.2 | 

Enzyme Reacting Group Covalent Intermediate 
Trypsin Serine Acyl-Ser 
Chymotrypsin (pages 434-439) 

Glyceraldehyde-3-P dehydrogenase Cysteine Acyl-Cys 
(page 547) 

Phosphoglucomutase (page 447) Serine Phospho-Ser 
Phosphoglycerate mutase (page 548) Histidine Phospho-His 
Succinyl-CoA synthetase (page 576) 

Aldolase (page 545) Lysine and other 

Pyridoxal phosphate enzymes amino groups Schiff base 
(pages 408, 782, and 807) 











ysis or with imidazole acting as a general base (Figure 14.12). In the specific base 
mechanism, hydroxide diffuses into the reaction from solution. In the general base 
mechanism, the hydroxide that catalyzes the reaction is generated from water in the 
transition state. The water has been made more nucleophilic without generation of 
a high concentration of OH- or without the formation of unstable, high-energy 
species. General acid or general base catalysis may increase reaction rates 10- to 
100-fold. In an enzyme, ionizable groups on the protein provide the H* transferred 
in the transition state. Clearly, an ionizable group will be most effective as an H* 
transferring agent at or near its pKa. Because the pK, of the histidine side chain is 
near 7, histidine is often the most effective general acid or base. Descriptions of sev- 
eral cases of general acid-base catalysis in typical enzymes follow. 


Low-Barrier Hydrogen Bonds 


As previously noted, the typical strength of a hydrogen bond is 10 to 30 kJ/mol. For 
an O—H—O hydrogen bond, the O—O separation is typically 0.28 nm and the 
H bond is a relatively weak electrostatic interaction. The hydrogen is firmly linked 
to one of the oxygens at a distance of approximately 0.1 nm, and the distance to the 
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FIGURE 14.12 Catalysis of p-nitrophenylacetate hydrolysis can occur either by specific base hydrolysis (where 
hydroxide from the solution is the attacking nucleophile) or by general base catalysis (in which a base like 
imidazole can promote hydroxide attack on the substrate carbonyl carbon by removing a proton from a 
nearby water molecule). 


o- + no \—no, + Ht 
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Bond order refers to the number of electron 
pairs in a bond. (For a single bond, the bond 
order is 1.) 


Hydrogen tunneling: a hydrogen transfer reac- 
tion that occurs through, rather than over, a 
thermodynamic barrier. 


(a) (b) (c) 
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FIGURE 14.13 Comparison of conventional (weak) hydrogen bonds (a) and low-barrier hydrogen bonds (b and 
c). The horizontal line in each case is the zero-point energy of hydrogen. (a) shows an O—H—O hydrogen bond 
of length 0.28 nm, with the hydrogen attached to one or the other of the oxygens. The bond order for the 
stronger O—H interaction is approximately 1.0, and the weaker O—H interaction is 0.07. As the O-O distance de- 
creases, the hydrogen bond becomes stronger, and the bond order of the weakest interaction increases. In (b), 
the O-O distance is 0.25 nm, and the barrier is equal to the zero-point energy. In (c), the O-O distance is 0.23 to 
0.24 nm, and the bond order of each OH interaction is 0.5. 


other oxygen is thus about 0.18 nm, which corresponds to a bond order of about 
0.07. Not all hydrogen bonds are weak, however. As the distance between hetero- 
atoms becomes smaller, the overall bond becomes stronger, the hydrogen becomes 
centered, and the bond order approaches 0.5 for both O—H interactions (Figure 
14.13). These interactions are more nearly covalent in nature, and the stabilization 
energy is much higher. Notably, the barrier that the hydrogen atom must surmount 
to exchange oxygens becomes lower as the O—O separation decreases (Figure 
14.13). When the barrier to hydrogen exchange has dropped to the point that it is 
at or below the zero-point energy level of hydrogen, the interaction is referred to as 
a low-barrier hydrogen bond (LBHB). The hydrogen is now free to move anywhere 
between the two oxygens (or, more generally, two heteroatoms). The stabilization 
energy of LBHBs may approach 100 kJ/mol in the gas phase and 60 kJ/mol or more 
in solution. LBHBs require matched pK,s for the two electronegative atoms that 
share the hydrogen. As the two pK, values diverge, the stabilization energy of the 
LBHB is decreased. Widely divergent pK, values thus correspond to ordinary, weak 
hydrogen bonds. 

How may low-barrier hydrogen bonds affect enzyme catalysis? A weak hydrogen 
bond in an enzyme ground state may become an LBHB in a transient intermediate, 
or even in the transition state for the reaction. In such a case, the energy released in 
forming the LBHB is used to help the reaction that forms it, lowering the activation 
barrier for the reaction. Alternatively, the purpose of the LBHB may be to redistrib- 
ute electron density in the reactive intermediate, achieving rate acceleration by fa- 
cilitation of “hydrogen tunneling.” Enzyme mechanisms that will be examined later 
in this chapter (the serine proteases and aspartic proteases) appear to depend upon 
one or the other of these effects. 


Metal lon Catalysis 


Many enzymes require metal ions for maximal activity. If the enzyme binds the 
metal very tightly or requires the metal ion to maintain its stable, native state, it is 
referred to as a metalloenzyme. Enzymes that bind metal ions more weakly, perhaps 
only during the catalytic cycle, are referred to as metal activated. One role for met- 
als in metal-activated enzymes and metalloenzymes is to act as electrophilic catalysts, 
stabilizing the increased electron density or negative charge that can develop dur- 
ing reactions. Among the enzymes that function in this manner (Figure 14.14) is 
thermolysin. Another potential function of metal ions is to provide a powerful 
nucleophile at neutral pH. Coordination to a metal ion can increase the acidity of 
a nucleophile with an ionizable proton: 


M?* + NucH == M?* (NucH) == M?? (Nuc’) + H+ 





The reactivity of the coordinated, deprotonated nucleophile is typically intermedi- 
ate between that of the un-ionized and ionized forms of free nucleophile. Carboxy- 
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How Do Active-Site Residues Interact to Support Catalysis? 


Only about half of the common amino acid residues (that is, His, 
Cys, Asp, Glu, Arg, Lys, Tyr, Ser, Thr, Asn, and Gln) engage directly 
in catalytic effects in enzyme active sites. These polar and charged 
residues provide a relatively limited range of catalytic capabilities. 
They can act as nucleophiles, facilitate substrate binding, and sta- 
bilize transition states. It has been estimated that up to 75% of the 
steps in enzyme mechanisms involve a simple proton transfer. Is 
this enough to explain the dramatic catalytic power of enzymes? 
Or might there be other phenomena at work? 

Janet Thornton and Alex Gutteridge have analyzed residue in- 
teractions at the active sites of 191 different enzymes. In this group 
of enzymes, each polar catalytic residue interacts with (on aver- 
age) 2.3 other polar residues in the active site, whereas noncat- 
alytic, buried polar residues have, on average, interactions with 
only 1.2 other polar residues. This suggests that some of the inter- 
actions between catalytic and noncatalytic residues are functional 
in some way. At the same time, in only 88 of the enzymes does the 
key catalytic residue have a direct interaction with a second cat- 
alytic residue, indicating that most catalytic residues do not re- 
quire direct interactions with other catalytic residues to be active. 

The catalytic capacities of polar and charged residues can be in- 
fluenced by other polar and charged residues at the active site and 
even by hydrophobic residues. The so-called secondary, or non- 
catalytic, residues at the active site play interesting roles: 


e Raising or lowering catalytic residue pK, values through electrosta- 
tic or hydrophobic interactions. In aldoketoreductase, an 
Asp-Lys pair facilitates general acid-base catalysis, with Lyst 
lowering the pK, of Tyr”! so that it can donate a proton to the 
substrate. On the other hand, nearby hydrophobic residues 
can provide a nonpolar environment that tends to raise the 
pKa values of acidic residues (such as Asp or Glu) and to lower 
the pK, values of basic residues (such as lysine and arginine). 
Hydrophobic environments can change pK, values by as much 
as 5 or 6 pH units. 


© Charge stabilization, as will be seen in chorismate mutase, where 


active-site arginines stabilize negatively charged carboxyl 
groups on the substrate (see Figures 14.31 and 14.33). 

Proton transfers via hydrogen tunneling. In such quantum me- 
chanical tunneling, the proton transfer is accomplished by 
molecular motions that lead to degeneracy of a pair of local- 
ized proton vibrational states (Figure 14.13). Proton tunneling 
can be facilitated by nearby molecular motions of secondary 
residues coupled to the motion and vibration of the bonds in 
question. David Leys has shown that aromatic amine dehydro- 
genase probably accomplishes catalysis by coupling local mo- 
tions (of two secondary residues, C!”! and T!”) to the vibra- 
tional states involved in a proton transfer reaction with D'S, as 
shown here. 


Oxidized Trp! 
(cofactor) 


Closeup of the crystal structure of aromatic amine dehydrogenase, 
showing the relationship of Asp!**, Thr!”, and Cys!7!. N atoms are blue; 
O atoms are red; C atoms are green; S atom is gold (pdb id = 2AH1). 


e Orientation of catalytic residues, as will be seen in the serine 
proteases, where Asp!” orients His”’ (see Figure 14.21). 














FIGURE 14.14 Thermolysin is an endoprotease (that is, 
Zn?* o Vaal it cleaves polypeptides in the middle of the chain) with 
| F : Il : a catalytic Zn** ion in the active site. The Zn* ion stabi- 
G=0" Hy : — C— OH lizes the buildup of negative charge on the peptide car- 
Glu DE- Glu i bonyl oxygen, as a glutamate residue deprotonates wa- 
H C o HO—C -N ter, promoting hydroxide attack on the carbonyl carbon. 
< oN | H Thermolysin is found in certain laundry detergents, 


where it is used to remove protein stains from fabrics. 
Initial step of thermolysin reaction 


peptidase (see Chapter 5) contains an active site Zn**, which facilitates deprotona- 
tion of a water molecule in this manner. 


14:6: What Can Be Learned from Typical Enzyme 
Mechanisms? 


The balance of this chapter will be devoted to several classic and representative en- 
zyme mechanisms, including the serine proteases, the aspartic proteases, and cho- 
rismate mutase. Both the serine proteases and the aspartic proteases use general 
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acid-base catalysis chemistry; the serine proteases also employ a covalent catalysis 
strategy. Chorismate mutase, on the other hand, uses neither of these and depends 
instead on the formation of a NAC to carry out its reaction. These particular cases 
are well understood, because the three-dimensional structures of the enzymes and 
the bound substrates are known at atomic resolution and because great efforts have 
been devoted to kinetic and mechanistic studies. They are important because they 
represent reaction types that appear again and again in living systems and because 
they demonstrate many of the catalytic principles cited previously. Enzymes are the 
catalytic machines that sustain life, and what follows is an intimate look at the inner 
workings of the machinery. 


Serine Proteases 


Serine proteases are a class of proteolytic enzymes whose catalytic mechanism 
is based on an active-site serine residue. Serine proteases are one of the best- 
characterized families of enzymes. This family includes trypsin, chymotrypsin, elastase, 
thrombin, subtilisin, plasmin, tissue plasminogen activator, and other related enzymes. 
The first three of these are digestive enzymes and are synthesized in the pancreas 
and secreted into the digestive tract as inactive proenzymes, or zymogens. Within the 
digestive tract, the zymogen is converted into the active enzyme form by cleaving off 
a portion of the peptide chain. Thrombin is a crucial enzyme in the blood-clotting 
cascade, subtilisin is a bacterial protease, and plasmin breaks down the fibrin poly- 
mers of blood clots. Tissue plasminogen activator (TPA) specifically cleaves the 
proenzyme plasminogen, yielding plasmin. Owing to its ability to stimulate breakdown 
of blood clots, TPA can minimize the harmful consequences of a heart attack, if ad- 
ministered to a patient within 30 minutes of onset. Finally, although not itself a pro- 
tease, acetylcholinesterase is a serine esterase and is related mechanistically to the serine 
proteases. It degrades the neurotransmitter acetylcholine in the synaptic cleft be- 
tween neurons. 


The Digestive Serine Proteases 


Trypsin, chymotrypsin, and elastase all carry out the same reaction—the cleavage of a 
peptide chain—and although their structures and mechanisms are quite similar, they 
display very different specificities. Trypsin cleaves peptides on the carbonyl side of the 
basic amino acids, arginine or lysine (see Table 5.2). Chymotrypsin prefers to cleave 
on the carbonyl side of aromatic residues, such as phenylalanine and tyrosine. Elastase 
is not as specific as the other two; it mainly cleaves peptides on the carbonyl side of 
small, neutral residues. These three enzymes all possess molecular weights in the 
range of 25,000, and all have similar sequences (Figure 14.15) and three-dimensional 
structures. The structure of chymotrypsin is typical (Figure 14.16). The molecule is el- 
lipsoidal in shape and contains an a-helix at the C-terminal end (residues 230 to 245) 
and several -sheet domains. Most of the aromatic and hydrophobic residues are 
buried in the interior of the protein, and most of the charged or hydrophilic residues 
are on the surface. Three polar residues—His°’, Asp!", and Ser!°—form what is 
known as a catalytic triad at the active site (Figure 14.17). These three residues are 
conserved in trypsin and elastase as well. The active site is actually a depression on the 
surface of the enzyme, with a pocket that the enzyme uses to identify the residue for 
which it is specific (Figure 14.18). Chymotrypsin, for example, has a pocket sur- 
rounded by hydrophobic residues and large enough to accommodate an aromatic 
side chain. The pocket in trypsin has a negative charge (Asp!*) at its bottom, facili- 
tating the binding of positively charged arginine and lysine residues. Elastase, on the 
other hand, has a shallow pocket with bulky threonine and valine residues at the 
opening. Only small, nonbulky residues can be accommodated in its pocket. The 
backbone of the peptide substrate is hydrogen bonded in antiparallel fashion to 
residues 215 to 219 and bent so that the peptide bond to be cleaved is bound close to 
His” and Ser!®. 
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Chymotrypsinogen Trypsinogen 








FIGURE 14.16 Structure of chymotrypsin (white) in a complex with eglin C (blue ribbon structure), a 
target protein. The residues of the catalytic triad (His*”, Asp, and Ser!) are highlighted. His?” (red) 

is flanked by Asp'° (gold) and by Ser'™ (green). The catalytic site is filled by a peptide segment of 
eglin. Note how close Ser!” is to the peptide that would be cleaved in the chymotrypsin reaction (pdb 
id = 1ACB). 





FIGURE 14.15 Comparison of the amino acid sequences 
of chymotrypsinogen, trypsinogen, and elastase. Each 
circle represents one amino acid. Numbering is based 
on the sequence of chymotrypsinogen. Filled circles in- 
dicate residues that are identical in all three proteins. 
Disulfide bonds are indicated in orange. The positions of 
the three catalytically important active-site residues 
(His®”, Asp', and Ser!) are indicated. 





P. 


Asp 102 


FIGURE 14.17 The catalytic triad of chymotrypsin. 
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FIGURE 14.18 The substrate-binding pockets of trypsin 
(pdb id = 2CMY), chymotrypsin (pdb id = 1ACB), and 
elastase (pdb id = 3EST). Asp!® (aqua) coordinates Arg 
and Lys residues of peptide substrates in the trypsin 
binding pocket. Val% (purple) and Thr? (green) make 
the elastase binding pocket shallow and able to accom- 
modate only small, nonbulky residues. 





p-Nitrophenylacetate 


FIGURE 14.19 Chymotrypsin cleaves simple esters, in 
addition to peptide bonds. p-Nitrophenylacetate has 
been used in studies of the chymotrypsin mechanism. 





Elastase 


Chymotrypsin 


Trypsin 


The Chymotrypsin Mechanism in Detail: Kinetics 


Much of what is known about the chymotrypsin mechanism is based on studies of the 
hydrolysis of artificial substrates—simple organic esters, such as p-nitrophenylacetate 
(Figure 14.19). p-Nitrophenylacetate is an especially useful model substrate, because 
the nitrophenolate product is easily observed, owing to its strong absorbance at 400 
nm. When large amounts of chymotrypsin are used in kinetic studies with this sub- 
strate, a rapid initial burst of -nitrophenolate is observed (in an amount 
approximately equal to the enzyme concentration), followed by a much slower, linear 
rate of nitrophenolate release (Figure 14.20). Observation of a burst, followed by 
slower, steady-state product release, is strong evidence for a multistep mechanism, with 
a fast first step and a slower second step. 

In the chymotrypsin mechanism, the nitrophenylacetate combines with the en- 
zyme to form an ES complex. This is followed by a rapid step in which an acyl-enzyme 
intermediate is formed, with the acetyl group covalently bound to the very reactive 
Ser! . The nitrophenyl moiety is released as nitrophenolate (Figure 14.20), account- 
ing for the burst of nitrophenolate product. Attack of a water molecule on the acyl- 
enzyme intermediate yields acetate as the second product in a subsequent, slower 
step. The enzyme is now free to bind another molecule of p-nitrophenylacetate, and 
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FIGURE 14.20 Burst kinetics observed in the chy- 
motrypsin reaction (a). A burst of nitrophenolate 
NO, o7 
(b, first step) is followed by a slower, steady-state H.O H+ 
release. After an initial lag period, acetate release A 
(b, second step) is observed. This kinetic pattern @ ~ou + @~o C—CH; Sf =0—0— CH; 
is consistent with rapid formation of an acyl-en- 195 Fast step 19% | Slow step | 
; : Ser Ser” 
zyme intermediate (and the burst of nitropheno- O 
late). The slower, steady-state release of products O—C—CH, 
corresponds to rate-limiting breakdown of the || 
acyl-enzyme intermediate. O 


14.6 What Can Be Learned from Typical Enzyme Mechanisms? 


the p-nitrophenolate product produced at this point corresponds to the slower, 
steady-state formation of product in the upper right portion of Figure 14.20. In this 
mechanism, the release of acetate is the rate-limiting step and accounts for the ob- 
servation of burst kinetics—the pattern shown in Figure 14.20. 


The Serine Protease Mechanism in Detail: Events at the Active Site 


A likely mechanism for peptide hydrolysis is shown in Figure 14.21. As the backbone 
of the substrate peptide binds adjacent to the catalytic triad, the specific side chain 
fits into its pocket. Asp! of the catalytic triad positions His>’ and immobilizes it 
through a hydrogen bond as shown. In the first step of the reaction, His” acts as a 
general base to withdraw a proton from Ser!%, facilitating nucleophilic attack by 
Ser! on the carbonyl carbon of the peptide bond to be cleaved. This is probably a 
concerted step, because proton transfer prior to Ser!” attack on the acyl carbon would 
leave a relatively unstable negative charge on the serine oxygen. In the next step, 
donation of a proton from His®” to the peptide’s amide nitrogen creates a proto- 
nated amine on the covalent, tetrahedral intermediate, facilitating the subsequent 
bond breaking and dissociation of the amine product. The negative charge on the 
peptide oxygen is unstable; the tetrahedral intermediate is short lived and rapidly 
breaks down to expel the amine product. The acyl-enzyme intermediate that results 
is reasonably stable; it can even be isolated using substrate analogs for which further 
reaction cannot occur. With normal peptide substrates, however, subsequent 
nucleophilic attack at the carbonyl carbon by water generates another transient 
tetrahedral intermediate (Figure 14.21g). His°’ acts as a general base in this step, 
accepting a proton from the attacking water molecule. The subsequent collapse of 
the tetrahedral intermediate is assisted by proton donation from His” to the serine 
oxygen in a concerted manner. Deprotonation of the carboxyl group and its de- 
parture from the active site complete the reaction as shown. 

Until recently, the catalytic role of Asp!” in trypsin and the other serine pro- 
teases had been surmised on the basis of its proximity to His*” in structures obtained 
from X-ray diffraction studies, but it had never been demonstrated with certainty in 
physical or chemical studies. As can be seen in Figure 14.16, Asp? is buried at the 
active site; it is normally inaccessible to chemical modifying reagents. In 1987, 
Charles Craik, William Rutter, and their colleagues used site-directed mutagenesis 
(see Chapter 12) to prepare a mutant trypsin with an asparagine in place of Asp!°. 
This mutant trypsin possessed a hydrolytic activity with ester substrates only 
1/10,000 that of native trypsin, demonstrating that Asp! is indeed essential for 
catalysis and that its ability to immobilize and orient His” by formation of a hydro- 
gen bond is crucial to the function of the catalytic triad. 

The serine protease mechanism relies in part on a low-barrier hydrogen bond. 
In the free enzyme, the pK, values of Asp!? and His*’ are very different, and the 
H bond between them is a weak one. However, donation of the proton of Ser! to 
His®’ lowers the pK, of the protonated imidazole ring so it becomes a close match 
to that of Asp!, and the H bond between them becomes an LBHB. The energy re- 
leased in the formation of this LBHB is used to facilitate the formation of the sub- 
sequent tetrahedral intermediate (Figure 14.21c, g). 


The Aspartic Proteases 


Mammals, fungi, and higher plants produce a family of proteolytic enzymes known 
as aspartic proteases. These enzymes are active at acidic (or sometimes neutral) pH, 
and each possesses two aspartic acid residues at the active site. Aspartic proteases 
carry out a variety of functions (Table 14.3), including digestion (pepsin and chy- 
mosin), lysosomal protein degradation (cathepsin D and E), and regulation of blood 
pressure (renin is an aspartic protease involved in the production of angiotensin, a hor- 
mone that stimulates smooth muscle contraction and reduces excretion of salts and 
fluid). The aspartic proteases display a variety of substrate specificities, but normally 
they are most active in the cleavage of peptide bonds between two hydrophobic 
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FIGURE 14.21 A detailed mechanism for the chymotrypsin 
reaction. Note the low-barrier hydrogen bond (LBHB) in 
(c) and (g). 


14.6 What Can Be Learned from Typical Enzyme Mechanisms? 439 


A DEEPER LOOK 





Transition-State Stabilization in the Serine Proteases 






X-ray crystallographic studies of serine protease complexes with 
transition-state analogs have shown how chymotrypsin stabilizes The oxyanion 
the tetrahedral oxyanion transition states [structures (c) and (g) hole f 
in Figure 14.21] of the protease reaction. The amide nitrogens Gly! 
of Ser!” and Gly! form an “oxyanion hole” in which the sub- 
strate carbonyl oxygen is hydrogen bonded to the amide N—H 
groups. 

Formation of the tetrahedral transition state increases the in- 
teraction of the carbonyl oxygen with the amide N—H groups in 
two ways. Conversion of the carbonyl double bond to the longer 
tetrahedral single bond brings the oxygen atom closer to the 
amide hydrogens. Also, the hydrogen bonds between the charged 
oxygen and the amide hydrogens are significantly stronger than 
the hydrogen bonds with the uncharged carbonyl oxygen. 

Transition-state stabilization in chymotrypsin also involves the 
side chains of the substrate. The side chain of the departing amine 
product forms stronger interactions with the enzyme upon forma- 
tion of the tetrahedral intermediate. When the tetrahedral inter- 
mediate breaks down (Figure 14.21d and h), steric repulsion be- 
tween the product amine group and the carbonyl group of the 
acyl-enzyme intermediate leads to departure of the amine product. 


The oxyanion 
hole 


> The “oxyanion hole” of chymotrypsin stabilizes the tetrahedral oxy- 
anion intermediate of the mechanism in Figure 14.21. 











LARES Som 

Name Source Function 

Pepsin“ Stomach Digestion of dietary protein 
Chymosint Stomach Digestion of dietary protein 
Cathepsin D Spleen, liver, and many Lysosomal digestion of proteins 


other animal tissues 


Renin? Kidney Conversion of angiotensinogen 
to angiotensin I; regulation of 
blood pressure 


HIV-protease$§ AIDS virus Processing of AIDS virus proteins 











“The second enzyme to be crystallized (by John Northrop in 1930). Even more than urease before it, pepsin study by 
Northrop established that enzyme activity comes from proteins. 

tAlso known as rennin, it is the major pepsinlike enzyme in gastric juice of fetal and newborn animals. It has been used 
for thousands of years, in a gastric extract called rennet, in the making of cheese. 

*A drop in blood pressure causes release of renin from the kidneys, which converts more angiotensinogen to angiotensin. 
SA dimer of identical monomers, homologous to pepsin. 
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amino acid residues. The preferred substrates of pepsin, for example, contain aro- 
matic residues on both sides of the peptide bond to be cleaved. 

Most aspartic proteases are composed of 323 to 340 amino acid residues, with 
molecular weights near 35,000. Aspartic protease polypeptides consist of two ho- 
mologous domains that fold to produce a tertiary structure composed of two simi- 
lar lobes, with approximate twofold symmetry (Figure 14.22). Each of these lobes or 
domains consists of two B-sheets and two short a-helices. The two domains are 
bridged and connected by a six-stranded, antiparallel B-sheet. The active site is a 
deep and extended cleft, formed by the two juxtaposed domains and large enough 
to accommodate about seven amino acid residues. The two catalytic aspartate 
residues, residues 32 and 215 in porcine pepsin, for example, are located deep in 
the center of the active site cleft. The N-terminal domain forms a “flap” that extends 
over the active site, which may help to immobilize the substrate in the active site. 

On the basis, in part, of comparisons with chymotrypsin, trypsin, and the other ser- 
ine proteases, it was at first hypothesized that aspartic proteases might function by for- 
mation of covalent enzyme-substrate intermediates involving the active-site aspartate 
residues. However, all attempts to trap or isolate a covalent intermediate failed, and a 
mechanism (see following section) favoring noncovalent enzyme-substrate interme- 
diates and general acid—general base catalysis is now favored for aspartic proteases. 


(a) 





The Mechanism of Action of Aspartic Proteases 


FIGURE 14.22 Structures of (a) HIV-1 protease, a dimer A crucial datum supporting the general acid-general base model is the pH depen- 
(pdb id = 7HVP), and (b) pepsin, a monomer. Pepsin’s dence of protease activity (Figure 14.23). For many years, enzymologists hypothesized 
N-terminal half is shown in red; C-terminal half is that the aspartate carboxyl groups functioned alternately as general acid and general 
shown in blue (pdb id = 5PEP). base. This model requires that one of the aspartate carboxyls be protonated and one 
be deprotonated when substrate binds. (This made sense, because X-ray diffraction 
data on aspartic proteases had shown that the active-site structure in the vicinity of the 
two aspartates is highly symmetric.) However, Stefano Piana and Paolo Carloni re- 
ported in 2000 that molecular dynamics simulations of aspartic proteases were con- 
sistent with a low-barrier hydrogen bond involving the two active-site aspartates. This 
led to a new mechanism for the aspartic proteases (Figure 14.24) that begins with 
Piana and Carloni’s model of the LBHB structure of the free enzyme (state E). In this 
model, the LBHB holds the twin aspartate carboxyls in a coplanar conformation, with 
the catalytic water molecule on the opposite side of a ten-atom cyclic structure. 
Following substrate binding, a counterclockwise flow of electrons moves two pro- 
tons clockwise and creates a tetrahedral intermediate bound to a diprotonated 
enzyme form (FT). Then a clockwise movement of electrons moves two protons 


(a) (b) 


Pepsin HIV protease 


Enzyme activity 
e 
Inhibition constants 





FIGURE 14.23 pH-rate profiles for (a) pepsin and (b) HIV protease. (Adapted from Denburg, J. et al, 1968. The effect of 
pH on the rates of hydrolysis of three acylated dipeptides by pepsin. Journal of the American Chemical Society 90:479-486; and 
Hyland, J. et al., 1991. Human immunodeficiency virus-1 protease. 2. Use of pH rate studies and solvent kinetic isotope effects 
to elucidate details of chemical mechanism. Biochemistry 30:8454-8463.) 
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FIGURE 14.24 Mechanism for the aspartic proteases. The letter titles describe the states as follows: E represents 
the enzyme form with a low-barrier hydrogen bond between the catalytic aspartates, F represents the enzyme 
form with both aspartates protonated and joined by a conventional hydrogen bond, S represents bound sub- 
strate, T represents a tetrahedral amide hydrate intermediate, P represents bound carboxyl product, and 

Q represents bound amine product. This mechanism is based in part on a mechanism proposed by Dexter 
Northrop, a distant relative of John Northrop, who had first crystallized pepsin in 1930. (Northrop, D. B., 2001. Follow 
the protons: A low-barrier hydrogen bond unifies the mechanisms of the aspartic proteases. Accounts of Chemical Research 
34:790-797,) The mechanism is also based on data of Thomas Meek. (Meek,T. D, Catalytic mechanisms of the aspartic 
proteinases. In Sinnott, M.,ed, Comprehensive Biological Catalysis: A Mechanistic Reference, San Diego: Academic Press, 1998.) 


counterclockwise and generates the zwitterion intermediate bound to a monopro- 
tonated enzyme form (ET’). Collapse of the zwitterion cleaves the C—N bond of 
the substrate. Dissociation of one product leaves the enzyme in the diprotonated 
FQ form. Finally, deprotonation and rehydration lead to regeneration of the ten- 
atom cyclic structure, E. 

What is the purpose of the low-barrier hydrogen bond in the aspartic protease 
mechanism? It may be to disperse electron density in the ten-atom cyclic structure, 
accomplishing rate acceleration by means of “hydrogen tunneling” (Figure 14.25). 
The barrier between the ES and ET’ states of Figure 14.24 is imagined to be large, 
and the state FT may not exist as a discrete intermediate but rather may exist tran- 
siently to facilitate conversion of ES and ET". 


The AIDS Virus HIV-1 Protease Is an Aspartic Protease 


Recent research on acquired immunodeficiency syndrome (AIDS) and its causative 
viral agent, the human immunodeficiency virus (HIV-1), has brought a new aspar- 
tic protease to light. HIV-1 protease cleaves the polyprotein products of the HIV-1 
genome, producing several proteins necessary for viral growth and cellular infec- 
tion (Figure 14.26). HIV-1 protease cleaves several different peptide linkages. For 
example, the protease cleaves between the Tyr and Pro residues of the sequence 
Ser-Gln-Asn-Tyr-Pro-Ile-Val, which joins the p17 and p24 HIV-1 proteins. 

The HIV-1 protease is a remarkable viral imitation of mammalian aspartic pro- 
teases: It is a dimer of identical subunits that mimics the two-lobed monomeric 
structure of pepsin and other aspartic proteases. The HIV-1 protease subunits are 
99-residue polypeptides that are homologous with the individual domains of the 
monomeric proteases. Structures determined by X-ray diffraction studies reveal that 
the active site of HIV-1 protease is formed at the interface of the homodimer and 
consists of two aspartate residues, designated Asp” and Asp’, one contributed by 
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FIGURE 14.25 Energy level diagram for the aspartic pro- 
tease reaction, showing ground-state hydrogen tunneling 
(arrow), with consequent rate acceleration. 
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FIGURE 14.26 HIV mRNA provides the genetic informa- 
tion for synthesis of a polyprotein. Proteolytic cleavage 
of this polyprotein by HIV protease produces the indi- 
vidual proteins required for viral growth and cellular 
infection. 


CENGAGENOW” ACTIVE FIGURE 14.27 (left) HIV-1 pro- 
tease complexed with the inhibitor Crixivan (red) made 
by Merck. The flaps (residues 46-55 from each subunit) 
covering the active site are shown in green, and the 
active-site aspartate residues involved in catalysis are 
shown in light purple. (right) The close-up of the active 
site shows the interaction of Crixivan with the carboxyl 
groups (yellow) of the essential aspartate residues (pdb 
id = 1HSG). Test yourself on the concepts in this 
figure at www.cengage.com/login. 
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each subunit (Figure 14.27). In the homodimer, the active site is covered by two 
identical “flaps,” one from each subunit, in contrast to the monomeric aspartic pro- 
teases, which possess only a single active-site flap. Enzyme kinetic measurements by 
Thomas Meek and his collaborators at SmithKline Beecham Pharmaceuticals have 
shown that the mechanism of HIV-1 protease is very similar to those of other aspar- 
tic proteases. 


Chorismate Mutase: A Model for Understanding Catalytic Power 
and Efficiency 


Direct comparison of an enzyme reaction with the analogous uncatalyzed reaction 
is usually difficult, if not impossible. There are several problems: First, many 
enzyme-catalyzed reactions do not proceed at measurable rates in the absence of 
the enzyme. Second, many enzyme-catalyzed reactions involve formation of a cova- 
lent intermediate between the enzyme and the substrate. Third, a reaction occur- 
ring in an enzyme active site might proceed through a different transition state than 
the corresponding solution reaction. Chorismate mutase is a happy exception to all 
these potential problems. First, although the rate of this reaction is more than a mil- 
lion times faster on the enzyme, the uncatalyzed solution reaction still proceeds at 
reasonable and measurable rates. Second, the enzyme reaction does not employ a 
covalent intermediate. What about the transition states for the catalyzed and un- 
catalyzed reactions? Chorismate mutase acts in the biosynthesis of phenylalanine 
and tyrosine in microorganisms and plants. It involves a single substrate and cat- 
alyzes a concerted intramolecular rearrangement of chorismate to prephenate. In 
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HUMAN BIOCHEMISTRY 


Protease Inhibitors Give Life to AIDS Patients 


Infection with HIV was once considered a death sentence, but the 
emergence of a new family of drugs called protease inhibitors has 
made it possible for some AIDS patients to improve their overall 
health and extend their lives. These drugs are all specific inhibitors 
of the HIV protease. By inhibiting the protease, they prevent the de- 
velopment of new virus particles in the cells of infected patients. A 
combination of drugs—including a protease inhibitor together with 
a reverse transcriptase inhibitor like AZT—can reduce the human 
immunodeficiency virus (HIV) to undetectable levels in about 40% 
to 50% of infected individuals. Patients who respond successfully to 
this combination therapy have experienced dramatic improvement 
in their overall health and a substantially lengthened life span. 

Four of the protease inhibitors approved for use in humans by 
the U.S. Food and Drug Administration are shown below: Crixivan 
by Merck, Invirase by Hoffman-LaRoche, Norvir by Abbott, and 
Viracept by Agouron. These drugs were all developed from a 
“structure-based” design strategy; that is, the drug molecules were 
designed to bind tightly to the active site of the HIV-1 protease. The 
backbone OH-group in all these substances inserts between the two 
active-site carboxyl groups of the protease. 

In the development of an effective drug, it is not sufficient 
merely to show that a candidate compound can cause the desired 
biochemical effect. It must also be demonstrated that the drug 
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can be effectively delivered in sufficient quantities to the desired 
site(s) of action in the organism and that the drug does not cause 
undesirable side effects. The HIV-1 protease inhibitors shown 
here fulfill all of these criteria. Other drug candidates have been 
found that are even better inhibitors of HIV-1 protease in cell cul- 
tures, but many of these fail the test of bioavailability—the ability 
of a drug to be delivered to the desired site(s) of action in the 
organism. 

Candidate protease inhibitor drugs must be relatively specific 
for the HIV-1 protease. Many other aspartic proteases exist in the 
human body and are essential to a variety of body functions, in- 
cluding digestion of food and processing of hormones. An ideal 
drug thus must strongly inhibit the HIV-1 protease, must be deliv- 
ered effectively to the lymphocytes where the protease must 
be blocked, and should not adversely affect the activities of the 
essential human aspartic proteases. 

A final but important consideration is viral mutation. Certain 
mutant HIV strains are resistant to one or more of the protease in- 
hibitors, and even for patients who respond initially to protease in- 
hibitors it is possible that mutant viral forms may eventually arise 
and thrive in infected individuals. The search for new and more ef- 
fective protease inhibitors is ongoing. 
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this simple reaction, one carbon-oxygen bond is broken, and one carbon-carbon 
bond is formed. It is an example of a Claisen rearrangement, familiar to any student 
of organic chemistry (Figure 14.28). There are two possible transition states, one in- 
volving a chair conformation and the other a boat (Figure 14.29). Jeremy Knowles 
and his co-workers have shown that both the enzymatic and the solution reactions 
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FIGURE 14.28 (a) The chorismate mutase reaction con- 
verts chorismate to prephenate. (b) A classic Claisen re- 
arrangement. Conversion of allyl phenyl ether to 2-allyl 
alcohol proceeds through a cyclohexadienone interme- 
diate, which then undergoes a keto-enol tautomerization. 


FIGURE 14.29 The conversion of chorismate to 
prephenate could occur (in principle) through a boat 
transition state or a chair transition state. The difference 
can be understood by imagining two different isotopes 
of hydrogen (blue and green) at carbon 9 of chorismate 
and the products that would result in each case. 
Knowles and co-workers have shown that both the un- 
catalyzed reaction and the reaction on chorismate mu- 
tase occur through a chair transition state. The molecule 
shown at right is a transition analog for the chorismate 
mutase reaction. 
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take place via a chair transition state, and a transition-state analog of this state has 
been characterized (Figure 14.29). 


The Chorismate Mutase Active Site Lies at the Interface of Two Subunits The 
chorismate mutase from E. coli is the N-terminal portion (109 residues) of a bi- 
functional enzyme, termed the P protein, which also has a C-terminal prephenate 
dehydrogenase activity. The N-terminal portion of the P protein has been prepared 
as a separate protein by recombinant DNA techniques, and this engineered protein 
is a fully functional chorismate mutase. The structure shown in Figure 14.30 is a 
homodimer, each monomer consisting of three o-helices (denoted H1, H2, and 
H3) connected by short loops. The two monomers are dovetailed in the dimer 
structure, with the H1 helices paired and the H3 helices overlapping significantly. 
The long, ten-turn H1 helices form an antiparallel coiled coil, with leucines at po- 
sitions 10, 17, 24, and 31 in a classic 7-residue repeat pattern (see Chapter 6). 

The chorismate mutase dimer contains two equivalent active sites, each formed 
from portions of both monomers. The structure shown in Figure 14.20 contains 
a bound transition-state analog (Figure 14.29) stabilized by 12 electrostatic and 











Chorismate Prephenate 
H 
Hei coo- 
via 
boat "COO 
~OOC 
OH O 
COO- 
H 
OOC = 
H OH 
via 
chair wuCooT Transition state 


analog 





OH 


14.6 What Can Be Learned from Typical Enzyme Mechanisms? 445 


(b) 


85 o 
Val*? Arg?s 





FIGURE 14.30 Chorismate mutase is a symmetric homodimer, each monomer consisting of three a-helices 
connected by short loops. (a) The dimer contains two equivalent active sites, each formed from portions of 
both monomers (pdb id = 4CSM). (b) A close-up of the active site, showing the bound transition-state analog 
(pink, see Figure 14.29). 


hydrogen-bonding interactions (Figure 14.31). Arg” from one subunit and Arg!!” 
from the other coordinate the carboxyl groups of the analog, and a third arginine 
(Arg*!) coordinates a water molecule, which in turn coordinates both carboxyls of 
the analog. Each oxygen of the analog is coordinated by two groups from the active 
site. In addition, there are hydrophobic residues surrounding the analog, especially 
Val” on one side and Ile*! and Val® on the other. 


The Chorismate Mutase Active Site Favors a Near-Attack Conformation The 
chorismate mutase reaction mechanism requires that the carboxyvinyl group fold 
over the chorismate ring to facilitate the Claisen rearrangement (Figure 14.32). 
This implies the formation of a NAC on the way to the transition state. Bruice and 
his co-workers have carried out extensive molecular dynamics simulations of the 
chorismate mutase reaction. Their calculations show that, in the nonenzymatic re- 
action, only 0.0001% of chorismate in solution exists in the NAC required for reac- 
tion. Similar calculations show that, in the enzyme active site, chorismate adopts a 
NAC 30% of the time. The computersimulated NAC in the chorismate mutase 





FIGURE 14.31 In the chorismate mutase active site, the 
transition-state analog is stabilized by 12 electrostatic 
and hydrogen-bonding interactions. (Adapted from Lee, A., 
et al., 1995. Atomic structure of the buried catalytic pocket of 
Escherichia coli chorismate mutase. Journal of the American 
Chemical Society 117:3627-3628.) 
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FIGURE 14.32 The mechanism of the chorismate mutase reaction. The carboxyvinyl group folds up and over 
the chorismate ring, and the reaction proceeds via an internal rearrangement. 
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FIGURE 14.33 Chorismate bound to the active site of chorismate mutase in a structure that resembles a NAC. 
Arrows indicate hydrophobic interactions, and red dotted lines indicate electrostatic interactions. (Adapted 
from Hur, S. and Bruice, T., 2003. The near attack conformation approach to the study of the chorismate to 
prephenate reaction. Proceedings of the National Academy of Sciences USA 100:12015-12020.) 


CRITICAL DEVELOPMENTS IN BIOCHEMISTRY 
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Because the transition states of enzyme-catalyzed reactions are 
imagined to have lifetimes on the order of a bond vibration 
(107? sec), it has long been assumed that it would not be possi- 
ble to see a transition state in the form of a crystal structure 
solved by X-ray diffraction. However, Debra Dunaway-Mariano, 
Karen Allen, and their colleagues have crystallized phosphory- 
lated B-phosphoglucomutase at low temperature in the presence 
of Mg?* and either glucose-1-phosphate or glucose-6-phosphate 
and have observed a stable pentacoordinate phosphorane that 
looks very much like the transition state anticipated for the phos- 
phoryl transfer carried out by this enzyme. The most likely mech- 
anisms for a phosphoryl transfer reaction are shown in the ac- 
companying figure: (a) is a dissociative mechanism involving an 
intermediate metaphosphate, with expected apical P-O distances 


(a) Dissociative 





Caught in the Act! A High-Energy Intermediate in the Phosphoglucomutase Reaction 


of 0.33 nm or more. (b) is an Sy2-like, partly associative mecha- 
nism, with apical P-O distances of 0.19 to 0.21 nm and bond or- 
ders of 0.5. A fully-associative mechanism would have apical P-O 
distances of 0.166 to 0.176 nm. (c) The crystal structure of phos- 
phoglucomutase shows a trigonal bipyramidal oxyphosphorane 
with P-O distances of 0.2 and 0.21 nm and calculated bond orders 
of 0.24 to 0.45. The structure is remarkably similar to what would 
be expected for the transition state of a partly associative mecha- 
nism. Is this the transition state, trapped in a crystal? The crystals 
were frozen at liquid nitrogen temperature (77 K), and the X-ray 
diffraction data were collected at 93 K. Because we imagine that a 
true transition state has a lifetime too short to be observed in this 
way, we may surmise that what is a transition state at physiological 
temperature is a stable intermediate at very low temperature. 
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active site (Figure 14.33) is similar in many ways to the chorismate mutase-TSA 
complex, with Arg” and Arg! coordinating the two carboxylate groups of choris- 
mate so as to position the carboxyvinyl group in the conformation required for 
transition-state formation. This conformation is also stabilized by Val” and Ile®, 
which are in van der Waals contact with the vinyl group and the chorismate ring, re- 
spectively. Thus, the NAC of chorismate is promoted by electrostatic and hydro- 
phobic interactions with active-site residues. 

The energetics of the chorismate mutase reaction are revealing (Figure 
14.34). Computer simulations by Bruice and his co-workers show that formation 
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FIGURE 14.34 The energetic profile of the chorismate 
mutase reaction. Computer simulations by Bruice and 
his co-workers show that the NAC and the E-S complex 
are separated by only 0.42 kJ/mol, meaning that the 
NAC forms much more readily in the enzyme active site 
than it does in the absence of enzyme. The NAC and the 
reaction transition state are separated by similar energy 
barriers in either the presence or the absence of the en- 
zyme. Thus, the catalytic prowess of the enzyme lies in 
its ability to form the NAC at a very small energetic cost. 
(Adapted from Bruice, T., 2002. A view at the millennium: 
The efficiency of enzymatic catalysis. Accounts of Chemi- 
cal Reactions 35:139-148.) 
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of a NAC in the absence of the enzyme is energetically costly, whereas formation 
of the NAC in the enzyme active site is facile, with only a modest energy cost. On 
the other hand, the energy required to move from the NAC to the transition 
state is about the same for the solution and the enzyme reactions. Clearly, the cat- 
alytic advantage of chorismate mutase is the ease of formation of a NAC in the 


active site. 


SUMMARY 


It is simply chemistry—the breaking and making of bonds—that gives 
enzymes their prowess. This chapter explores the unique features of this 
chemistry. The mechanisms of thousands have been studied in at least 
some detail. 


14.1 What Are the Magnitudes of Enzyme-Induced Rate Accelera- 
tions? Enzymes are powerful catalysts. Enzyme-catalyzed reactions 
are typically 107 to 10!4 times faster than their uncatalyzed counter- 
parts and may exceed 10". 


14.2 What Role Does Transition-State Stabilization Play in Enzyme 
Catalysis? The energy barrier for the uncatalyzed reaction is the dif- 
ference in energies of the S and Xt states. Similarly, the energy barrier 
to be surmounted in the enzyme-catalyzed reaction, assuming that E is 
saturated with S, is the energy difference between ES and EX?. Reaction 
rate acceleration by an enzyme means simply that the energy barrier be- 
tween ES and EX? is less than the energy barrier between S and Xt. In 
terms of the free energies of activation, A Gt < AG,}. 


14.3 How Does Destabilization of ES Affect Enzyme Catalysis? The 
favorable interactions between the substrate and amino acid residues on 
the enzyme account for the intrinsic binding energy, AG,. The intrinsic 
binding energy ensures the favorable formation of the ES complex, but 
if uncompensated, it makes the activation energy for the enzyme- 
catalyzed reaction unnecessarily large and wastes some of the catalytic 
power of the enzyme. Because the enzymatic reaction rate is deter- 
mined by the difference in energies between ES and EXt, the smaller 
this difference, the faster the enzyme-catalyzed reaction. Tight binding 


of the substrate deepens the energy well of the ES complex and actually 
lowers the rate of the reaction. 

Destabilization of the ES complex can involve structural strain, 
desolvation, or electrostatic effects. Destabilization by strain or distor- 
tion is usually just a consequence of the fact that the enzyme has evolved 
to bind the transition state more strongly than the substrate. 


14.4 How Tightly Do Transition-State Analogs Bind to the Active Site? 
Given a ratio k./k, of 10" and a typical Ks of 107? M, the value of Ky 
should be 107" M. This is the dissociation constant for the transition- 
state complex from the enzyme, and this very low value corresponds to 
very tight binding of the transition state by the enzyme. It is unlikely that 
such tight binding in an enzyme transition state will ever be measured 
experimentally, however, because the lifetimes of transition states are 
typically 107! to 107} sec. 


14.5 What Are the Mechanisms of Catalysis? Enzymes facilitate for- 
mation of NACs (near-attack conformations). Enzyme reaction mecha- 
nisms involve covalent bond formation, general acid-base catalysis, low- 
barrier hydrogen bonds, metal ion effects, and proximity and favorable 
orientation of reactants. Most enzymes display involvement of two or 
more of these in any given reaction. 


14.6 What Can Be Learned from Typical Enzyme Mechanisms? The en- 
zymes examined in this chapter—serine proteases, aspartic proteases, and 
chorismate mutase—provide representative examples of catalytic mecha- 
nisms; all embody two or more of the rate enhancement contributions. 
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PROBLEMS 


CENGAGENOW’” Preparing for an exam? Create your own study path for this b. Explain why the structure you have proposed explains the re- 
chapter at www.cengage.com/login. duced activity of the mutant trypsin. 

1. Tosyl-L-phenylalanine chloromethyl ketone (TPCK) specifically in- c. See the original journal articles (Sprang, et al., 1987. Science 237: 

hibits chymotrypsin by covalently labeling His®”. 905-909; and Craik, et al., 1987. Science 237:909-913) to see what 


Craik and Rutter’s answer to this question was. 
3. Pepstatin (see below) is an extremely potent inhibitor of the mono- 
meric aspartic proteases, with K; values of less than 1 nM. 
a. On the basis of the structure of pepstatin, suggest an explanation 





o CH, O for the strongly inhibitory properties of this peptide. 
/ \ | | || b. Would pepstatin be expected to also inhibit the HIV-1 protease? 
CH; \ S—NH—CH—C— CH,Cl Explain your answer. 
== l 4. Based on the following reaction scheme, derive an expression for 


ke/ ku, the ratio of the rate constants for the catalyzed and uncat- 


Tosyl-t-phenylalanine chloromethyl ketone (TPCK) alyzed reactions, respectively, in terms of the free energies of 


a. Propose a mechanism for the inactivation reaction, indicating the activation for the catalyzed (AG,') and the uncatalyzed (A G,*) 
structure of the product(s). reactions. 
b. State why this inhibitor is specific for chymotrypsin. K. a 
c. Propose a reagent based on the structure of TPCK that might be S = X — P 
an effective inhibitor of trypsin. E E 
2. In this chapter, the experiment in which Craik and Rutter re- ~ Ks | 
placed Asp’ with Asn in trypsin (reducing activity 10,000-fold) K. ' 


was discussed. 

a. On the basis of your knowledge of the catalytic triad structure in 
trypsin, suggest a structure for the “uncatalytic triad” of Asn-His- 
Ser in this mutant enzyme. 
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5. 


6. 


10. 


11: 


The kea for alkaline phosphatase-catalyzed hydrolysis of methyl- 
phosphate is approximately 14/sec at pH 8 and 25°C. The rate con- 
stant for the uncatalyzed hydrolysis of methylphosphate under the 
same conditions is approximately 107!°/sec. What is the difference 
in the free energies of activation of these two reactions? 

Active a-chymotrypsin is produced from chymotrypsinogen, an 
inactive precursor, as shown in the color figure on the previous 
page. The first intermediate—a-chymotrypsin—displays chy- 
motrypsin activity. Suggest proteolytic enzymes that might carry 
out these cleavage reactions effectively. 


- Consult a classic paper by William Lipscomb (1982. Accounts of 


Chemical Research 15:232-—238), and on the basis of this article write 
a simple mechanism for the enzyme carboxypeptidase A. 


. The relationships between the free energy terms defined in the so- 


lution to Problem 4 above are shown in the following figure: 





Reaction coordinate 


If the energy of the ES complex is 10 kJ/mol lower than the energy 
of E + S, the value of AG, + is 20 kJ/mol, and the value of AG, is 90 
kJ/mol. What is the rate enhancement achieved by an enzyme in 
this case? 


. As noted on page 423, a true transition state can bind to an enzyme 


active site with a Kr as low as 7 X 10°26 M. This is a remarkable 
number, with interesting consequences. Consider a hypothetical so- 
lution of an enzyme in equilibrium with a ligand that binds with a 
Kp of 107-2” M. If the concentration of free enzyme, [E], is equal to 
the concentration of the enzyme-—ligand complex, [EL], what 
would [L], the concentration of free ligand, be? Calculate the vol- 
ume of solution that would hold one molecule of free ligand at this 
concentration. 


Another consequence of tight binding (problem 9) is the free en- 
ergy change for the binding process. Calculate AG°’ for an equilib- 
rium with a Kp of 107? M. Compare this value to the free energies 
of the noncovalent and covalent bonds with which you are familiar. 
What are the implications of this number, in terms of the nature of 
the binding of a transition state to an enzyme active site? 

The incredible catalytic power of enzymes can perhaps best be ap- 
preciated by imagining how challenging life would be without just 
one of the thousands of enzymes in the human body. For example, 
consider life without fructose-1,6-bisphosphatase, an enzyme in the 
gluconeogenesis pathway in liver and kidneys (see Chapter 22), 
which helps produce new glucose from the food we eat: 


Fructose-1,6-bisphosphate + HO — Fructose-6-P + P; 


The human brain requires glucose as its only energy source, and the 
typical brain consumes about 120 g (or 480 calories) of glucose daily. 
Ordinarily, two pieces of sausage pizza could provide more than 
enough potential glucose to feed the brain for a day. According to a na- 
tional fast-food chain, two pieces of sausage pizza provide 1340 calories, 
48% of which is from fat. Fats cannot be converted to glucose in glu- 
coneogenesis, so that leaves 697 calories potentially available for glu- 
cose synthesis. The first-order rate constant for the hydrolysis of fruc- 


tose-1,6-bisphosphate in the absence of enzyme is 2 X 107*°/sec. Cal- 
culate how long it would take to provide enough glucose for one day 
of brain activity from two pieces of sausage pizza without the enzyme. 


Preparing for the MCAT Exam 


The following graphs show the temperature and pH dependencies of four 
enzymes, A, B, X, and Y. Problems 12 through 18 refer to these graphs. 


12. 
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16. 
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Enzymes X and Y in the figure are both protein-digesting enzymes 
found in humans. Where would they most likely be at work? 

a. X is found in the mouth, Yin the small intestine. 

b. X in the small intestine, Yin the mouth. 

c. X in the stomach, Yin the small intestine. 

d. X in the small intestine, Yin the stomach. 


Which statement is true concerning enzymes X and Y? 

a. They could not possibly be at work in the same part of the body at 
the same time. 

b. They have different temperature ranges at which they work best. 

c. Ata pH of 4.5, enzyme X works slower than enzyme Y. 

d. At their appropriate pH ranges, both enzymes work equally fast. 


. What conclusion may be drawn concerning enzymes A and B? 


a. Neither enzyme is likely to be a human enzyme. 
b. Enzyme A is more likely to be a human enzyme. 
c. Enzyme B is more likely to be a human enzyme. 
d. Both enzymes are likely to be human enzymes. 


At which temperatures might enzymes A and B both work? 
a. Above 40°C 

b. Below 50°C 

c. Above 50°C and below 40°C 

d. Between 40° and 50°C 


An enzyme-substrate complex can form when the substrate (s) 
bind(s) to the active site of the enzyme. Which environmental con- 
dition might alter the conformation of an enzyme to the extent that 
its substrate is unable to bind? 

a. Enzyme A at 40°C 

b. Enzyme B at pH 2 

c. Enzyme X at pH 4 

d. Enzyme Y at 37°C 


17. At 35°C, the rate of the reaction catalyzed by enzyme A begins to 
level off. Which hypothesis best explains this observation? 
a. The temperature is too far below optimum. 
b. The enzyme has become saturated with substrate. 
c. Both A and B. 
d. Neither A nor B. 
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18. In which of the following environmental conditions would digestive 
enzyme Y be unable to bring its substrate(s) to the transition state? 
a. At any temperature below optimum 
b. At any pH where the rate of reaction is not maximum 
c. At any pH lower than 5.5 
d. At any temperature higher than 37°C 
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ESSENTIAL QUESTIONS 


Enzymes catalyze essentially all of the thousands of metabolic reactions taking place 
in cells. Many of these reactions are at cross-purposes: Some enzymes catalyze the 
breakdown of substances, whereas others catalyze synthesis of the same substances; 
many metabolic intermediates have more than one fate; and energy is released in 
some reactions and consumed in others. At key positions within the metabolic path- 
ways, regulatory enzymes sense the momentary needs of the cell and adjust their 
catalytic activity accordingly. Regulation of these enzymes ensures the harmonious 
integration of the diverse and often divergent reactions of metabolism. 


What are the properties of regulatory enzymes? How do regulatory enzymes 
sense the momentary needs of cells? What molecular mechanisms are used to 
regulate enzyme activity? 


Sala What Factors Influence Enzymatic Activity? 


The activity displayed by enzymes is affected by a variety of factors, some of which are 
essential to the harmony of metabolism. Two of the more obvious ways to regulate the 
amount of activity at a given time are (1) to increase or decrease the number of en- 
zyme molecules and (2) to increase or decrease the activity of each enzyme molecule. 
Although these ways are obvious, the cellular mechanisms that underlie them are 
complex and varied, as we shall see. A general overview of factors influencing enzyme 
activity includes the following considerations. 


The Availability of Substrates and Cofactors Usually Determines How 
Fast the Reaction Goes 


The availability of substrates and cofactors typically determines the enzymatic reac- 
tion rate. In general, enzymes have evolved such that their K„ values approximate 
the prevailing in vivo concentration of their substrates. (It is also true that the con- 
centration of some enzymes in cells is within an order of magnitude or so of the 
concentrations of their substrates.) 


As Product Accumulates, the Apparent Rate of the Enzymatic Reaction 
Will Decrease 


The enzymatic rate, v = d[P]/dt, “slows down” as product accumulates and equilib- 
rium is approached. The apparent decrease in rate is due to the conversion of P to 
S by the reverse reaction as [P] rises. Once [P]/[S] = Keq no further reaction is ap- 
parent. Keq defines thermodynamic equilibrium. Enzymes have no influence on the 
thermodynamics of a reaction. Also, product inhibition can be a kinetically valid 


phenomenon: Some enzymes are actually inhibited by the products of their action. 


Genetic Regulation of Enzyme Synthesis and Decay Determines 
the Amount of Enzyme Present at Any Moment 


The amounts of enzyme synthesized by a cell are determined by transcription reg- 
ulation (see Chapter 29). If the gene encoding a particular enzyme protein is 
turned on or off, changes in the amount of enzyme activity soon follow. Induction, 
which is the activation of enzyme synthesis, and repression, which is the shutdown 
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of enzyme synthesis, are important mechanisms for the regulation of metabolism. 
By controlling the amount of an enzyme that is present at any moment, cells can ei- 
ther activate or terminate various metabolic routes. Genetic controls over enzyme 
levels have a response time ranging from minutes in rapidly dividing bacteria to 
hours (or longer) in higher eukaryotes. Once synthesized, the enzyme may also be 
degraded, either through normal turnover of the protein or through specific decay 
mechanisms that target the enzyme for destruction. These mechanisms are dis- 
cussed in detail in Chapter 31. 


Enzyme Activity Can Be Regulated Allosterically 


Enzymatic activity can also be activated or inhibited through noncovalent interaction 
of the enzyme with small molecules (metabolites) other than the substrate. This 
form of control is termed allosteric regulation, because the activator or inhibitor 
binds to the enzyme at a site other than (allo means “other”) the active site. Further- 
more, such allosteric regulators, or effector molecules, are often quite different ster- 
ically from the substrate. Because this form of regulation results simply from re- 
versible binding of regulatory ligands to the enzyme, the cellular response time can 
be virtually instantaneous. 


Enzyme Activity Can Be Regulated Through Covalent Modification 


Enzymes can be regulated by covalent modification, the reversible covalent attachment 
of a chemical group. Enzymes susceptible to such regulation are called interconvert- 
ible enzymes, because they can be reversibly converted between two forms. Thus, a 
fully active enzyme can be converted into an inactive form simply by the covalent at- 
tachment of a functional group. For example, protein kinases are enzymes that act in 
covalent modification by attaching a phosphoryl moiety to target proteins (Figure 
15.1). Protein kinases catalyze the ATP-dependent phosphorylation of —OH groups 
on Ser, Thr, or Tyr side chains. Removal of the phosphate group by a phosphoprotein 
phosphatase returns the enzyme to its original state. In contrast to the example in the 
figure, some enzymes exist in an inactive state unless specifically converted into the 
active form through covalent addition of a functional group. Covalent modification 
reactions are catalyzed by special converter enzymes, which are themselves subject to 
metabolic regulation. (Protein kinases are one class of converter enzymes.) Although 
covalent modification represents a stable alteration of the enzyme, a different con- 
verter enzyme operates to remove the modification, so when the conditions that fa- 
vored modification of the enzyme are no longer present, the process can be reversed, 
restoring the enzyme to its unmodified state. Because covalent modification events are 
catalyzed by enzymes, they occur very quickly, with response times of seconds or even 
less for significant changes in metabolic activity. 


Regulation of Enzyme Activity Also Can Be Accomplished 
in Other Ways 


Enzyme regulation is an important matter to cells, and evolution has provided a vari- 
ety of additional options, including zymogens, isozymes, and modulator proteins. We 
will discuss these options first and then return to the major topics of this chapter— 
enzyme regulation through allosteric mechanisms and covalent modification. 
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FIGURE 15.1 Enzyme regulation by reversible covalent 
modification. 
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FIGURE 15.2 Proinsulin is an 86-residue precursor to in- 
sulin (the sequence shown here is human proinsulin). 
Proteolytic removal of residues 31 to 65 yields insulin. 
Residues 1 through 30 (the B chain) remain linked to 
residues 66 through 87 (the A chain) by a pair of inter- 
chain disulfide bridges. 


CENGAGENOW” ANIMATED FIGURE 15.3 The proteo- 
lytic activation of chymotrypsinogen. See this figure 
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Zymogens Are Inactive Precursors of Enzymes 


Most proteins become fully active as their synthesis is completed and they sponta- 
neously fold into their native, three-dimensional conformations. Some proteins, how- 
ever, are synthesized as inactive precursors, called zymogens or proenzymes, that ac- 
quire full activity only upon specific proteolytic cleavage of one or several of their 
peptide bonds. Unlike allosteric regulation or covalent modification, zymogen activa- 
tion by specific proteolysis is an irreversible process. Activation of enzymes and other 
physiologically important proteins by specific proteolysis is a strategy frequently ex- 
ploited by biological systems to switch on processes at the appropriate time and place, 
as the following examples illustrate. 


Insulin Some protein hormones are synthesized in the form of inactive precursor 
molecules, from which the active hormone is derived by proteolysis. For instance, 
insulin, an important metabolic regulator, is generated by proteolytic excision of a 
specific peptide from proinsulin (Figure 15.2). 


Proteolytic Enzymes of the Digestive Tract Enzymes of the digestive tract that 
serve to hydrolyze dietary proteins are synthesized in the stomach and pancreas as 
zymogens (Table 15.1). Only upon proteolytic activation are these enzymes able to 
form a catalytically active substrate-binding site. The activation of chymotrypsino- 
gen is an interesting example (Figure 15.3). Chymotrypsinogen is a 245-residue 
polypeptide chain crosslinked by five disulfide bonds. Chymotrypsinogen is con- 
verted to an enzymatically active form called 7-chymotrypsin when trypsin cleaves 
the peptide bond joining Arg!’ and Ile!®. The enzymatically active 7-chymotrypsin 
acts upon other 7-chymotrypsin molecules, excising two dipeptides: Ser!*—Arg!” 
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Origin Zymogen Active Protease 
Pancreas Trypsinogen Trypsin 

Pancreas Chymotrypsinogen Chymotrypsin 
Pancreas Procarboxypeptidase Carboxypeptidase 
Pancreas Proelastase Elastase 
Stomach Pepsinogen Pepsin 
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and Thr!#—Asn!48, The end product of this processing pathway is the mature pro- 
tease a-chymotrypsin, in which the three peptide chains, A (residues 1 through 
13), B (residues 16 through 146), and C (residues 149 through 245), remain 
together because they are linked by two disulfide bonds, one from A to B and one 
from B to C. 


Blood Clotting The formation of blood clots is the result of a series of zymogen 
activations (Figure 15.4). The amplification achieved by this cascade of enzymatic 
activations allows blood clotting to occur rapidly in response to injury. Seven of the 
clotting factors in their active form are serine proteases: kallikrein, XII,, XI,, IXa, 
VIL, Xa, and thrombin. Two routes to blood clot formation exist. The intrinsic 
pathway is instigated when the blood comes into physical contact with abnormal 
surfaces caused by injury; the extrinsic pathway is initiated by factors released from 
injured tissues. The pathways merge at factor X and culminate in clot formation. 
Thrombin excises peptides rich in negative charge from fibrinogen, converting it to 
fibrin, a molecule with a different surface charge distribution. Fibrin readily aggre- 
gates into ordered fibrous arrays that are subsequently stabilized by covalent 
crosslinks. Thrombin specifically cleaves Arg—Gly peptide bonds and is homologous 
to trypsin, which is also a serine protease (recall that trypsin acts only at Arg and Lys 
residues). 


Isozymes Are Enzymes with Slightly Different Subunits 


A number of enzymes exist in more than one quaternary form, differing in their 
relative proportions of structurally equivalent but catalytically distinct polypeptide 
subunits. A classic example is mammalian lactate dehydrogenase (LDH), which 
exists as five different isozymes, depending on the tetrameric association of two 
different subunits, A and B: Ay, AB, AoBo, ABs, and By (Figure 15.5). The kinetic 
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FIGURE 15.4 The cascade of activation steps leading to 
blood clotting. The intrinsic and extrinsic pathways con- 
verge at factor X, and the final common pathway involves 
the activation of thrombin and its conversion of fibrino- 
gen into fibrin, which aggregates into ordered filamen- 
tous arrays that become crosslinked to form the clot. 
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CENGAGENOW”™ ACTIVE FIGURE 15.5 The isozymes of 
actate dehydrogenase (LDH). Active muscle tissue 
becomes anaerobic and produces pyruvate from glucose 
via glycolysis (see Chapter 18). It needs LDH to regenerate 
AD* from NADH so that glycolysis can continue. The 
actate produced is released into the blood. The muscle 
LDH isozyme (A4) works best in the NAD*-regenerating 
direction. Heart tissue is aerobic and uses lactate as a fuel, 
converting it to pyruvate via LDH and using the pyruvate 
o fuel the citric acid cycle to obtain energy. The heart 
LDH isozyme (B,) is inhibited by excess pyruvate so that 
he fuel won't be wasted. Test yourself on the con- 
cepts in this figure at www.cengage.com/login. 
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FIGURE 15.6 Sigmoid v versus [S] plot. The dotted line 
represents the hyperbolic plot characteristic of normal 
Michaelis-Menten-type enzyme kinetics. 
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properties of the various LDH isozymes differ in terms of their relative affinities 
for the various substrates and their sensitivity to inhibition by product. Different 
tissues express different isozyme forms, as appropriate to their particular meta- 
bolic needs. By regulating the relative amounts of A and B subunits they synthe- 
size, the cells of various tissues control which isozymic forms are likely to assemble 
and thus which kinetic parameters prevail. 


152 What Are the General Features 
of Allosteric Regulation? 


Allosteric regulation acts to modulate enzymes situated at key steps in metabolic 
pathways. Consider as an illustration the following pathway, where A is the precur- 
sor for formation of an end product, F, in a sequence of five enzyme-catalyzed 
reactions: 


enz 1 enz 2 enz 3 enz 4 enz 5 


A— > B —> C—> D—> E—> F 





In this scheme, F symbolizes an essential metabolite, such as an amino acid or a nu- 
cleotide. In such systems, F, the essential end product, inhibits enzyme 1, the first step 
in the pathway. Therefore, when sufficient F is synthesized, it blocks further synthe- 
sis of itself. This phenomenon is called feedback inhibition or feedback regulation. 


Regulatory Enzymes Have Certain Exceptional Properties 


Enzymes such as enzyme 1, which are subject to feedback regulation, represent a 
distinct class of enzymes, the regulatory enzymes. As a class, these enzymes have cer- 
tain exceptional properties: 


1. Their kinetics do not obey the Michaelis-Menten equation. Their v versus [S] plots 
yield sigmoid- or S-shaped curves rather than rectangular hyperbolas (Figure 
15.6). Such curves suggest a second-order (or higher) relationship between v and 
[S]; that is, vis proportional to [S]”, where n > 1. A qualitative description of the 
mechanism responsible for the S-shaped curves is that binding of one S to a pro- 
tein molecule makes it easier for additional substrate molecules to bind to the 
same protein molecule. In the jargon of allostery, substrate binding is cooperative. 

2. Inhibition of a regulatory enzyme by a feedback inhibitor does not conform to 
any normal inhibition pattern, and the feedback inhibitor F bears little structural 
similarity to A, the substrate for the regulatory enzyme. F apparently acts at 
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a binding site distinct from the substrate-binding site. The term allosteric is apt, 
because F is sterically dissimilar and, moreover, acts at a site other than the site 
for S. Its effect is called allosteric inhibition. 

3. Regulatory or allosteric enzymes like enzyme 1 are, in some instances, regulated 
by activation. That is, whereas some effector molecules such as F exert negative 
effects on enzyme activity, other effectors show stimulatory, or positive, influ- 
ences on activity. 

4, Allosteric enzymes typically have an oligomeric organization. They are com- 
posed of more than one polypeptide chain (subunit), and each subunit has a 
binding site for substrate, as well as a distinct binding site for allosteric effec- 
tors. Thus, allosteric enzymes typically have more than one S-binding site and 
more than one effector-binding site per enzyme molecule. 

5. The working hypothesis is that, by some means, interaction of an allosteric en- 
zyme with effectors alters the distribution of conformational possibilities or sub- 
unit interactions available to the enzyme. That is, the regulatory effects exerted 
on the enzyme’s activity are achieved by conformational changes occurring in 
the protein when effector metabolites bind. 


In addition to enzymes, noncatalytic proteins may exhibit many of these prop- 
erties; hemoglobin is the classic example. The allosteric properties of hemoglo- 
bin are the subject of a Special Focus at the end of this chapter. 


15:38 Can Allosteric Regulation Be Explained 
by Conformational Changes in Proteins? 


The Symmetry Model for Allosteric Regulation Is Based 
on Two Conformational States for a Protein 


Various models have been proposed to account for the behavior of allosteric pro- 
teins. All of them note that proteins can exist in different conformational states. 
Models usually propose a small number of conformations (two or, at most, three) 
for a given protein. For example, the model for allosteric behavior of Jacques 
Monod, Jeffries Wyman, and Jean-Pierre Changeux (the MWC model) proposes two 
conformational states for an allosteric protein: the R (relaxed) state and the T 
(taut) state. The MWC model is sometimes referred to as the symmetry model be- 
cause all subunits in an oligomer are assumed to have the same conformation, 
whether it is R or T. R-state and T-state protein molecules are in equilibrium, with 
the T conformation greatly favored over the R ([T] > [R]), under conditions in 
which no ligands are present. This model further suggests that substrate and al- 
losteric activators (positive effectors) bind only to the R state and allosteric in- 
hibitors (negative effectors) bind only to the T state. Figure 15.7 illustrates such a 
model for a dimeric protein, each monomer of which has a substrate-binding site 
and an effector-binding site. Because substrate (S) binds only to the R state, S bind- 
ing perturbs the R <= T equilibrium in favor of more R-state conformers and thus 
more S binding. That is, S binding is cooperative. The concentration of ligand giv- 
ing half-maximal response is defined as Ko5. (Like Km, the units of Ko, are molar- 
ity; Km cannot be used to describe these constants, because the protein does not 
conform to the Michaelis-Menten model for enzyme kinetics.) The MWC model 
accounts for the action of allosteric effectors. Positive effectors bind only to the 
R state and thus cause a shift of the R <= T equilibrium in favor of more R and 
thus easier S binding. Negative effectors do the opposite; they perturb the R ==> T 
equilibrium in favor of T, the conformation that cannot bind S. Note that positive 
effectors (allosteric activators) cause a decline in the Ky; for S (signifying easier 
binding of S) and negative effectors raise Kos for S (Figure 15.7). Note that the 
MWC model assumes an equilibrium between conformational states, but ligand 
binding does not alter the conformation of the protein. 
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CENGAGENOW” ACTIVE FIGURE 15.7 Allosteric effects: A and | binding to R and T, respectively. The linked 
equilibria lead to changes in the relative amounts of R and T and, therefore, shifts in the substrate saturation 
curve. The parameters of such a system are that (1) S and A (or |) have different affinities for R and T and (2) A 
(or |) modifies the apparent Kas for S by shifting the relative R versus T population. Test yourself on the con- 
cepts in this figure at www.cengage.com/login. 


The Sequential Model for Allosteric Regulation Is Based 
on Ligand-Induced Conformational Changes 


An alternative model proposed by Daniel Koshland, George Nemethy, and David 
Filmer (the KNF model) relies on the well-accepted idea that ligand binding trig- 
gers a change in the conformation of a protein. And, if the protein is oligomeric, 
ligand-induced conformational changes in one subunit may lead to changes in the 
conformation of its neighbors. Such ligand-induced conformational change could 
cause the subunits of an oligomeric protein to shift from a low-affinity state to a 
high-affinity state. For example, S binding to one monomer may cause the other 
monomers to adopt conformations with higher affinity for S (Figure 15.8). Inter- 
estingly, the KNF model also explains how ligand-induced conformational changes 
could cause subunits of a protein to adopt conformations with little or no affinity 
for the ligand, a phenomenon referred to as negative cooperativity. The KNF model 
is termed the sequential model because subunits undergo sequential changes in 
conformation due to ligand binding. A comparison of the response of velocity 
to substrate concentration for positive versus negative cooperativity is shown in 
Figure 15.8c. 


Changes in the Oligomeric State of a Protein Can Also Give 
Allosteric Behavior 


Although the MWC and KNF models are the best-known paradigms for allosteric 
protein behavior, other models have been put forward. For example, instead of R 
and T, consider a monomer-oligomer equilibrium for an allosteric protein, where 
only the oligomer binds S and [monomer] >> [ oligomer]. This model strongly 
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FIGURE 15.8 The Koshland-Nemethy-Filmer sequential model for allosteric behavior. (a) S binding can, by 
induced fit, cause a conformational change in the subunit to which it binds. (b) If subunit interactions are 
tightly coupled, binding of S to one subunit may cause the other subunit to assume a conformation having 
a greater or lesser affinity for S. That is, the ligand-induced conformational change in one subunit can affect 
the adjoining subunit. Such effects could be transmitted between neighboring peptide domains by chang- 
ing alignments of nonbonded amino acid residues. (c) Theoretical curves for the binding of a ligand to a pro- 
tein having four identical subunits, each with one binding site for the ligand. The fraction of maximal binding is 
plotted as a function of [S]/Kos. 


resembles the MWC model. In yet another model, we might have a monomeric pro- 
tein with distinct binding sites for several different ligands. In this case, binding of 
ligand A to its site might cause a conformational change such that the protein shows 
much greater affinity for S than it would in the absence of A. Or, binding of ligand 
I might result in a conformational change in the protein such that its affinity for S 
is abolished. Although the binding of other ligands may affect the affinity of the 
monomer for S, S binding cannot show cooperativity in monomeric proteins, be- 
cause, unlike oligomers, the monomer has only one binding site for S. 

It is important to realize that all of these various models are attempts to use sim- 
ple concepts to explain the complex behavior of a protein. Although these models 
provide reasonable approximations and useful insights, the molecular mechanisms 
underlying allostery cannot be expected to conform rigidly to any one of these mod- 
els. Shortly, we explore the regulated behavior of a real protein (glycogen phos- 
phorylase) with these models in mind. 


15e45 What Kinds of Covalent Modification Regulate 
the Activity of Enzymes? 


Covalent Modification Through Reversible Phosphorylation 


As we saw in Figure 15.1, enzyme activity can be regulated through reversible phos- 
phorylation; indeed it is the most prominent form of covalent modification in cel- 
lular regulation. Phosphorylation is accomplished by protein kinases that target spe- 
cific enzymes for modification. Phosphoprotein phosphatases operate in the 
reverse direction to remove the phosphate group through hydrolysis of the side- 
chain phosphoester bond. Because protein kinases and phosphoprotein phos- 
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TABLE 15.2 0 








Protein Kinase Class Target Sequence* Activators 
I. Ser/Thr protein kinases 

A. Cyclic nucleotide-dependent 
cAMP-dependent (PKA) —R(R/K) X(S*/T*)— cAMP 
cGMP-dependent —(R/K) KKX (S*/T*) — cGMP 

B. Ca?*-calmodulin (CaM)-—dependent 
Phosphorylase kinase (PhK) —KRKQIS* VRGL— Phosphorylation by PKA 
Myosin light-chain kinase (MLCK) —KKRPORATS*NV— Ca?*-CaM 

C. Protein kinase C (PKC) Ca?", diacylglycerol 

D. Mitogen-activated protein kinases —PXX(S*/T*)P— Phosphorylation 
(MAP kinases) by MAPK kinase 

E. G-protein-coupled receptors 
f-Adrenergic receptor kinase (BARK) 
Rhodopsin kinase 

II. Ser/Thr/Tyr protein kinases 
MAP kinase kinase (MAPK kinase) —TEY— Phosphorylation by 
Raf (a protein kinase) 
HI. Tyr protein kinases 

A. Cytosolic tyrosine kinases (src, fgr, abl, etc.) 

B. Receptor tyrosine kinases (RTKs) 
Plasma membrane receptors for hormones 
such as epidermal growth factor (EGF) or 
platelet-derived growth factor (PDGF) 








*X denotes any amino acid. 
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FIGURE 15.9 Protein kinase A is shown complexed with 
a pseudosubstrate peptide (orange). This complex also 
includes ATP (red) and two Mn?* ions (yellow) bound at 
the active site (pdb id = 1ATP). 


phatases work in opposing directions, regulation must be imposed on these con- 
verter enzymes so that their interconvertible enzyme targets are locked in the de- 
sired state (active versus inactive) and a wasteful cycle of ATP hydrolysis is avoided. 
Thus, converter enzymes are themselves the targets of allosteric regulation or cova- 
lent modification. 


Protein Kinases: Target Recognition and Intrasteric Control 


Protein kinases are converter enzymes that catalyze the ATP-dependent phosphoryla- 
tion of serine, threonine, or tyrosine hydroxyl groups in target proteins (Table 15.2). 
Phosphorylation introduces a bulky group bearing two negative charges, causing con- 
formational changes that alter the target protein’s function. (Unlike a phosphoryl 
group, no amino acid side chain can provide two negative charges.) Protein kinases 
represent a protein superfamily whose members are widely diverse in terms of size, 
subunit structure, and subcellular localization. Nevertheless, all share a common cat- 
alytic mechanism based on a conserved catalytic core/kinase domain of approximately 
260 amino acid residues (Figure 15.9). Protein kinases are classified as Ser/Thr 
and/or Tyr specific. They also differ in terms of the target proteins that they recognize 
and phosphorylate; target selection depends on the presence of an amino acid se- 
quence within the target protein that is recognized by the kinase. For example, cAMP- 
dependent protein kinase (PKA) phosphorylates proteins having Ser or Thr residues 
within an R(R/K) X(S*/T*) target consensus sequence (* denotes the residue that be- 
comes phosphorylated). That is, PKA phosphorylates Ser or Thr residues that occur in 
an Arg-(Arg or Lys)-(any amino acid)-(Ser or Thr) sequence segment (Table 15.2). 
Targeting of protein kinases to particular consensus sequence elements within pro- 
teins creates a means to regulate these kinases by intrasteric control. Intrasteric con- 
trol occurs when a regulatory subunit (or protein domain) has a pseudosubstrate 
sequence that mimics the target sequence but lacks an OH-bearing side chain at the 
right place. For example, the cAMP-binding regulatory subunits of PKA (R subunits 
in Figure 15.10) possess the pseudosubstrate sequence RRGA*I, and this sequence 
binds to the active site of PKA catalytic subunits, blocking their activity. This pseudo- 
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cAMP cAMP CENGAGENOW” ANIMATED FIGURE 15.10 Cyclic 
AMP-dependent protein kinase (also known as 


| 
PKA) is a 150- to 170-kD RG; tetramer in mam- 
+ cAMP < +2 malian cells. The two R (regulatory) subunits bind 
cAMP (Kp = 3 X 1078 M); cAMP binding releases 


the R subunits from the C (catalytic) subunits. 


cAMP cAMP C subunits are enzymatically active as monomers. 
R,C, See this figure animated at www.cengage.com/ 
inactive R,-(cAMP) , login. 


substrate sequence has an alanine residue where serine occurs in the PKA target se- 
quence; Ala is sterically similar to serine but lacks a phosphorylatable OH group. 
When these PKA regulatory subunits bind cAMP, they undergo a conformational 
change and dissociate from the catalytic (C) subunits, and the active site of PKA is free 
to bind and phosphorylate its targets. In other protein kinases, the pseudosubstrate 
sequence involved in intrasteric control and the kinase domain are part of the same 
polypeptide chain. In these cases, binding of an allosteric effector (like cAMP) in- 
duces a conformational change in the protein that releases the pseudosubstrate se- 
quence from the active site of the kinase domain. 

The abundance of many protein kinases in cells is an indication of the great impor- 
tance of protein phosphorylation in cellular regulation. Exactly 113 protein kinase 
genes have been recognized in yeast, and 868 putative protein kinase genes have been 
identified in the human genome. Tyrosine kinases (protein kinases that phosphorylate 
Tyr residues) occur only in multicellular organisms (yeast has no tyrosine kinases). 
Tyrosine kinases are components of signaling pathways involved in cell-cell communi- 
cation (see Chapter 32). 


Phosphorylation Is Not the Only Form of Covalent Modification 
That Regulates Protein Function 


Several hundred different chemical modifications of proteins have been dis- 
covered thus far, ranging from carboxylation (addition of a carboxyl group), 
acetylation (addition of an acetyl group, see Figure 29.30), prenylation (see 
Figure 9.23), and glycosylation (see Figures 7.32-7.39) to covalent attachment 
of a polypeptide to the protein (addition of ubiquitin to free amino groups on 
proteins; see Figure 31.8), to name just a few. A compilation of known protein 
modifications can be found in RESID, the European Bioinformatics Institute 
online database (http://www. ebi.ac.uk/RESID/). Only a small number of these co- 
valent modifications are used to achieve metabolic regulation through reversible 
conversion of an enzyme between active and inactive forms. Table 15.3 presents a few 
examples. 





















Reaction Amino Acid Side Chain Reaction (see figure indicated) 

Adenylylation Tyrosine Transfer of AMP from ATP to Tyr-OH 
(Figure 25.16) 

Uridylylation Tyrosine Transfer of UMP from UTP to TyrOH 
(Figure 25.17) 

ADP-ribosylation Arginine Transfer of ADP-ribose from NAD* to 
Arg (Figure 25.8) 

Methylation Glutamate Transfer of methyl group from 
S-adenosylmethionine to Glu 
y-carboxyl group 

Oxidation-reduction Cysteine (disulfide) Reduction of Cys-S—S-Cys to 
Cys-SH HS-Cys (Figure 21.27) 
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FIGURE 15.11 The glycogen phosphorylase reaction. 


FIGURE 15.12 The phosphoglucomutase reaction. 


Note that three of these types of covalent modification require nucleoside triphos- 
phates (ATP, UTP) that are related to cellular energy status; another relies on re- 
ducing potential within the cell, which also reflects cellular energy status. 


A5 Is the Activity of Some Enzymes Controlled by Both 
Allosteric Regulation and Covalent Modification? 


Glycogen phosphorylase, the enzyme that catalyzes the release of glucose units from 
glycogen, serves as an excellent example of the many enzymes regulated both by al- 
losteric controls and by covalent modification. 


The Glycogen Phosphorylase Reaction Converts Glycogen 
into Readily Usable Fuel in the Form of Glucose-1-Phosphate 


The cleavage of glucose units from the nonreducing ends of glycogen molecules is 
catalyzed by glycogen phosphorylase, an allosteric enzyme. The enzymatic reaction 
involves phosphorolysis of the bond between C-1 of the departing glucose unit and 
the glycosidic oxygen, to yield glucose-1-phosphate and a glycogen molecule that is 
shortened by one residue (Figure 15.11). (Because the reaction involves attack by 
phosphate instead of HO, it is referred to as a phosphorolysis rather than a hy- 
drolysis.) Phosphorolysis produces a phosphorylated sugar product, glucose-1-P, 
which is converted to the glycolytic substrate, glucose-6-P, by phosphoglucomutase 
(Figure 15.12). In muscle, glucose-6-P proceeds into glycolysis, providing needed 
energy for muscle contraction. In the liver, hydrolysis of glucose-6-P yields glucose, 
which is exported to other tissues via the circulatory system. 


Glycogen Phosphorylase Is a Homodimer 


Muscle glycogen phosphorylase is a dimer of two identical subunits (842 residues, 
97.44 kD). Each subunit contains an active site (at the center of the subunit) and an 
allosteric effector site near the subunit interface (Figure 15.13). In addition, a regula- 
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AMP at allosteric 
effector site 


Glycogen- 
binding site T 


Pyridoxal-P at 
catalytic site 


FIGURE 15.13 Structure of the glycogen phosphorylase monomer (pdb id = 8GPB). 


tory phosphorylation site is located at Ser!* on each subunit. A glycogen-binding site 
on each subunit facilitates prior association of glycogen phosphorylase with its sub- 
strate and also exerts regulatory control on the enzymatic reaction. 

Each subunit contributes a tower helix (residues 262 to 278) to the subunit- 
subunit contact interface in glycogen phosphorylase. In the phosphorylase dimer, 
the tower helices extend from their respective subunits and pack against each other 
in an antiparallel manner. 


Glycogen Phosphorylase Activity Is Regulated Allosterically 


Muscle Glycogen Phosphorylase Shows Cooperativity in Substrate Binding The 
binding of the substrate inorganic phosphate (P;) to muscle glycogen phosphorylase is 
highly cooperative (Figure 15.14a), which allows the enzyme activity to increase 
markedly over a rather narrow range of substrate concentration. 


(a) (b) (c) 


+ AMP 


+ ATP or glucose-6-P 





[Pi] [Pi] [Pi] 


FIGURE 15.14 v versus S curves for glycogen phosphorylase. (a) The response to the concentration of the sub- 
strate phosphate (P;). (b) ATP and glucose-6-P are feedback inhibitors. (c) AMP is a positive effector. It binds at 
the same site as ATP. 
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CENGAGENOW” ACTIVE FIGURE 15.15 The mechanism 
of covalent modification and allosteric regulation of 
glycogen phosphorylase. Test yourself on the con- 
cepts in this figure at www.cengage.com/login. 


ATP and Glucose-6-P Are Allosteric Inhibitors of Glycogen Phosphorylase ATP 
can be viewed as the “end product” of glycogen phosphorylase action, in that the 
glucose-1-P liberated by glycogen phosphorylase is degraded in muscle via metabolic 
pathways whose purpose is energy (ATP) production. Glucose-1-P is readily converted 
into glucose-6-P to feed such pathways. (In the liver, glucose-1-P from glycogen is con- 
verted to glucose and released into the bloodstream to raise blood glucose levels.) 
Thus, feedback inhibition of glycogen phosphorylase by ATP and glucose-6-P pro- 
vides a very effective way to regulate glycogen breakdown. Both ATP and glucose-6-P 
act by decreasing the affinity of glycogen phosphorylase for its substrate P; (Figure 
15.14b). Because the binding of ATP or glucose-6-P has a negative effect on substrate 
binding, these substances act as negative effectors. Note in Figure 15.14b that the sub- 
strate saturation curve is displaced to the right in the presence of ATP or glucose-6-P, 
and a higher substrate concentration is needed to achieve half-maximal velocity 
(Vinax/2). When concentrations of ATP or glucose-6-P accumulate to high levels, 
glycogen phosphorylase is inhibited; when [ATP] and [glucose-6-P] are low, the ac- 
tivity of glycogen phosphorylase is regulated by availability of its substrate, P;. 


AMP Is an Allosteric Activator of Glycogen Phosphorylase AMP also provides a reg- 
ulatory signal to glycogen phosphorylase. It binds to the same site as ATP, but it stim- 
ulates glycogen phosphorylase rather than inhibiting it (Figure 15.14c). AMP acts as a 
positive effector, meaning that it enhances the binding of substrate to glycogen phos- 
phorylase. Significant levels of AMP indicate that the energy status of the cell is low and 
that more energy (ATP) should be produced. Reciprocal changes in the cellular con- 
centrations of ATP and AMP and their competition for binding to the same site (the 
allosteric site) on glycogen phosphorylase, with opposite effects, allow these two nucleo- 
tides to exert rapid and reversible control over glycogen phosphorylase activity. Such reci- 
procal regulation ensures that the production of energy (ATP) is commensurate with 
cellular needs. 

To summarize, muscle glycogen phosphorylase is allosterically activated by AMP 
and inhibited by ATP and glucose-6-P; caffeine can also act as an allosteric inhibitor 
(Figure 15.15). When ATP and glucose-6-P are abundant, glycogen breakdown is 
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inhibited. When cellular energy reserves are low (i.e., high [AMP] and low [ATP] 
and [G-6-P]), glycogen catabolism is stimulated. 

Glycogen phosphorylase conforms to the MWC model of allosteric transitions, 
with the active form of the enzyme designated the R state and the inactive form de- 
noted as the T state (Figure 15.15). Thus, AMP promotes the conversion to the ac- 
tive R state, whereas ATP, glucose-6-P, and caffeine favor conversion to the inactive 
T state. 

X-ray diffraction studies of glycogen phosphorylase in the presence of allosteric ef- 
fectors have revealed the molecular basis for the T=—=R conversion. Although the 
structure of the central core of the phosphorylase subunits is identical in the T and R 
states, a significant change occurs at the subunit interface between the T and R states. 
This conformation change at the subunit interface is linked to a structural change at 
the active site that is important for catalysis. In the T state, the negatively charged car- 
boxyl group of Asp**° faces the active site, so binding of the anionic substrate phos- 
phate is unfavorable. In the conversion to the R state, Asp**? is displaced from the ac- 
tive site and replaced by Argř®. The exchange of negatively charged aspartate for 
positively charged arginine at the active site provides a favorable binding site for phos- 
phate anion. These allosteric controls serve as a mechanism for adjusting the activity of 
glycogen phosphorylase to meet normal metabolic demands. However, in crisis situa- 
tions in which abundant energy (ATP) is needed immediately, these controls can be 
overridden by covalent modification of glycogen phosphorylase. Covalent modification 
through phosphorylation of Ser! in glycogen phosphorylase converts the enzyme from 
a less active, allosterically regulated form (the )form) to a more active, allosterically un- 
responsive form (the a form). 
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FIGURE 15.16 The major conformational change that 
occurs in the N-terminal residues upon phosphorylation 
of Ser!4. Ser'4 is shown in red. N-terminal conformation 


of unphosphorylated enzyme (phosphorylase b): yellow; 
N-terminal conformation of phosphorylated enzyme 
(phosphorylase a): cyan. (Molecular graphic created 
from pdb id = 8GPB and pdb id = 1GPA.) 
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Covalent Modification of Glycogen Phosphorylase Trumps 
Allosteric Regulation 


As early as 1938, it was known that glycogen phosphorylase existed in two forms: the 
less active phosphorylase b and the more active phosphorylase a. In 1956, Edwin 
Krebs and Edmond Fischer reported that a “converting enzyme” could convert 
phosphorylase b to phosphorylase a. Three years later, Krebs and Fischer demon- 
\ strated that the conversion of phosphorylase bto phosphorylase a involved covalent 
phosphorylation, as shown in Figure 15.15. 

Phosphorylation of Ser!* causes a dramatic conformation change in phosphorylase. 
Upon phosphorylation, the amino-terminal end of the protein (including residues 10 
through 22) swings through an arc of 120°, moving into the subunit interface (Figure 
15.16). This conformation change moves Ser! by more than 3.6 nm. The phosphory- 
lated or aform of glycogen phosphorylase is much less sensitive to allosteric regulation 
than the ) form. Thus, covalent modification of glycogen phosphorylase converts this 
enzyme from an allosterically regulated form into a persistently active form. Covalent 
modification overrides the allosteric regulation. 

Dephosphorylation of glycogen phosphorylase is carried out by phosphoprotein 
phosphatase 1. The action of phosphoprotein phosphatase 1 inactivates glycogen 
phosphorylase. The 1992 Nobel Prize in Physiology or Medicine was awarded to Krebs 
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Enzyme Cascades Regulate Glycogen Phosphorylase Covalent 


Modification 


The phosphorylation reaction that activates glycogen phosphorylase is mediated by 
an enzyme cascade (Figure 15.17). The first part of the cascade leads to hormonal 
stimulation (described in the next section) of adenylyl cyclase, a membrane-bound 
enzyme that converts ATP to adenosine-3’ ,5’-cyclic monophosphate, denoted as cyclic 
AMP or simply cAMP (Figure 15.18). This regulatory molecule is found in all eu- 
karyotic cells and acts as an intracellular messenger molecule, controlling a wide va- 
riety of processes. Cyclic AMP is known as a second messenger because it is the in- 
tracellular agent of a hormone (the “first messenger”). (The myriad cellular roles 
Slow GTPase activity of G, of cyclic AMP are described in detail in Chapter 32.) 

i hydrolyzes GTP to GDP The hormonal stimulation of adenylyl cyclase is effected by a transmembrane sig- 
naling pathway consisting of three components, all membrane associated. Binding 
of hormone to the external surface of a hormone receptor causes a conformational 
change in this transmembrane protein, which in turn stimulates a GTP-binding 
protein (abbreviated G protein). G proteins are heterotrimeric proteins consisting of 
a- (45-47 kD), B- (35 kD), and y- (7-9 kD) subunits. The a-subunit binds GDP or 
GTP and has an intrinsic, slow GTPase activity. In the inactive state, the Gag complex 
has GDP at the nucleotide site. When a G protein is stimulated by a hormone- 
receptor complex, GDP dissociates and GTP binds to G,, causing it to dissociate 
from Gg, and to associate with adenylyl cyclase (Figure 15.19). Binding of Ga (GTP) 
activates adenylyl cyclase to form cAMP from ATP. However, the intrinsic GTPase activity 
GAEDE) dissociates from of G, eventually hydrolyzes GTP to GDP, leading to dissociation of G, (GDP) from 
adenylyl cyclase and adenylyl cyclase and reassociation with Gg, to form the inactive Gag, complex. This 
returns to Ggy cascade amplifies the hormonal signal because a single hormone-receptor complex 
can activate many G proteins before the hormone dissociates from the receptor, and 
because the G,-activated adenylyl cyclase can synthesize many cAMP molecules be- 
fore bound GTP is hydrolyzed by G,. More than 100 different G-protein—coupled re- 
ceptors and at least 21 distinct G, proteins are known (see Chapter 32). 











Inactive FIGURE 15.19 Hormone binding to its receptor leads via G-protein activation to cAMP synthesis. Adenylyl 
adenylyl cyclase and the hormone receptor are integral plasma membrane proteins; Gy and Gg, are membrane- 
cyclase anchored proteins. 


G protein 


Cyclic AMP is an essential activator of cAMP-dependent protein kinase (PKA). Bind- 
ing of cyclic AMP to the regulatory subunits induces a conformation change that 
causes the dissociation of the C monomers from the R dimer (Figure 15.10). The 
free C subunits are active and can phosphorylate other proteins. One of the many 
proteins phosphorylated by PKA is phosphorylase kinase (Figure 15.17). Phosphory- 
lase kinase is inactive in the unphosphorylated state and active in the phosphory- 
lated form. As its name implies, phosphorylase kinase functions to phosphorylate 
(and activate) glycogen phosphorylase. Thus, hormonal activation of adenylyl cy- 
clase leads to activation of glycogen breakdown. 


Special Focus 





Is There an Example in Nature That Exemplifies the Relationship 
Between Quaternary Structure and the Emergence of Allosteric 
Properties? Hemoglobin and Myoglobin—Paradigms of Protein 
Structure and Function 


Ancient life forms evolved in the absence of oxygen and were capable only of anaer- 
obic metabolism. As the earth’s atmosphere changed over time, so too did living 
things. Indeed, the production of O by photosynthesis was a major factor in alter- 
ing the atmosphere. Evolution to an oxygen-based metabolism was highly beneficial. 
Aerobic metabolism of sugars, for example, yields far more energy than correspond- 
ing anaerobic processes. Two important oxygen-binding proteins appeared in the 
course of evolution so that aerobic metabolic processes were no longer limited by the 
solubility of Os in water. These proteins are represented in animals as hemoglobin 
(Hb) in blood and myoglobin (Mb) in muscle. Because hemoglobin and myoglobin 
are two of the most-studied proteins in nature, they have become paradigms of pro- 
tein structure and function. Moreover, hemoglobin is a model for protein quater- 
nary structure and allosteric function. The binding of Og by hemoglobin, and its 
modulation by effectors such as protons, COs, and 2,3-bisphosphoglycerate, depend 
on interactions between subunits in the Hb tetramer. Subunit-subunit interactions 
in Hb reveal much about the functional significance of quaternary associations and 
allosteric regulation. 


The Comparative Biochemistry of Myoglobin and Hemoglobin 
Reveals Insights into Allostery 


A comparison of the properties of hemoglobin and myoglobin offers insights into 
allosteric phenomena, even though these proteins are not enzymes. Hemoglobin 
displays sigmoid-shaped Os-binding curves (Figure 15.20). The unusual shape of 
these curves was once a great enigma in biochemistry. Such curves closely resemble 
allosteric enzyme: substrate saturation graphs (see Figure 15.6). In contrast, myo- 
globin’s interaction with oxygen obeys classical Michaelis-Menten-type substrate 
saturation behavior. 

Before examining myoglobin and hemoglobin in detail, let us first encapsulate the 
lesson: Myoglobin is a compact globular protein composed of a single polypeptide 
chain 153 amino acids in length; its molecular mass is 17.2 kD (Figure 15.21). It con- 
tains heme, a porphyrin ring system complexing an iron ion, as its prosthetic group 
(Figure 15.22). Oxygen binds to Mb via its heme. Hemoglobin (Hb) is also a compact 
globular protein, but Hb is a tetramer. It consists of four polypeptide chains, each of 
which is very similar structurally to the myoglobin polypeptide chain, and each bears 
a heme group. Thus, a hemoglobin molecule can bind four Os molecules. In adult 
human Hb, there are two identical chains of 141 amino acids, the a-chains, and two 
identical B-chains, each of 146 residues. The human Hb molecule is an aBe-type 
tetramer of molecular mass 64.45 kD. 
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FIGURE 15.20 © -binding curves for hemoglobin and 
myoglobin. 





Hemoglobin (Hb) 


FIGURE 15.21 The myoglobin (pdb id = 2MM1) and 
hemoglobin (pdb id = 2HHB) molecules. 


FIGURE 15.22 Heme is formed when protoporphyrin IX 
binds Fe?*. 
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Partial pressure of oxygen (pO, torr) 


The myoglobin polypeptide chain and the a- and -chains of hemoglobin are com- 
posed of 8 a-helical segments denoted by the letters A through H. The short, un- 
ordered regions that connect the helices are named for the segments they connect, 
as in the AB region or the EF region. In an amino acid numbering system unique to 
globin chains, successive residues in the helices are numbered, such as the histidine 
at position 8 in the F helix, known as His F8. 

The tetrameric nature of Hb is crucial to its biological function: When a molecule 
of O2 binds to a heme in Hb, the heme Fe ion is drawn into the plane of the porphyrin ring. 
This slight movement sets off a chain of conformational events that are transmitted to adja- 
cent subunits, dramatically enhancing the affinity of their heme groups for Oo. That is, the 
binding of Os to one heme of Hb makes it easier for the Hb molecule to bind ad- 
ditional equivalents of O. Hemoglobin is a marvelously constructed molecular ma- 
chine. Let us dissect its mechanism, beginning with its monomeric counterpart, the 
myoglobin molecule. 


Myoglobin Is an Oxygen-Storage Protein 


Myoglobin is the oxygen-storage protein of muscle. The muscles of diving mammals 
such as seals and whales are especially rich in this protein, which serves as a store 
for Os during the animal’s prolonged periods underwater. Myoglobin is abundant 
in skeletal and cardiac muscle of nondiving animals as well. Myoglobin is the cause 
of the characteristic red color of muscle. 
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0, Binds to the Mb Heme Group 


Iron prefers to interact with six ligands, four of which share a common plane. The 
fifth and sixth ligands lie above and below this plane (see Figure 15.23). In heme, four 
of the ligands are provided by the nitrogen atoms of the four pyrroles. A fifth ligand 
is donated by the imidazole side chain of amino acid residue His F8. When myoglo- 
bin binds O% to become oxymyoglobin, the O molecule adds to the heme iron ion as 
the sixth ligand (Figure 15.23). O adds end on to the heme iron, but it is not ori- 
ented perpendicular to the plane of the heme. Rather, it is tilted about 60° with re- 
spect to the perpendicular. 


0, Binding Alters Mb Conformation 


What happens when the heme group of myoglobin binds oxygen? X-ray crystallog- 
raphy has revealed that a crucial change occurs in the position of the iron atom rel- 
ative to the plane of the heme. In deoxymyoglobin, the ferrous ion actually lies 
0.055 nm above the plane of the heme, in the direction of His F8. The iron- 
porphyrin complex is therefore dome-shaped. When Oy binds, the iron atom is 
pulled back toward the porphyrin plane and is now displaced from it by only 
0.026 nm. The consequences of this small motion are trivial as far as the biological 
role of myoglobin is concerned. However, as we shall soon see, this slight movement 
profoundly affects the properties of hemoglobin. 


Cooperative Binding of Oxygen by Hemoglobin Has Important 
Physiological Significance 

The relative oxygen affinities of hemoglobin and myoglobin reflect their respective 
physiological roles (see Figure 15.20). Myoglobin, as an oxygen storage protein, has a 
greater affinity for O than hemoglobin at all oxygen pressures. Hemoglobin, as the 
oxygen carrier, becomes saturated with Oz in the lungs, where the partial pressure of 
Oz (pOs) is about 100 torr.' In the capillaries of tissues, POs is typically 40 torr, and oxy- 
gen is released from Hb. In muscle, some of it can be bound by myoglobin, to be 
stored for use in times of severe oxygen deprivation, such as during strenuous exercise. 


Hemoglobin Has an a@,/3, Tetrameric Structure 


As noted, hemoglobin is an af» tetramer. Each of the four subunits has a conforma- 
tion virtually identical to that of myoglobin (Figure 15.21). The subunits pack in a 
tetrahedral array, creating a roughly spherical molecule 6.4 X 5.5 X 5.0 nm. The four 
heme groups, nestled within the easily recognizable cleft formed between the E and F 
helices of each polypeptide, are exposed at the surface of the molecule. The heme 
groups are quite far apart; 2.5 nm separates the closest iron ions, those of hemes a 
and b», and those of hemes as and 81. The subunit interactions are mostly between dis- 
similar chains: Each of the a-chains is in contact with both B-chains, but there are few 
a—a or P-B interactions. 


Oxygenation Markedly Alters the Quaternary Structure of Hb 


Crystals of deoxyhemoglobin shatter when exposed to Os. Furthermore, X-ray crystal- 
lographic analysis reveals that oxyhemoglobin and deoxyhemoglobin differ markedly 
in quaternary structure. In particular, specific a6-subunit interactions change. The a8 
contacts are of two kinds. The a fı and aofo contacts involve helices B, G, and H and 
the GH corner. These contacts are extensive and important to subunit packing; they 
remain unchanged when hemoglobin goes from its deoxy to its oxy form. The af, 
and a6; contacts are called sliding contacts. They principally involve helices C and G 
and the FG corner (Figure 15.24). When hemoglobin undergoes a conformational 


The torr is a unit of pressure named for Torricelli, inventor of the barometer. One torr corresponds 
to 1 mm Hg (1/760th of an atmosphere). 
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FIGURE 15.23 The six liganding positions of an iron ion. 





FIGURE 15.24 Side view of one of the two @B-dimers in 
Hb, with a-B packing contacts indicated in blue. The 
sliding contacts made with the other dimer are shown 
in yellow. The changes in these sliding contacts are 
shown in Figure 15.25. (Illustration: Irving Geis. Rights owned 
by Howard Hughes Medical Institute. Not to be reproduced 


without permission.) 
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The Oxygen-Binding Curves of Myoglobin and Hemoglobin 


Myoglobin 
The reversible binding of oxygen to myoglobin, 
MbO.==Mb + Os 
can be characterized by the equilibrium dissociation constant, K. 
_ [Mb][O:] 


[MbOu] (15.1) 


If Y is defined as the fractional saturation of myoglobin with Os, 
that is, the fraction of myoglobin molecules having an oxygen mol- 
ecule bound, then 


[MbOz] 
i [MbOs] + [Mb] 





(15.2) 


The value of Y ranges from 0 (no myoglobin molecules carry an 
Oz) to 1.0 (all myoglobin molecules have an Os molecule bound). 
Substituting from Equation 15.1, ([Mb][O»])/K for [MbOs] gives 





[Mb] [Os] ( [O2] ) 
= K _ K = [Oz] (15.3) 
[pP + [Mb]) L +1) [Oe] + K 


and, if the concentration of Oy is expressed in terms of the partial 
pressure (in torr) of oxygen gas in equilibrium with the solution 
of interest, then 


pOrs 


Y= pO, + K (15.4) 


(In this form, Khas the units of torr.) The relationship defined by 
Equation 15.4 plots as a hyperbola. That is, the MbO» saturation 
curve resembles an enzyme: substrate saturation curve. For myo- 
globin, a partial pressure of 1 torr for pO» is sufficient for half- 
saturation (Figure 1). We can define P50 as the partial pressure of 
Oy» at which 50% of the myoglobin molecules have a molecule of 
Ov bound (that is, Y= 0.5), then 


pOs 


0.5 = —___| 
pO2 + Pso 


(15.5) 


(Note from Equation 15.1 that when [MbO,] = [Mb], K= [Ob], 
which is the same as saying when Y= 0.5, K= Ps.) The general 
equation for Os binding to Mb becomes 


(15.6) 





pO», torr 
A FIGURE1 Oxygen saturation curve for myoglobin in the form of Y ver- 
sus pOx showing Psp is at a pO% of 1 torr. 


The ratio of the fractional saturation of myoglobin, Y, to free 
myoglobin, 1 — Y, depends on pO, and Kaccording to the equation 
Y pO: 


ir K (15.7) 





Hemoglobin 
New properties emerge when four heme-containing polypeptides 
come together to form a tetramer. The Og-binding curve of hemo- 
globin is sigmoid rather than hyperbolic (see Figure 15.20), and 
Equation 15.4 does not describe such curves. Of course, each he- 
moglobin molecule has four hemes and can bind up to four oxygen 
molecules. Suppose for the moment that O, binding to hemoglobin 
is an “all-ornone” phenomenon, where Hb exists either free of Os 
or with four Os molecules bound. This supposition represents the 
extreme case for cooperative binding of a ligand by a protein with 
multiple binding sites. In effect, it says that if one ligand binds to the 
protein molecule, then all other sites are immediately occupied by 
ligand. Or, to say it another way for the case in hand, suppose that 
four Ox molecules bind to Hb simultaneously: 
Hb + 4 Os =Hb(Os)4 
Then the dissociation constant, K, would be 
[Hb] [O2]* ies 
(Hb(02)4] ele 


By analogy with Equation 15.4, the equation for fractional satura- 
tion of Hb is given by 
[pOs]* 


= IpO] + K TK (15.9) 


A plot of Y versus Os according to Equation 15.9 is presented in 
Figure 2. This curve has the characteristic sigmoid shape seen for 
Oy» binding by Hb. Half-saturation is set to be a pOs of 26 torr. 
Note that when pO» is low, the fractional saturation, Y, changes 


1.0 


Y 0.5 





a E E 
0 10 20 30 40 50 
pOds, torr 
Aà FIGURE2 Oxygen saturation curve for Hb in the form of Y versus pOs, 
assuming n = 4 and Ps) = 26 torr. The graph has the characteristic exper- 
imentally observed sigmoid shape. 
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The Oxygen-Binding Curves of Myoglobin and Hemoglobin (Continued) 


very little as pO» increases. The interpretation is that Hb has little 
affinity for O, at these low partial pressures of O. However, as 
pO» reaches some threshold value and the first Oo is bound, Y, the 
fractional saturation, increases rapidly. Note that the slope of the 
curve is steepest in the region where Y = 0.5. The sigmoid char- 
acter of this curve is diagnostic of the fact that the binding of O% 
to one site on Hb strongly enhances binding of additional Ox mol- 
ecules to the remaining vacant sites on the same Hb molecule, a 
phenomenon aptly termed cooperativity. (If each O» bound in- 
dependently, exerting no influence on the affinity of Hb for more 


pletely noninteracting, that is, if the binding of one Os to Hb had 
no influence on the binding of additional Os molecules to the 
same Hb, n would equal 1. Figure 3 compares these extremes. Ob- 
viously, the real situation falls between the extremes of n = 1 or 4. 
The qualitative answer is that Oy binding by Hb is highly coopera- 
tive, and the binding of the first O markedly enhances the 
binding of subsequent Ox molecules. However, this binding is not 
quite an all-ornone phenomenon. 


1.0 


O» binding, this plot would be hyperbolic.) 

The experimentally observed oxygen-binding curve for Hb 
does not fit the graph given in Figure 2 exactly. If we generalize 
Equation 15.9 by replacing the exponent 4 with n, we can write 
the equation as 





[p02]” 
= IpO + K (15.10) 
Rearranging yields Y 0.5 
ae (15.11) 
l1- Y K 


This equation states that the ratio of oxygenated heme groups (Y) 
to Os-free heme (1 — Y) is equal to the nth power of the pO» di- 
vided by the apparent dissociation constant, K. 
Archibald Hill demonstrated in 1913, well before any knowl- 
edge about the molecular organization of Hb existed, that the Os- 
binding behavior of Hb could be described by Equation 15.11. If 0 
a value of 2.8 is taken for n, Equation 15.11 fits the experimentally 








Stak 5 : pOds, torr 

observed Og-binding curve for Hb very well (Figure 3). Ifthe bind- A FIGURE3 A comparison of the experimentally observed Og curve for 
ing of Os to Hb were an all-ornone phenomenon, n would equal Hb yielding a value for n of 2.8 (blue), the hypothetical curve if n = 4 

4, as discussed previously. If the O»-binding sites on Hb were com- (red), and the curve if n = 1 (noninteracting O.-binding sites, purple). 





change as a result of ligand binding to the heme, these contacts are altered (Figure 
15.25). Hemoglobin, as a conformationally dynamic molecule, consists of two dimeric 
halves, an a; rsubunit pair and an a@2f-subunit pair. Each af6-dimer moves as a rigid 
body, and the two halves of the molecule slide past each other upon oxygenation of 
the heme. The two halves rotate some 15° about an imaginary pivot passing through 
the aB-subunits; some atoms at the interface between aB-dimers are relocated by as 
much as 0.6 nm. 


Movement of the Heme Iron by Less Than 0.04 nm Induces 
the Conformational Change in Hemoglobin 


In deoxyhemoglobin, histidine F8 is liganded to the heme iron ion, but steric con- 
straints force the Fe?*:His-N bond to be tilted about 8° from the perpendicular to 
the plane of the heme. Steric repulsion between histidine F8 and the nitrogen 
atoms of the porphyrin ring system, combined with electrostatic repulsions between 
the electrons of Fe®* and the porphyrin 7-electrons, forces the iron atom to lie out 
of the porphyrin plane by about 0.06 nm. Changes in electronic and steric factors 
upon heme oxygenation allow the Fe?* atom to move about 0.039 nm closer to the 
plane of the porphyrin, so now it is displaced only 0.021 nm above the plane. It is 
as if the Os were drawing the heme Fe** into the porphyrin plane (Figure 15.26). 
This modest displacement of 0.039 nm seems a trivial distance, but its biological 
consequences are far reaching. As the iron atom moves, it drags histidine F8 along 
with it, causing helix F, the EF corner, and the FG corner to follow. These shifts are 
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We can determine quantitatively the physiological significance of 
the sigmoid nature of the hemoglobin oxygen-binding curve, or, 
in other words, the biological importance of cooperativity. The 
equation 


Y _ [pOs]”" 
l= Po 


describes the relationship between pOs, the affinity of hemoglo- 
bin for Os (defined as Po, the partial pressure of O% giving half- 
maximal saturation of Hb with Ov), and the fraction of hemoglo- 
bin with Os bound, Y, versus the fraction of Hb with no Os bound, 
(1 — Y) (see A Deeper Look: The Oxygen-Binding Curves of Myo- 
globin and Hemoglobin on pages 470-471). The coefficient n is 
the Hill coefficient, an index of the cooperativity (sigmoidicity) of 





The Physiological Significance of the Hb: O, Interaction 


the hemoglobin oxygen-binding curve. Taking pO» in the lungs as 
100 torr, P;9 as 26 torr, and nas 2.8, the fractional saturation of the 
hemoglobin heme groups with Og, is 0.98. If pO» were to fall to 
10 torr within the capillaries of an exercising muscle, Y would 
drop to 0.06. The oxygen delivered under these conditions would 
be proportional to the difference, Yungs — Ymuscler which is 0.92. 
That is, virtually all the oxygen carried by Hb would be released. 
Suppose instead that hemoglobin binding of Oo were not cooper- 
ative; in that case, the hemoglobin oxygen-binding curve would 
be hyperbolic, and n = 1.0. Then Yin the lungs would be 0.79 and 
Y in the capillaries, 0.28; the difference in Y values would be 
0.51. Thus, under these conditions, the cooperativity of oxygen 
binding by Hb means that 0.92/0.51 or 1.8 times as much Os can 
be delivered. 





(a) 


Deoxyhemoglobin 








(b) Oxyhemoglobin 
15° 





By 


CENGAGENOW” ANIMATED FIGURE 15.25 Subunit motion in hemoglobin when the molecule goes from the 
(a) deoxy to the (b) oxy form. (Illustration: Irving Geis. Rights owned by Howard Hughes Medical Institute. Not to be repro- 
duced without permission.) See this figure animated at www.cengage.com/login. 


CENGAGENOW” ACTIVE FIGURE 15.26 Changes in the 
position of the heme iron atom upon oxygenation lead 
to conformational changes in the hemoglobin mole- 
cule. Test yourself on the concepts in this figure at 
www.cengage.com/login. 
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transmitted to the subunit interfaces, where they trigger conformational readjust- 
ments that lead to the rupture of interchain salt links. 


The Oxy and Deoxy Forms of Hemoglobin Represent Two Different 
Conformational States 


Hemoglobin resists oxygenation (see Figure 15.20) because the deoxy form is sta- 
bilized by specific hydrogen bonds and salt bridges (ion-pair bonds) (Figure 
15.27). All of these interactions are broken in oxyhemoglobin, as the molecule sta- 
bilizes into a new conformation. The shift in helix F upon oxygenation leads to 
rupture of the Tyr 6145:Val 698 hydrogen bond. In deoxyhemoglobin, with these 
interactions intact, the C-termini of the four subunits are restrained, and this con- 
formational state is termed T, the tense or taut form. In oxyhemoglobin, these 
C-termini have almost complete freedom of rotation, and the molecule is now in 
its R, or relaxed, form. 


The Allosteric Behavior of Hemoglobin Has Both Symmetry (MWC) 
Model and Sequential (KNF) Model Components 


Oxygen is accessible only to the heme groups of the a-chains when hemoglobin is 
in the T conformational state. Max Perutz has pointed out that the heme environ- 
ment of 6-chains in the T state is virtually inaccessible because of steric hindrance 
by amino acid residues in the E helix. This hindrance disappears when the hemo- 
globin molecule undergoes transition to the R conformational state. Binding of Og 
to the -chains is thus dependent on a T-to-R conformational shift, and this shift is 
triggered by the subtle changes that occur when Oy binds to the a-chain heme 
groups. Together these observations lead to a model that is partially MWC and par- 
tially KNF: Oy, binding to one a-subunit and then the other leads to sequential 
changes in conformation, followed by a switch in quaternary structure at the 
Hb: 20g state from T to R. Thus, the real behavior of this protein is an amalgam of 
the two prominent theoretical models for allosteric behavior. 


H* Promotes the Dissociation of Oxygen from Hemoglobin 


Protons, carbon dioxide, and chloride ions, as well as the metabolite 2,3- 
bisphosphoglycerate (or BPG), all affect the binding of Os by hemoglobin. Their ef- 
fects have interesting ramifications, which we shall see as we discuss them in turn. De- 
oxyhemoglobin has a higher affinity for protons than oxyhemoglobin. Thus, as the 
pH decreases, dissociation of Os from hemoglobin is enhanced. In simple symbolism, 
ignoring the stoichiometry of Ox or H* involved: 


HbO; + Ht=—=HbH* + Os 
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FIGURE 15.27 Salt bridges between different subunits 


in human deoxyhemoglobin. These noncovalen 
static interactions are disrupted upon oxygenati 
(a) A focus on those salt bridges and hydrogen 
involving interactions between N-terminal and 
C-terminal residues in the a-chains. Residues in 
lower center are Arg œ 141 (green) with Val œ; 
(purple), Asp œ; 126 (orange), Lys a 127 (yellow, 
Val B2 34 (olive); residues at top are Val a; 93 (ye 
with Tyr œ; 140 (purple). (b) A focus on those sa 


electro- 
on. 
bonds 


he 


„and 
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t bridges 


and hydrogen bonds involving C-terminal residues of 
B-chains:Val B2 78 (olive) with Tyr B2 145 (purple); His B2 


146 (light blue) with Asp B2 94 (orange) and Lys 
(yellow) (pdb id = 2HHB). 


a, 40 





Changes in the Heme Iron upon O, Binding 





In deoxyhemoglobin, the six d electrons of the heme Fe?* exist as 
four unpaired electrons and one electron pair, and five ligands 
can be accommodated: the four N-atoms of the porphyrin ring sys- 
tem and histidine F8. In this electronic configuration, the iron 
atom is paramagnetic and in the high-spin state. When the heme 
binds Og» as a sixth ligand, these electrons are rearranged into 
three ¢ pairs and the iron changes to the low-spin state and is 
diamagnetic. This change in spin state allows the bond between 


the Fe?* ion and histidine F8 to become perpendicular to the 
heme plane and to shorten. In addition, interactions between the 
porphyrin N atoms and the iron strengthen. Also, high-spin Fe?* 
has a greater atomic volume than low-spin Fe?* because its four 
unpaired e occupy four orbitals rather than two when the elec- 
trons are paired in low-spin Fe?*. So, low-spin iron is less sterically 
hindered and able to move nearer to the porphyrin plane. 
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FIGURE 15.28 The oxygen saturation curves for myoglo- 
bin and for hemoglobin at five different pH values: 7.6, 
74,7.2,7.0,and 6.8. 
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Expressed another way, H* is an antagonist of oxygen binding by Hb, and the satu- 
ration curve of Hb for Og is displaced to the right as acidity increases (Figure 15.28). 
This phenomenon is called the Bohr effect, after its discoverer, the Danish physiol- 
ogist Christian Bohr (the father of Niels Bohr, the atomic physicist). The effect has 
important physiological significance because actively metabolizing tissues produce 
acid, promoting Oy release where it is most needed. About two protons are taken 
up by deoxyhemoglobin. The N-termini of the two a-chains and the His 6146 
residues have been implicated as the major players in the Bohr effect. (The pK, of 
a free amino terminus in a protein is about 8.0, but the pK, of a protein histidine 
imidazole is around 6.5.) Neighboring carboxylate groups of Asp 694 residues help 
stabilize the protonated state of the His 6146 imidazoles that occur in deoxyhemo- 
globin. However, when Hb binds Og, changes in the conformation of 6-chains upon 
Hb oxygenation move the negative Asp function away, and dissociation of the 
imidazole protons is favored. 


CO, Also Promotes the Dissociation of 0, from Hemoglobin 


Carbon dioxide has an effect on Os binding by Hb that is similar to that of H*, 
partly because it produces H* when it dissolves in the blood: 


carbonic anhydrase _ 





CO», + H9O HeCO3; == H+ + HCO37 


carbonic acid bicarbonate 





The enzyme carbonic anhydrase promotes the hydration of CO». Many of the protons 
formed upon ionization of carbonic acid are picked up by Hb as Où dissociates. The 
bicarbonate ions are transported with the blood back to the lungs. When Hb be- 
comes oxygenated again in the lungs, H* is released and reacts with HCO; to re- 
form H COs, from which CO, is liberated. The CO» is then exhaled as a gas. 

In addition, some CO, is directly transported by hemoglobin in the form of car- 
bamate (—NHCOO  ). Free a-amino groups of Hb react with COs reversibly: 


R—NHp» + CO == R—NH—COO-+ Ht 


This reaction is driven to the right in tissues by the high CO, concentration; the 
equilibrium shifts the other way in the lungs where [COs] is low. Thus, carbamyla- 
tion of the N-termini converts them to anionic functions, which then form salt links 
with the cationic side chains of Arg a141 that stabilize the deoxy or T state of 
hemoglobin. 


In addition to COs, Cl” and BPG also bind better to deoxyhemoglobin than to 
oxyhemoglobin, causing a shift in equilibrium in favor of Oy release. These various 
effects are demonstrated by the shift in the oxygen saturation curves for Hb in the 
presence of one or more of these substances (Figure 15.29). Note that the Os- 
binding curve for Hb + BPG + CO, fits that of whole blood very well. 


2,3-Bisphosphoglycerate Is an Important Allosteric Effector 
for Hemoglobin 


The binding of 2,3-bisphosphoglycerate (BPG) to Hb promotes the release 
of Os (Figure 15.29). Erythrocytes (red blood cells) normally contain about 4.5 
mM BPG, a concentration equivalent to that of tetrameric hemoglobin molecules. 
Interestingly, this equivalence is maintained in the Hb: BPG binding stoichiome- 
try because the tetrameric Hb molecule has but one binding site for BPG. This 
site is situated within the central cavity formed by the association of the four sub- 
units. The strongly negative BPG molecule (Figure 15.30) is electrostatically bound 
via interactions with the positively charged functional groups of each Lys 682, His 
p2, His 6143, and the NH3*-terminal group of each B-chain. These positively 
charged residues are arranged to form an electrostatic pocket complementary to 
the conformation and charge distribution of BPG (Figure 15.31). In effect, BPG 
crosslinks the two B-subunits. The ionic bonds between BPG and the two B-chains 
aid in stabilizing the conformation of Hb in its deoxy form, thereby favoring the 
dissociation of oxygen. In oxyhemoglobin, this central cavity is too small for BPG 
to fit. Or, to put it another way, the conformational changes in the Hb molecule 
that accompany Oy» binding perturb the BPG-binding site so that BPG can no 
longer be accommodated. Thus, BPG and Oy, are mutually exclusive allosteric ef- 
fectors for Hb, even though their binding sites are physically distinct. 


BPG Binding to Hb Has Important Physiological Significance 


The importance of the BPG effect is evident in Figure 15.29. Hemoglobin stripped 
of BPG is virtually saturated with Og at a pO% of only 20 torr, and it cannot release its 
oxygen within tissues, where the pO» is typically 40 torr. BPG shifts the oxygen satu- 
ration curve of Hb to the right, making the Hb an Oy, delivery system eminently 
suited to the needs of the organism. BPG serves this vital function in humans, most 
primates, and a number of other mammals. However, the hemoglobins of cattle, 
sheep, goats, deer, and other animals have an intrinsically lower affinity for Oo, and 
these Hbs are relatively unaffected by BPG. 


Fetal Hemoglobin Has a Higher Affinity for 0, Because 
It Has a Lower Affinity for BPG 


The fetus depends on its mother for an adequate supply of oxygen, but its circulatory 
system is entirely independent. Gas exchange takes place across the placenta. Ideally 
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FIGURE 15.30 The structure, in ionic form, of BPG or 2,3-bisphosphoglycerate, an important allosteric effector 
for hemoglobin. 
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FIGURE 15.29 Oxygen-binding curves of blood and of 
hemoglobin in the absence and presence of CO, and 
BPG. From left to right: stripped Hb, Hb + CO, Hb + 
BPG, Hb + BPG + CO, and whole blood. 
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FIGURE 15.31 The ionic binding of BPG to the two 
B-subunits of Hb. BPG lies at center of the cavity be- 
tween the two B-subunits. The highlighted residues are 
N-terminal Val 6; and Val B> (yellow), His B, 2, His B2 2, 
His B, 143,and His B2 143 (purple), Lys Bı 82 and 

Lys B 82 (green) (pdb id = 1B86). 


FIGURE 15.32 Comparison of the oxygen saturation 
curves of Hb A and Hb F under similar conditions of pH 
and [BPG]. 





then, fetal Hb should be able to absorb O» better than maternal Hb so that an effective 
transfer of oxygen can occur. Fetal Hb differs from adult Hb in that the -chains are 
replaced by very similar, but not identical, 146-residue subunits called y-chains (gamma 
chains). Fetal Hb is thus aeye. Recall that BPG functions through its interaction with 
the B-chains. BPG binds less effectively with the y-chains of fetal Hb (also called Hb F). 
(Fetal y-chains have Ser instead of His at position 143 and thus lack two of the positive 
charges in the central BPG-binding cavity.) Figure 15.32 compares the relative affinities 
of adult Hb (also known as Hb A) and Hb F for O% under similar conditions of pH and 
[BPG]. Note that Hb F binds Os» at pO» values where most of the oxygen has dissoci- 
ated from Hb A. Much of the difference can be attributed to the diminished capacity 
of Hb F to bind BPG (compare Figures 15.29 and 15.32); Hb F thus has an intrinsically 
greater affinity for Oo, and oxygen transfer from mother to fetus is ensured. 


Sickle-Cell Anemia Is Characterized by Abnormal Red Blood Cells 


In 1904, a Chicago physician treated a 20-year-old black college student complaining 
of headache, weakness, and dizziness. The blood of this patient revealed serious 
anemia—only half the normal number of red cells were present. Many of these cells 
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Hemoglobin and Nitric Oxide 


Nitric oxide (NO -) is a simple gaseous molecule whose many re- 
markable physiological functions are still being discovered. For ex- 
ample, NO - is known to act as a neurotransmitter and as a second 
messenger in signal transduction (see Chapter 32). Furthermore, 
endothelial relaxing factor (ERF, also known as endothelium- 
derived relaxing factor, or EDRF), an elusive hormonelike agent 
that acts to relax the musculature of the walls (endothelium) of 
blood vessels and lower blood pressure, has been identified as 
NO .-. It has long been known that NO - is a high-affinity ligand for 
Hb, binding to its heme-Fe?* atom with an affinity 10,000 times 
greater than that of Os. An enigma thus arises: Why isn’t NO - in- 
stantaneously bound by Hb within human erythrocytes and pre- 
vented from exerting its vasodilation properties? 

The reason that Hb doesn’t block the action of NO - is due to 
a unique interaction between Cys 936 of Hb and NO - discovered 
by Li Jia, Celia and Joseph Bonaventura, and Johnathan Stamler at 
Duke University. Nitric oxide reacts with the sulfhydryl group of 
Cys 938, forming an S-nitroso derivative: 


—CH)—S-+N=O 


These small-molecule thiols serve to transfer NO - from erythro- 
cytes to endothelial receptors, where it acts to relax vascular ten- 
sion. NO - itself is a reactive free-radical compound whose biolog- 
ical half-life is very short (1-5 sec). S-nitrosoglutathione has a 
half-life of several hours. 

The reactions between Hb and NO: are complex. NO - forms 
a ligand with the heme-Fe?* that is quite stable in the absence of 
Op». However, in the presence of Os, NO - is oxidized to NO; and 
the heme-Fe?* of Hb is oxidized to Fe**, forming methemoglobin. 
Fortunately, the interaction of Hb with NO - is controlled by the 
allosteric transition between R-state Hb (oxyHb) and T-state Hb 
(deoxyHb). Cys 938 is more exposed and reactive in R-state Hb 
than in T-state Hb, and binding of NO - to Cys 936 precludes re- 
action of NO - with heme iron. Upon release of Os from Hb in tis- 
sues, Hb shifts conformation from R state to T state, and binding 
of NO - at Cys 93 is no longer favored. Consequently, NO - is re- 
leased from Cys 936 and transferred to small-molecule thiols for 
delivery to endothelial receptors, causing capillary vasodilation. 
This mechanism also explains the puzzling observation that free 
Hb produced by recombinant DNA methodology for use as a 
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whole-blood substitute causes a transient rise of 10 to 12 mm Hg 
in diastolic blood pressure in experimental clinical trials. (Con- 
ventional whole-blood transfusion has no such effect.) It is now ap- 
parent that the “synthetic” Hb, which has no bound NO ,, is bind- 
ing NO - in the blood and preventing its vasoregulatory function. 

In the course of hemoglobin evolution, the only invariant 
amino acid residues in globin chains are His F8 (the obligatory 
heme ligand) and a Phe residue acting to wedge the heme into its 
pocket. However, in mammals and birds, Cys 936 is also invariant, 
no doubt due to its vital role in NO - delivery. 


This S-nitroso group is in equilibrium with other S-nitroso com- 
pounds formed by reaction of NO ; with small-molecule thiols 
such as free cysteine or glutathione (an isoglutamylcysteinylgly- 
cine tripeptide): 


O O 
H | H || 
H;+N—C—CH»—CH»—C—N—C—C—N—CHp» 

| H | H 
Coo- CH,—s-Ezz 





COO- 


Adapted from Jia, L., et al., 1996. S-Nitrosohaemoglobin: A dynamic activ- 


S-nitrosoglutathione ity of blood involved in vascular control. Nature 380:221-226. 








were abnormally shaped; in fact, instead of the characteristic disc shape, these ery- 
throcytes were elongated and crescentlike in form, a feature that eventually gave 
name to the disease sickle-cell anemia. These sickle cells pass less freely through the 
capillaries, impairing circulation and causing tissue damage. Furthermore, these cells 
are more fragile and rupture more easily than normal red cells, leading to anemia. 


Sickle-Cell Anemia Is a Molecular Disease 


A single amino acid substitution in the -chains of Hb causes sickle-cell anemia. Re- 
placement of the glutamate residue at position 6 in the 6-chain by a valine residue 
marks the only chemical difference between Hb A and sickle-cell hemoglobin, Hb S. 
The amino acid residues at position £6 lie at the surface of the hemoglobin molecule. 
In Hb A, the ionic R groups of the Glu residues fit this environment. In contrast, the 
aliphatic side chains of the Val residues in Hb S create hydrophobic protrusions 
where none existed before. To the detriment of individuals who carry this trait, a hy- 
drophobic pocket forms in the EF corner of each 6-chain of Hb when it is in the de- 
oxy state, and this pocket nicely accommodates the Val side chain of a neighboring 
HbS molecule (Figure 15.33). This interaction leads to the aggregation of Hb S mol- 
ecules into long, chainlike polymeric structures. The obvious consequence is that de- 
oxyHb S is less soluble than deoxyHb A. The concentration of hemoglobin in red 
blood cells is high (about 150 mg/mL), so even in normal circumstances it is on the 
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Oxy- Deoxy- Oxy- 
hemoglobin A hemoglobin A hemoglobin S 
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CENGAGENOW” ANIMATED FIGURE 15.33 The polymerization of Hb S via the interactions between the 
hydrophobic Val side chains at position B6 and the hydrophobic pockets in the EF corners of B-chains in neigh- 
boring Hb molecules. (a) The protruding “block” on Oxy S represents the Val hydrophobic protrusion. The com- 
plementary hydrophobic pocket in the EF corner of deoxy B-chains is represented by a square-shaped indenta- 
tion. (This indentation is probably present in Hb A also.) Only the > Val protrusions and the 6, EF pockets are 
shown. (The f; Val protrusions and the B2 EF pockets are not involved, although they are present.) (b) The 
polymerization of Hb S via B, Val6 insertions into neighboring B; pockets. (c) Molecular graphic of an Hb S 
dimer of tetramers. Bz Val residues are highlighted in blue; heme is shown in red (pdb id = 2HBS). (d) Molecular 
graphic of the Hb S filament (pdb id = 2HBS). See this figure animated at www.cengage.com/login. 


verge of crystallization. The formation of insoluble deoxyHb S fibers distorts the red 
cell into the elongated sickle shape characteristic of the disease.” 


*In certain regions of Africa, the sickle-cell trait is found in 20% of the people. Why does such a dele- 
terious heritable condition persist in the population? For reasons as yet unknown, individuals with this 
trait are less susceptible to the most virulent form of malaria. The geographic distribution of malaria 
and the sickle-cell trait are positively correlated. 





SUMMARY 


15.1 What Factors Influence Enzymatic Activity? The two prominent 
ways to regulate enzyme activity are (1) to increase or decrease the num- 
ber of enzyme molecules or (2) to increase or decrease the intrinsic 
activity of each enzyme molecule. Changes in enzyme amounts are typ- 
ically regulated via gene expression and protein degradation. Changes 
in the intrinsic activity of enzyme molecules are achieved principally by 
allosteric regulation or covalent modification. 


15.2 What Are the General Features of Allosteric Regulation? 
Allosteric enzymes show a sigmoid response of velocity, v, to increasing 
[S], indicating that binding of S to the enzyme is cooperative. Allosteric 
enzymes often are susceptible to feedback inhibition. Allosteric enzymes 
may also respond to allosteric activation. Allosteric activators signal a 
need for the end product of the pathway in which the allosteric enzyme 
functions. As a general rule, allosteric enzymes are oligomeric, with each 
monomer possessing a substrate-binding site and an allosteric site where 
effectors bind. Interaction of one subunit of an allosteric enzyme with its 


substrate (or its effectors) is communicated to the other subunits of the 
enzyme through intersubunit interactions. These interactions can lead 
to conformational transitions that make it easier (or harder) for addi- 
tional equivalents of ligand (S, A, or I) to bind to the enzyme. 


15.3 Can Allosteric Regulation Be Explained by Conformational 
Changes in Proteins? Monod, Wyman, and Changeux postulated that 
the subunits of allosteric enzymes can exist in two conformational 
states (R and T), that all subunits in any enzyme molecule are in the 
same conformational state (symmetry), that equilibrium strongly favors 
the T conformational state, and that S binds preferentially (“only”) to 
the R state. Sigmoid binding curves result, provided that [Tp] => [Ro] 
in the absence of S and that S binds “only” to R. Positive or negative 
effectors influence the relative T/R equilibrium by binding preferen- 
tially to T (negative effectors) or R (positive effectors), and the sub- 
strate saturation curve is shifted to the right (negative effectors) or left 
(positive effectors). 


In an alternative allosteric model suggested by Koshland, Nemethy, 
and Filmer (the KNF model), S binding leads to conformational changes 
in the enzyme. The altered conformation of the enzyme may display 
higher affinity for the substrate (positive cooperativity) or lower affinity 
for the substrate or other ligand (negative cooperativity). Negative co- 
operativity is not possible within the MWC model. Reversible changes in 
the oligomeric state of a protein can also yield allosteric behavior. For ex- 
ample, a monomer-oligomer equilibrium for an allosteric protein, 
where only the oligomer binds S and [monomer] > [oligomer], would 
show cooperative substrate binding. 


15.4 What Kinds of Covalent Modification Regulate the Activity of 
Enzymes? Reversible phosphorylation is the most prominent form of 
covalent modification in cellular regulation. Phosphorylation is accom- 
plished by protein kinases; phosphoprotein phosphatases act in the 
reverse direction to remove the phosphate group. Regulation must be 
imposed on these converter enzymes so that their enzyme targets adopt 
the metabolically appropriate state (active versus inactive). Thus, these 
converter enzymes are themselves the targets of allosteric regulation or 
covalent modification. Although several hundred chemical modifica- 
tions of proteins have been described, only a few are used for reversible 
conversion of enzymes between active and inactive forms. Besides phos- 
phorylation, these regulatory types include adenylylation, uridylylation, 
ADP-ribosylation, methylation, and oxidation-reduction of protein 
disulfide bonds. 


15.5 Is the Activity of Some Enzymes Controlled by Both Allosteric 
Regulation and Covalent Modification? Some enzymes are subject to 
both allosteric regulation and regulation by covalent modification. A 
prime example is glycogen phosphorylase. Glycogen phosphorylase 
exists in two forms, a and b, which differ only in whether or not Ser! 
OH is phosphorylated (a) or not (b). Glycogen phosphorylase b shows 
positive cooperativity in binding its substrate, phosphate. In addition, 
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glycogen phosphorylase bis allosterically activated by the positive effec- 
tor AMP. In contrast, ATP and glucose-6-P are negative effectors for 
glycogen phosphorylase b. Covalent modification of glycogen phospho- 
rylase b by phosphorylase kinase converts it from a less active, allosteri- 
cally regulated form to the more active a form that is less responsive to 
allosteric regulation. Glycogen phosphorylase is both activated and 
freed from allosteric control by covalent modification. 


Special Focus: Is There an Example in Nature That Exemplifies the 
Relationship Between Quaternary Structure and the Emergence of Al- 
losteric Properties? Hemoglobin and Myoglobin—Paradigms of Pro- 
tein Structure and Function Myoglobin and hemoglobin have illumi- 
nated our understanding of protein structure and function. Myoglobin 
is monomeric, whereas hemoglobin has a quaternary structure. Myo- 
globin functions as an oxygen-storage protein in muscle; Hb is an Os- 
transport protein. When Mb binds Os, its heme iron atom is drawn 
within the plane of the heme, slightly shifting the position of the F helix 
of the protein. Hemoglobin shows cooperative binding of Og and allo- 
steric regulation by H*, COs, and 2,3-bisphosphoglycerate. The allosteric 
properties of Hb can be traced to the movement of the F helix upon Og 
binding to Hb heme groups and the effects of F-helix movement on in- 
teractions between the protein’s subunits that alter the intrinsic affinity 
of the other subunits for Og. The allosteric transitions in Hb partially 
conform to the MWC model in that a concerted conformational change 
from a Tstate, low-affinity conformation to an R-state, high-affinity form 
takes place after 2 Op are bound (by the 2 Hb a-subunits). However, Hb 
also behaves somewhat according to the KNF model of allostery in that 
oxygen binding leads to sequential changes in the conformation and Os 
affinity of hemoglobin subunits. Sickle-cell anemia is a molecular disease 
traceable to a tendency for Hb S to polymerize as a consequence of hav- 
ing a BE6V amino acid substitution that creates a “sticky” hydrophobic 
patch on the Hb surface. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 


1. List six general ways in which enzyme activity is controlled. 


2. Why do you suppose proteolytic enzymes are often synthesized as 
inactive zymogens? 

3. (Integrates with Chapter 13.) Draw both Lineweaver—Burk plots 
and Hanes—Woolf plots for an MWC allosteric enzyme system, show- 
ing separate curves for the kinetic response in (a) the absence of 
any effectors, (b) the presence of allosteric activator A, and (c) the 
presence of allosteric inhibitor I. 


4. The KNF model for allosteric transitions includes the possibility of 
negative cooperativity. Draw Lineweaver-Burk and Hanes—Woolf 
plots for the case of negative cooperativity in substrate binding. (As 
a point of reference, include a line showing the classic Michaelis— 
Menten response of v to [S].) 

i Fh, . Y pO 

. The equation a-y” ( Poo 
fractional saturation of hemoglobin with Og), given P;) and n (see 
box on page 472). Let Ps) = 26 torr and n = 2.8. Calculate Y in the 
lungs, where pOz, = 100 torr, and Yin the capillaries, where pO, = 

40 torr. What is the efficiency of Os delivery under these condi- 

tions (expressed as Yiungs — Yeapillaries)? Repeat the calculations, but 

for n= 1. Compare the values for Yungs ~ Yeapittaries for n = 2.8 

versus Yiungs — Yeapillaries for n = 1 to determine the effect of coop- 

erative Os binding on oxygen delivery by hemoglobin. 





i allows the calculation of Y (the 


6. The cAMP formed by adenylyl cyclase (Figure 15.18) does not persist 
because 5'-phosphodiesterase activity prevalent in cells hydrolyzes 
cAMP to give 5'-AMP. Caffeine inhibits 5'-phosphodiesterase activity. 
Describe the effects on glycogen phosphorylase activity that arise as a 
consequence of drinking lots of caffeinated coffee. 


7. If no precautions are taken, blood that has been stored for some 
time becomes depleted in 2,3-BPG. What happens if such blood is 
used in a transfusion? 

8. Enzymes have evolved such that their K,, values (or Kos values) for 
substrate(s) are roughly equal to the in vivo concentration(s) of 
the substrate (s). Assume that glycogen phosphorylase is assayed at 
[Pi] ~ Kos in the absence and presence of AMP or ATP. Estimate 
from Figure 15.14 the relative glycogen phosphorylase activity 
when (a) neither AMP or ATP is present, (b) AMP is present, and 
(c) ATP is present. (Hint: Use a ruler to get relative values for the 
velocity v at the appropriate midpoints of the saturation curves.) 

9. Cholera toxin is an enzyme that covalently modifies the G,-subunit 
of G proteins. (Cholera toxin catalyzes the transfer of ADP-ribose 
from NAD* to an arginine residue in G,, an ADP-ribosylation reac- 
tion.) Covalent modification of G, inactivates its GTPase activity. 
Predict the consequences of cholera toxin on cellular cAMP and 
glycogen levels. 

10. Allosteric enzymes that sit at branch points leading to several essen- 
tial products sometimes display negative cooperativity for feedback 
inhibition (allosteric inhibition) by one of the products. What 
might be the advantage of negative cooperativity instead of positive 
cooperativity in feedback inhibitor binding by such enzymes? 

11. Consult Table 15.2 and 
a. Suggest a consensus amino acid sequence within phosphorylase 

kinase that makes it a target of protein kinase A (the cAMP- 
dependent protein kinase). 

b. Suggest an effective amino acid sequence for a regulatory domain 
pseudosubstrate sequence that would exert intrasteric control on 
phosphorylase kinase by blocking its active site. 

12. What are the relative advantages (and disadvantages) of allosteric 
regulation versus covalent modification? 
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13. You land a post as scientific investigator with a pharmaceutical com- 
pany that would like to develop drugs to treat people with sickle-cell 
anemia. They want ideas from you! What molecular properties of 
Hb S might you suggest as potential targets of drug therapy? 


14. Under appropriate conditions, nitric oxide (NO +) combines with 
Cys 936 in hemoglobin and influences its interaction with Op. Is 
this interaction an example of allosteric regulation or covalent 
modification? 


15. Lactate, a metabolite produced under anaerobic conditions in mus- 
cle, lowers the affinity of myoglobin for Oy. This effect is beneficial, 
because Og dissociation from Mb under anaerobic conditions will 
provide the muscle with oxygen. Lactate binds to Mb at a site dis- 
tinct from the O.-binding site at the heme. In light of this observa- 
tion, discuss whether myoglobin should be considered an allosteric 
protein. 


16. An allosteric model based on multiple oligomeric states of a pro- 
tein has been proposed by E. K. Jaffe (2005. Morpheeins: A new 
structural paradigm for allosteric regulation. Trends in Biochemical 
Sciences 30:490-497). This model coins the term morpheeins to de- 
scribe the different forms of a protein that can assume more than 
one conformation, where each distinct conformation assembles 
into an oligomeric structure with a fixed number of subunits. For 
example, conformation A of the protein monomer forms trimers, 
whereas conformation B of the monomer forms tetramers. If 
trimers and tetramers have different kinetic properties (Ky and keat 
values), as in low-activity trimers and high-activity tetramers, then 
the morpheein ensemble behaves like an allosterically regulated 
enzyme. Drawing on the traditional MWC model as an analogy, di- 
agram a simple morpheein model in which wedge-shaped protein 
monomers assemble into trimers but the alternative conformation 


for the monomer (a square shape) forms tetramers. Further, the 
substrate, S, or allosteric regulator, A, binds “only” to the square 
conformation, and its binding prevents the square from adopting 
the wedge conformation. Describe how your diagram yields allo- 
steric behavior. 


17. CTP synthetase catalyzes the synthesis of CTP from UTP: 
UTP + ATP + glutamine == CTP + glutamate + ADP + P; 


The substrates UTP and ATP show positive cooperativity in their 
binding to the enzyme, which is an a4-type homotetramer. However, 
the other substrate, glutamine, shows negative cooperativity. Draw 
substrate saturation curves of the form v versus [S]/ Ko, for each of 
these three substrates that illustrate these effects. 

18. Glyceraldehyde-3-phosphate dehydrogenase catalyzes the synthesis 
of 1,3-bisphosphoglycerate: 


Glyceraldehyde-3-P + P; + NAD* <=> 1,3-BPG + NADH + Ht 


The enzyme is a tetramer. NAD* binding shows negative coopera- 
tivity. Draw a diagram of possible conformational states for this 
tetrameric enzyme and its response to NAD* binding that illustrates 
negative cooperativity. 


Preparing for the MCAT Exam 


19. On the basis of the graphs shown in Figures 15.28 and 15.29 and the 
relationship between blood pH and respiration (Chapter 2), predict 
the effect of hyperventilation and hypoventilation on Hb:O, affinity. 

20. Figure 15.17 traces the activation of glycogen phosphorylase from 
hormone to phosphorylation of the 6 form of glycogen phosphory- 
lase to the a form. These effects are reversible when hormone dis- 
appears. Suggest reactions by which such reversibility is achieved. 
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Molecular Motors 


ESSENTIAL QUESTION 


Movement is an intrinsic property associated with all living things. Within cells, mole- 
cules undergo coordinated and organized movements, and cells themselves may move 
across a surface. At the tissue level, muscle contraction allows higher organisms to 
carry out and control crucial internal functions, such as peristalsis in the gut and the 
beating of the heart. Muscle contraction also enables the organism to perform orga- 
nized and sophisticated movements, such as walking, running, flying, and swimming. 


How can biological macromolecules, carrying out conformational changes on 
the molecular level, achieve these feats of movement that span the microscopic 
and macroscopic worlds? 


16015 What Is a Molecular Motor? 


Motor proteins, also known as molecular motors, use chemical energy (ATP) to or- 
chestrate movements, transforming ATP energy into the mechanical energy of mo- 
tion. In all cases, ATP hydrolysis is presumed to drive and control protein confor- 
mational changes that result in sliding or walking movements of one molecule 
relative to another. To carry out directed movements, molecular motors must be able 
to associate and dissociate reversibly with a polymeric protein array, a surface or sub- 
structure in the cell. ATP hydrolysis drives the process by which the motor protein 
ratchets along the protein array or surface. As fundamental and straightforward as 
all this sounds, elucidation of these basically simple processes has been extremely 
challenging for biochemists, involving the application of many sophisticated chemi- 
cal and physical methods in many different laboratories. This chapter describes the 
structures and chemical functions of molecular motor proteins and some of the ex- 
periments by which we have come to understand them. 

Molecular motors may be linear or rotating. Linear motors crawl or creep along a 
polymer lattice, whereas rotating motors consist of a rotating element (the “rotor”) 
and a stationary element (the “stator”), in a fashion much like a simple electrical 
motor. The linear motors we will discuss include kinesins and dyneins (which crawl 
along microtubules), myosin (which slides along actin filaments in muscle), and 
DNA helicases (which move along a DNA lattice, unwinding duplex DNA to form 
single-stranded DNA). Rotating motors include the flagellar motor complex, de- 
scribed in this chapter, and the ATP synthase, which will be described in Chapter 20. 


16:28 What Is the Molecular Mechanism 
of Muscle Contraction? 


Muscle Contraction Is Triggered by Ca?* Release 
from Intracellular Stores 


Muscle contraction is the result of interactions between myosin and actin, the two 
predominant muscle proteins. Thick filaments of myosin slide along thin fila- 
ments of actin to cause contraction. 

The cells of skeletal muscle are long and multinucleate and are referred to as 
muscle fibers. Skeletal muscles in higher animals consist of 100-um-diameter fibers, 
some as long as the muscle itself. Each of these muscle fibers contains hundreds of 
myofibrils (Figure 16.1), each of which spans the length of the fiber and is about 
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FIGURE 16.1 The structure of a skeletal muscle cell, 
showing the manner in which transverse tubules en- 
able the sarcolemmal membrane to extend into the in- 
terior of the fiber. T-tubules and sarcoplasmic reticulum 
(SR) membranes are juxtaposed at structures termed 
triad junctions (inset). 


Myofibril 


Mitochondrion 


HUMAN BIOCHEMISTRY 


Transverse tubule 





Smooth Muscle Effectors Are Useful Drugs 


Not all vertebrate muscle is skeletal muscle. Vertebrate organisms 
employ smooth muscle for long, slow, and involuntary contractions 
in various organs, including large blood vessels, intestinal walls, the 
gums of the mouth, and in the female, the uterus. Smooth muscle 
contraction is triggered by Ca?*-activated phosphorylation of 
myosin by myosin light-chain kinase (MLCK). The action of epi- 
nephrine and related agents forms the basis of therapeutic control 
of smooth muscle contraction. Breathing disorders, including 
asthma and various allergies, can result from excessive contraction 
of bronchial smooth muscle tissue. Treatment with epinephrine, 
whether by tablets or aerosol inhalation, inhibits MLCK and relaxes 
bronchial muscle tissue. More specific bronchodilators, such as 
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albuterol (see accompanying figure), act more selectively on the 
lungs and avoid the undesirable side effects of epinephrine on the 
heart. Albuterol is also used to prevent premature labor in pregnant 
women because of its relaxing effect on uterine smooth muscle. 
Conversely, oxytocin, known also as Pitocin, stimulates contraction 
of uterine smooth muscle. This natural secretion of the pituitary 
gland is often administered to induce labor. 





The structure of oxytocin. 
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1 to 2 um in diameter. Myofibrils are linear arrays of cylindrical sarcomeres, the 
basic structural units of muscle contraction. Each myofibril is surrounded by a spe- 
cialized endoplasmic reticulum called the sarcoplasmic reticulum (SR). The SR con- 
tains high concentrations of Ca2t, and the release of Ca?* from the SR and its 
interactions within the sarcomeres trigger muscle contraction. The muscle fiber is 
surrounded by the sarcolemma, a specialized plasma membrane. Extensions of the 
sarcolemma, called transverse tubules, or t-tubules, reach deep into the muscle fiber, 
enabling the sarcolemmal membrane to be in contact with each myofibril. 

Skeletal muscle contractions are initiated by nerve stimuli that act directly on the 
muscle. Nerve impulses produce an electrochemical signal (see Chapter 32) called 
an action potential that spreads over the sarcolemmal membrane and into the fiber 
along the t-tubule network. This signal induces the release of Ca?* ions from the SR. 
These Ca®* ions bind to proteins within the muscle fibers and induce contraction. 


The Molecular Structure of Skeletal Muscle Is Based on Actin 
and Myosin 


Examination of myofibrils in the electron microscope reveals a banded or striated 
structure. The bands are traditionally identified by letters (Figure 16.2). Regions of 
high electron density, denoted A bands, alternate with regions of low electron den- 
sity, the I bands. Small, dark Z lines lie in the middle of the I bands, marking the 
ends of the sarcomere. Each A band has a central region of slightly lower electron 
density called the H zone, which contains a central M disc (also called an M line). 
Electron micrographs of cross sections of each of these regions reveal molecular de- 
tails. The H zone shows a regular, hexagonally arranged array of thick filaments of 
myosin (15 nm diameter), whereas the I band shows a regular, hexagonal array of 
thin filaments of actin, together with proteins known as troponin and tropomyosin 
(7 nm diameter). In the dark regions at the ends of each A band, the thin and thick 
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FIGURE 16.2 Electron micrograph of a skeletal muscle 
myofibril (in longitudinal section). The length of one 
sarcomere is indicated, as are the A and | bands, the H 
zone, the M disc, and the Z lines. Cross sections from the 
H zone show a hexagonal array of thick filaments, 
whereas the | band cross section shows a hexagonal 
array of thin filaments. 


484 Chapter16 Molecular Motors 





FIGURE 16.3 The three-dimensional structure of an 
actin monomer from skeletal muscle. This view shows 


the two domains (left and right) of actin 
(pdb id = 1J6Z). 





FIGURE 16.4 A molecular model of an actin polymer, 
based on the actin monomer structure shown in 
Figure 16.3 (pdb id = 1A5X). 


FIGURE 16.5 (a) A schematic drawing of a myosin hexamer, showing the two heavy chains 
and four light chains. The tail is a coiled coil of intertwined a-helices extending from the two 
globular heads. One of each of the myosin light-chain proteins, LC1 and LC2, is bound to each of 
the globular heads. (b) A ribbon diagram shows the structure of the myosin head. The head and 
neck domains of the heavy chain are red; the essential light chain is yellow and the regulatory 


light chain is blue (pdb id = 1B7T). 


filaments interdigitate, as shown in Figure 16.2. The thin and thick filaments are 
joined by cross-bridges. These cross-bridges are actually extensions of the myosin 
molecules, and muscle contraction is accomplished by the sliding of the cross- 
bridges along the thin filaments, a mechanical movement driven by the free energy 
of ATP hydrolysis. 


The Composition and Structure of Thin Filaments Actin, the principal component 
of thin filaments, is found in substantial amounts in most eukaryotic cells. At low ionic 
strength, actin exists as a 42-kD globular protein, denoted G-actin (Figure 16.3). Un- 
der physiological conditions (higher ionic strength), G-actin polymerizes to form a fi- 
brous form of actin, called F-actin. As shown in Figure 16.4, F-actin is a right-handed 
helical structure, with a helix pitch of about 72 nm per turn. The F-actin helix is the 
core of the thin filament, to which tropomyosin and the troponin complex also add. 
Tropomyosin winds around actin filaments and prevents myosin binding in resting 
muscle. When a nerve impulse arrives at the sarcolemmal membrane, Ca?* ions re- 
leased from the sarcoplasmic reticulum bind to the troponin complex, inducing a 
conformation change that allows myosin to bind to actin, initiating contraction. In 
nonmuscle cells, actin filaments are the highways across which a variety of cellular 
cargo is transported. 


The Composition and Structure of Thick Filaments Myosin, the principal compo- 
nent of muscle thick filaments, is a large protein consisting of six polypeptides, with 
an aggregate molecular weight of approximately 540 kD. As shown in Figure 16.5, the 
six peptides include two 230-kD heavy chains, as well as two pairs of different 20-kD 
light chains, denoted LC1 and LC2. The heavy chains consist of globular amino- 
terminal myosin heads, joined to long a-helical carboxy-terminal segments, the tails. 
These tails are intertwined to form a left-handed coiled coil approximately 2 nm in 
diameter and 130 to 150 nm long. Each of the heads in this dimeric structure is asso- 
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ciated with an LCI] and an LC2. The myosin heads exhibit ATPase activity, and hy- 
drolysis of ATP by the myosin heads drives muscle contraction. LC] is also known as 
the essential light chain, and LC2 is designated the regulatory light chain. Both light 
chains are homologous to calmodulin and troponin C. Dissociation of LC] from the 
myosin heads by alkali cations results in loss of the myosin ATPase activity. 

The myosin head consists of a globular domain, where ATP is bound and hy- 
drolyzed, and a long a-helical neck, to which the light chains are bound. The most 
prominent feature of the globular head is the actin-binding cleft between the so- 
called upper and lower domains. The N-terminal domain and the upper domain to- 
gether form a seven-stranded B-sheet. The ATP-binding site is partially defined by 
three loops: switch 1, switch 2, and the P-loop. Conformation changes driven by 
ATP hydrolysis cause a rotation of the converter domain and the long-neck helix— 
the fundamental event in contraction. 


Repeating Structural Elements Are the Secret of Myosin’s Coiled Coils Several 
key features of the myosin sequence are responsible for the a-helical coiled coils 
formed by myosin tails. Several orders of repeating structure are found in all myosin 
tails, including 7-residue, 28-residue, and 196-residue repeating units. Large 
stretches of the tail domain are composed of 7-residue repeating segments. The first 
and fourth residues of these 7-residue units are generally small, hydrophobic amino 
acids, whereas the second, third, and sixth are likely to be charged residues. The 
consequence of this arrangement is shown in Figure 16.6. Seven residues form two 
turns of an a-helix, and in the coiled coil structure of the myosin tails, the first and 
fourth residues face the interior contact region of the coiled coil. Residues b, c, and 
f (2, 3, and 6) of the 7-residue repeat face the periphery, where charged residues 
can interact with the water solvent. At the 28 (4 X 7) residue and 196 (28 X 7) 
residue levels, specialized amino acid sequence patterns promote packing of large 
numbers of myosin tails in offset or staggered arrays (Figure 16.7). 
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FIGURE 16.6 An axial view of the two-stranded, a-helical 
coiled coil of a myosin tail. Hydrophobic residues a and 
d of the 7-residue repeat sequence align to form a 
hydrophobic core. Residues b, c, and f face the outer sur- 
face of the coiled coil and are typically ionic. 





FIGURE 16.7 The packing of myosin molecules in a thick 
filament. Adjoining molecules are offset by approxi- 
mately 14 nm, a distance corresponding to 98 residues 
of the coiled coil. 





Skeletal muscle myosin is just one member of a large class of en- 
zymes that convert the free energy of NTP hydrolysis into chemi- 
cal signaling, mechanical work, or both. These enzymes all employ 
a polypeptide loop between a -strand and an a-helix with the 
sequence GxxxxGK(S/T) or GxxGK(S/T). This sequence, the so- 
called P-loop, coordinates the triphosphate chain of the NTP to be 
cleaved. Side-chain (lysine amino and serine/threonine-OH) 
groups, as well as backbone (amide-NH) groups of the P-loop, 
position the B- and y-phosphate groups of the substrate so as to 
facilitate hydrolysis. Genomic analysis reveals that 10% to 18% of 
predicted gene products are P-loop NTPases. P-loops are found in 
a variety of motor proteins, including myosins and kinesins, as well 
as in the ABC ATPases (see Chapter 9), the AAA+ ATPases (see 


The P-Loop: A Common Motif in Enzymes That Hydrolyze Nucleoside Triphosphates 








Section 16.4), the F1 ATPase (a rotary motor; see Chapter 20), the 
GTP-binding proteins known as G-proteins (for example, EF-Tu 
discussed in Chapter 30 and Ras discussed in Chapter 32), and 
adenylate kinase (see Chapter 27). 


> P-loops of histidine 
permease (pdb id = 1B0U, 
orange) and HprK protein 
kinase (pdb id = I1KKL, green). 
ATP is shown in light blue. 
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HUMAN BIOCHEMISTRY 


The Molecular Defect in Duchenne Muscular Dystrophy Involves 


an Actin-Anchoring Protein 


Duchenne muscular dystrophy is a degenerative and fatal disorder 
of muscle affecting approximately 1 in 3500 boys. Victims of 
Duchenne dystrophy show early abnormalities in walking and run- 
ning. By the age of 5, the victim cannot run and has difficulty stand- 
ing, and by early adolescence, walking is difficult or impossible. 
The loss of muscle function progresses upward in the body, affect- 
ing next the arms and the diaphragm. Respiratory problems or in- 
fections usually result in death by the age of 30. Louis Kunkel and 
his co-workers identified the Duchenne muscular dystrophy gene 
in 1986. This gene produces a protein called dystrophin, which is 
highly homologous to a-actinin and spectrin. A defect in dys- 
trophin is responsible for the muscle degeneration of Duchenne 
dystrophy. 

Dystrophin is located on the cytoplasmic face of the muscle 
plasma membrane, linked to the plasma membrane via an integral 
membrane glycoprotein. Dystrophin has a high molecular mass 


(427 kD) but constitutes less than 0.01% of the total muscle protein. 
It folds into four principal domains (see accompanying figure, part 
a), including an N-terminal domain similar to the actin-binding do- 
mains of actinin (in muscle) and spectrin (in red blood cells), a long 
repeat domain, a B-dystroglycan-binding domain, and a C-terminal 
domain that is unique to dystrophin. The repeat domain consists of 
24 triple-helical repeat units of approximately 109 residues each. 
“Spacer sequences” high in proline content, which do not align with 
the repeat consensus sequence, occur at the beginning and end of 
the repeat domain. Spacer segments are found between repeat ele- 
ments 3 and 4 and 19 and 20. The high proline content of the spac- 
ers suggests that they may represent hinge domains. The spacer/ 
hinge segments are sensitive to proteolytic enzymes, indicating that 
they may represent more exposed regions of the polypeptide. The 
N-terminal actin-binding domain appears capable of binding to 24 
actin monomers in a polymerized actin filament. 
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A comparison of the amino acid sequence of dystrophin, a-actinin, and spectrin. The potential 
hinge segments in the dystrophin structure are indicated. 











The Mechanism of Muscle Contraction Is Based on Sliding Filaments 


When muscle fibers contract, the thick myosin filaments slide or walk along the thin 
actin filaments. The basic elements of the sliding filament model were first de- 
scribed in 1954 by two different research groups: Hugh Huxley and his colleague 
Jean Hanson, and the physiologist Andrew Huxley and his colleague Ralph 
Niedergerke. Several key discoveries paved the way for this model. Electron micro- 
scopic studies of muscle revealed that sarcomeres decreased in length during con- 
traction and that this decrease was due to decreases in the width of both the I band 
and the H zone (Figure 16.8). At the same time, the width of the A band (which is 
the length of the thick filaments) and the distance from the Z discs to the nearby H 
zone (that is, the length of the thin filaments) did not change. These observations 
made it clear that the lengths of both the thin and thick filaments were constant 
during contraction. This conclusion was consistent with a sliding filament model. 


The Sliding Filament Model The shortening of a sarcomere (Figure 16.8) involves 
sliding motions in opposing directions at the two ends of a myosin thick filament. Net 


Dystrophin itself appears to be part of an elaborate protein- 
glycoprotein complex that bridges the inner cytoskeleton (actin fil- 
aments) and the extracellular matrix (via a matrix protein called 
laminin) (see figure). It is now clear that defects in one or more 
of the proteins in this complex are responsible for many of the 
other forms of muscular dystrophy. The glycoprotein complex is 
composed of two subcomplexes, the dystroglycan complex and 
the sarcoglycan complex. The dystroglycan complex consists of 
a-dystroglycan, an extracellular protein that binds to merosin, a 
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laminin subunit and component of the extracellular matrix, and 
B-dystroglycan, a transmembrane protein that binds the C-terminal 
domain of dystrophin inside the cell (see figure). The sarcoglycan 
complex is composed of a-, B-, and y-sarcoglycans, all of which are 
transmembrane glycoproteins. Alterations of the sarcoglycan pro- 
teins are linked to limb-girdle muscular dystrophy and autosomal 
recessive muscular dystrophy. Mutations in the gene for merosin, 
which binds to a-dystroglycan, are linked to severe congenital mus- 
cular dystrophy, yet another form of the disease. 





< A model for the actin—-dystrophin-glycoprotein complex 
in skeletal muscle. Dystrophin is postulated to form tetramers 
of antiparallel monomers that bind actin at their N-termini 
and a family of dystrophin-associated glycoproteins at their 
C-termini. This dystrophin-anchored complex may function 
to stabilize the sarcolemmal membrane during contraction— 
relaxation cycles, link the contractile force generated in the 
cell (fiber) with the extracellular environment, or maintain 
local organization of key proteins in the membrane. The 
dystrophin-associated membrane proteins (dystroglycans, 
DGs, and sarcoglycans, SGs) range from 25 to 154 kD. 
(Adapted from Ahn, A. H., and Kunkel, L. M., 1993. Nature Genetics 
3:283-291; and Worton, R., 1995. Science 270:755—756.) 
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Contracted 


H zone and I band decrease in width 


a FIGURE 16.8 The sliding filament model of skeletal 
“ muscle contraction. The decrease in sarcomere 
length is due to decreases in the width of the | band 
and H zone, with no change in the width of the A 
band.These observations mean that the lengths of 
both the thick and thin filaments do not change 
during contraction. Rather, the thick and thin fila- 
ments slide along one another. 
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sliding motions in a specific direction occur because the thin and thick filaments both 
have directional character. Actin filaments always extend outward from the Z lines in a 
uniform manner. The myosin thick filaments also assemble in a directional manner. 
The polarity of myosin thick filaments reverses at the M disc, which means that actin 
filaments on either side of the M disc are pulled toward the M disc during contraction 
by the sliding of the myosin heads, causing net shortening of the sarcomere. 


Albert Szent-Gydrgyi’s Discovery of the Effects of Actin on Myosin The molecular 
events of contraction are powered by the ATPase activity of myosin. Much of our pre- 
sent understanding of this reaction and its dependence on actin can be traced to sev- 
eral key discoveries by Albert Szent-Györgyi at the University of Szeged in Hungary in 
the early 1940s. 

In a series of elegant and insightful experiments, Szent-Györgyi showed the 
following: 


e Solution viscosity is increased dramatically when solutions of myosin and 
actin are mixed. Increased viscosity is a manifestation of the formation of an 
actomyosin complex. 

e The viscosity of an actomyosin solution is lowered by the addition of ATP, indi- 
cating that ATP decreases myosin’s affinity for actin. 

e Myosin ATPase activity is increased substantially by actin. (For this reason, Szent- 
Györgyi gave the name actin to the thin filament protein.) The ATPase turnover 
number of pure myosin is 0.05/sec, but when actin is added, the turnover num- 
ber increases to about 10/sec, a number more like that of intact muscle fibers. 


Szent-Gy6rgyi’s experiments demonstrated that ATP hydrolysis and the associa- 
tion and dissociation of actin and myosin are coupled. It is this coupling that en- 
ables ATP hydrolysis to power muscle contraction. 


The Coupling Mechanism: ATP Hydrolysis Drives Conformation Changes in the 
Myosin Heads The only remaining piece of the puzzle is this: How does the close 
coupling of actin-myosin binding and ATP hydrolysis result in the shortening of 
myofibrils? Put another way, how are ATP hydrolysis and the sliding filament model 
related? The answer to this puzzle is shown in Figure 16.9. The free energy of ATP 
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CENGAGENOW” ACTIVE FIGURE 16.9 The mechanism of skeletal muscle contraction. The free energy of ATP 
hydrolysis drives a conformational change in the myosin head, resulting in net movement of the myosin heads 
along the actin filament. (Adapted from Geeves, M., and Holmes, K., 2005. The molecular mechanism of muscle contraction. 
Advances in Protein Chemistry 71:161-193.) Test yourself on the concepts in this figure at 
www.cengage.com/login. 
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The optical trapping experiment involves the attachment of 
myosin molecules to silica beads that are immobilized on a mi- 
croscope coverslip (see accompanying figure). Actin filaments 
are then prepared such that a polystyrene bead is attached to 
each end of the filament. These beads can be “caught” and held 
in place in solution by a pair of “optical traps’—two high- 
intensity infrared laser beams, one focused on the polystyrene 
bead at one end of the actin filament and the other focused on 
the bead at the other end of the actin filament. The force acting 
on each bead in such a trap is proportional to the position of 
the bead in the “trap,” so displacement and forces acting on the 
bead (and thus on the actin filament) can both be measured. 


Molecular “Tweezers” of Light Take the Measure of a Muscle Fiber’s Force 


For a movement of 11 nm against a force of 4 pN, the energy re- 
quirement is larger: 


w= (4 pN): (11 nm) = 4.4 x 10°?°J 
If the cellular free energy of hydrolysis of ATP is taken as —50 kJ/ 


mol, the free energy available from the hydrolysis of a single ATP 
molecule is 


AG = (—50 kJ/mol) / (6.02 X 107° molecules/mol) = 8.3 x 107* J 


Thus, the free energy of hydrolysis of a single ATP molecule is sufficient to 
drive the observed movements against the forces that have been measured. 


When the “trapped” actin filament is brought close to the 
myosin-coated silica bead, one or a few myosin molecules may 
interact with sites on the actin and ATP-induced interactions of 
individual myosin molecules with the trapped actin filament can 
be measured and quantitated. Such optical trapping experi- 
ments have shown that a single cycle or turnover of a single myosin 
molecule along an actin filament involves an average movement of 4 to 
11 nm (40-110 A) and generates an average force of 1.7 to 4X 1072 
newton (1.7—4 piconewtons [pN]). 

The magnitudes of the movements observed in the optical trap- 
ping experiments are consistent with the movements predicted by 
the cryoelectron microscopy imaging data. Can the movements 
and forces detected in a single contraction cycle by optical trap- 
ping also be related to the energy available from hydrolysis of a sin- 
gle ATP molecule? The energy required for a contraction cycle is 
defined by the “work” accomplished by contraction, and work (w) 
is defined as force (F) times distance (d): 


w=F-d 


Optical trap Optical trap 


Polystyrene beads 


Actin 
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A Movements of single myosin molecules along an actin filament can be 
measured by means of an optical trap consisting of laser beams focused 
on polystyrene beads attached to the ends of actin molecules. (Adapted from 
Finer, J. T., et al., 1994. Single myosin molecule mechanics: Piconewton forces and nanometre 
steps. Nature 368:113-119. See also Block, S. M., 1995. Macromolecular physiology. 
Nature 378: 132-133.) 


For a movement of 4 nm against a force of 1.7 pN, we have 


w= (1.7 pN) - (4 nm) = 0.68 x 107% J 








hydrolysis is translated into conformation changes in the myosin head, so dissocia- 
tion of myosin and actin, hydrolysis of ATP, and rebinding of myosin and actin oc- 
cur with stepwise movement of the myosin head along the actin filament. The con- 
formation changes in the myosin head are driven by the binding and hydrolysis of 
ATP. 

As shown in the cycle in Figure 16.9, the myosin heads—with the hydrolysis prod- 
ucts ADP and P; bound—are mainly dissociated from the actin filaments in resting 
muscle. When the signal to contract is presented (see following discussion), the 
myosin heads move out from the thick filaments to bind to actin on the thin fila- 
ments (step 1). Actin binding closes the cleft in the myosin head, which causes a 
twist in the large B-sheet. The twist causes switch 1 and the P-loop to “open,” both 
of them moving away from the bound ADP and P; (Figure 16.10). The B-sheet twist 
also straightens the kink in the relay helix. These conformation changes result in 
the top-of-power stroke state shown in Figure 16.9, but this state is transient. The 
power stroke occurs almost immediately, accompanied by dissociation of P; and 
then ADP. The power stroke consists of a 60° rotation of the converter domain and 
the long a-helical neck into the down position relative to the myosin head (Figure 
16.9)—a movement that results in a 100-A movement of the end of the neck in the 
direction of contraction. 

The end of the power stroke is termed the rigor-like state because without access 
to additional ATP, the actin—myosin pair would be locked together, unable to dis- 
sociate. However, binding of another ATP causes dissociation of myosin from actin, 
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«1 FIGURE 16.10 Details of the switch domains, the relay helix and the converter domain are shown for (a) the post- 
rigor state and (b) the pre-power stroke state of skeletal muscle myosin. ATP hydrolysis drives these conformation 
changes. Actin binding induces a twist in the large B-sheet of the myosin head, causing the switch 1 and P-loop 
segments to “open.” (Adapted from Geeves, M, and Holmes, K, 2005.The molecular mechanism of muscle contraction. Advances in 
Protein Chemistry 71:161-193. Figure provided by Kenneth Holmes, Max Plank Institute for Medical Research, Heidelberg.) 


as first noticed by Szent-Györgyi. This dissociation occurs with “opening” of switch 
2, in which the switch 2 segment (the lower part of strand 5 and a short following 
loop) moves out of the plane of the seven-stranded f-sheet (Figure 16.10). 

ATP forms a strong interaction with switch 1 in the myosin active site, inducing 
switch 1 to close (moving toward the £- and y-phosphates of bound ATP), presum- 
ably also causing switch 2 to close (with the lower part of strand 5 moving back into 
the plane of the seven-stranded f-sheet). Switch 2 closing induces formation of the 
kink in the relay helix, causing a 60° rotation of the converter domain and neck he- 
lix into the up position relative to the myosin head. This movement completes the 
cycle of contraction as it “primes” the motor, preparing it for the next power stroke. 
This cycle is repeated at rates up to 5/sec in a typical skeletal muscle contraction. 
The conformational changes occurring in this cycle are the secret of the energy 
coupling that allows ATP binding and hydrolysis to drive muscle contraction. 


16:38 What Are the Molecular Motors That Orchestrate 
the Mechanochemistry of Microtubules? 


Filaments of the Cytoskeleton Are Highways That Move Cellular Cargo 


Most eukaryotic cells contain elaborate networks of protein fibers collectively 
termed the cytoskeleton. The cytoskeleton is a dynamic, three-dimensional struc- 
ture (Figure 16.11) that fills the cytoplasm and functions to: 


Establish cell shape 

Provide mechanical strength 

Facilitate cell movement 

Support intracellular transport of organelles and other cellular cargo 
Guide chromosome separation during mitosis and meiosis 


Three types of fibers comprise the cytoskeleton: microfilaments of actin (with a 
diameter of 3 to 6 nm), microtubules made from tubulin (20 to 25 nm diameter), 
and intermediate filaments formed from a variety of proteins (about 10 nm diame- 
ter). All of these have dynamic properties that facilitate the movement of organelles 
and other molecular cargo through the cell. 

Intermediate filaments provide a supporting network that allows cells to resist 
mechanical stress and deformation. Intermediate filaments are dynamic, and short 
filament segments (termed “squiggles” by Robert Goldman and his co-workers) can 
be transported across cells by motor proteins riding on microfilaments and micro- 
tubules. Polymeric actin microfilaments serve at least two functions in cells: They 
form networks just beneath the plasma membrane that link transmembrane pro- 
teins to cytoplasmic proteins, and they provide transcellular tracks on which or- 
ganelles can be transported by myosin-like proteins. Microtubules are the best un- 
derstood components of the cytoskeleton, and they are the focus of this section. 

Microtubules are hollow, cylindrical structures, approximately 30 nm in diameter, 
formed from tubulin, a dimeric protein composed of two similar 55-kD subunits 
known as a-tubulin and B-tubulin. Eva Nogales, Sharon Wolf, and Kenneth Downing 
have determined the structure of the bovine tubulin aB-dimer to 3.7 Aresolution (Fig- 
ure 16.12a). Tubulin dimers polymerize as shown in Figure 16.12b to form micro- 
tubules, which are essentially helical structures, with 13 tubulin monomer “residues” 
per turn. Microtubules grown in vitro are dynamic structures that are constantly being 
assembled and disassembled. Because all tubulin dimers in a microtubule are oriented 
similarly, microtubules are polar structures. The end of the microtubule at which 
growth occurs is the plus end, and the other is the minus end. Microtubules in vitro 
carry out a GTP-dependent process called treadmilling, in which tubulin dimers are 


16.3. What Are the Molecular Motors That Orchestrate the Mechanochemistry of Microtubules? 











© Dr. Gopal Murti/Visuals Unlimited 





491 























David Phillips/Visuals Unlimited 


Eric Grave/Phototake 











© Thomas Deerinck/Visuals Unlimited 


(d) 


(g) 
Plasma 
membrane 





Endoplasmic 
reticulum 


< W Microtubule 


Mitochondrion 


NN 
H 
(~~ 
| 4 Al Microfilaments 


Intermediate 
filaments 














Dr. Tony Brain/Custom Medical Stock 


FIGURE 16.11 Micrographs and electron micrographs of cytoskeletal ele- 
ments, cilia, and flagella: (a) microtubules (shown in red); (b) rat sperm tail 
microtubules (cross section); (c) Stylonychia, a ciliated protozoan (under- 
going division); (d) cytoskeleton of a eukaryotic cell (microtubules are 
green; nucleus is blue); (e) Pseudomonas fluorescens (aerobic soil bac- 
terium), showing flagella; (f) nasal cilia; and (g) schematic drawing of ele- 
ments of the cytoskeleton. 


added to the plus end at about the same rate at which dimers are removed from the 


minus end (Figure 16.13). 


Although composed only of 55-kD tubulin subunits, microtubules can grow suffi- 
ciently large to span a eukaryotic cell or to form large structures such as cilia and fla- 
gella. Inside cells, networks of microtubules play many functions, including formation 
of the mitotic spindle that segregates chromosomes during cell division, the move- 
ment of organelles and various vesicular structures through the cell, and the variation 
and maintenance of cell shape. In most cells, microtubules are oriented with their mi- 
nus ends toward the centrosome and their plus ends toward the cell periphery. This 
consistent orientation is important for mechanisms of intracellular transport. 
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CENGAGENOW” ACTIVE FIGURE 16.13 A model of 

the GTP-dependent treadmilling process. Both œ- and 
B-tubulin possess two different binding sites for GTP. 
The polymerization of tubulin to form microtubules is 
driven by GTP hydrolysis in a process that is only begin- 
ning to be understood in detail. Test yourself on the 
concepts in this figure at www.cengage.com/login. 
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FIGURE 16.12 (a) The structure of the tubulin @B-heterodimer (pdb id = 1JFF).(b) Microtubules may be viewed 
as consisting of 13 parallel, staggered protofilaments of alternating a-tubulin and B-tubulin subunits. The se- 
quences of the æ- and B-subunits of tubulin are homologous, and the aB-tubulin dimers are quite stable if Ca?* 
is present. The dimer is dissociated only by strong denaturing agents. 


Three Classes of Motor Proteins Move Intracellular Cargo 


Three principal classes of motor proteins move organelles and other cellular cargo 
on cytoskeletal filament highways in both eukaryotic and prokaryotic cells. In addi- 
tion to the myosins, most cells contain kinesins and dyneins. Humans possess 
40 genes for myosins, 45 for kinesins, and at least 14 for dyneins (Table 16.1). The 
large number of genes in each class reflects specialized structures required for a va- 
riety of functions. This diversity notwithstanding, these three classes of motor pro- 
teins share remarkable similarities of structure and function, as we shall see. 
Kinesin 1, also called conventional kinesin, is a tetramer consisting of a dimer of 
heavy chains (110 kD) associated with two light chains (65 kD). The heavy chains 
contain an N-terminal motor domain, a long coiled-coil stalk with a central hinge, 
and a globular C-terminal tail domain where the light chains bind (Figure 16.14). 
The motor domain binds to tubulin in microtubules, and the globular tail domain 
associates with the intended cellular cargo, for example, an organelle, an mRNA 
molecule, or an intermediate filament. In different kinesin families, the motor 








TABLE 16.1 | 
Number of Genes 

Genome Kinesins Dyneins Myosins 
Giardia lamblia (protozoan parasite) 25 10 0 
Saccharomyces cerevisiae (yeast) 25 1 5 
Drosophila melanogaster (fruit fly) 25 13 13 
Caenorhabditis elegans (roundworm) 20 2 17 
Arabidopsis thaliana (flowering plant) 61 0 17 
Homo sapiens (human) 45 14-16 40 
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FIGURE 16.14 (a) Domain structure of kinesins, myosin 
V, and cytoplasmic dynein. (b) Molecular models of 
kinesin 1, myosin V, and cytoplasmic dynein. (Adapted 
from Vale, R., 2003. The molecular motor toolbox for intracellular 
transport. Cell 112:467-480.) 
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Effectors of Microtubule Polymerization as Therapeutic Agents 


Microtubules in eukaryotic cells are important for the mainte- 
nance and modulation of cell shape and the disposition of intra- 
cellular elements during the growth cycle and mitosis. It may 
thus come as no surprise that the inhibition of microtubule 
polymerization can block many normal cellular processes. The 
alkaloid colchicine (see accompanying figure), a constituent of 
the swollen, underground stems of the autumn crocus (Colchi- 
cum autumnale) and meadow saffron, inhibits the polymerization HCO — C 
of tubulin into microtubules. This effect blocks the mitotic cycle ` f 
of plants and animals. Colchicine also inhibits cell motility and O 
intracellular transport of vesicles and organelles (which in turn HCO 
blocks secretory processes of cells). Colchicine has been used 

for hundreds of years to alleviate some of the acute pain of gout C — OCH; 
and rheumatism. In gout, white cell lysosomes surround and 
engulf small crystals of uric acid. The subsequent rupture of the 
lysosomes and the attendant lysis of the white cells initiate an 
inflammatory response that causes intense pain. The mecha- 
nism of pain alleviation by colchicine is not known for certain, 
but appears to involve inhibition of white cell movement in tis- HeC—O 
sues. Interestingly, colchicine’s ability to inhibit mitosis has A 
given it an important role in the commercial development of 

new varieties of agricultural and ornamental plants. When mito- HC — O 
sis is blocked by colchicine, the treated cells may be left with an ` 
extra set of chromosomes. Plants with extra sets of chromo- 
somes are typically larger and more vigorous than normal 
plants. Flowers developed in this way may grow with double the 
normal number of petals, and fruits may produce much larger Colchicine 


Another class of alkaloids, the vinca alkaloids from Vinca rosea, 
the Madagascar periwinkle, can also bind to tubulin and inhibit 
microtubule polymerization. Vinblastine and vincristine are used 
as potent agents for cancer chemotherapy because of their ability 
to inhibit the growth of fast-growing tumor cells. For reasons that 
are not well understood, colchicine is not an effective chemother- 
apeutic agent, although it appears to act similarly to the vinca 
alkaloids in inhibiting tubulin polymerization. 

The antitumor drug taxol was originally isolated from the bark 
of Taxus brevifolia, the Pacific yew tree. Like vinblastine and col- 
chicine, taxol inhibits cell replication by acting on microtubules. 
Unlike these other antimitotic drugs, however, taxol stimulates 
microtubule polymerization and stabilizes microtubules. The re- 
markable success of taxol in the treatment of breast and ovarian 
cancers stimulated research efforts to synthesize taxol directly 
and to identify new antimitotic agents that, like taxol, stimulate 
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domain is located in different places in the sequence, depending on the function 
of the specific family. 

The first dyneins to be discovered were axonemal dyneins, which cause sliding of 
microtubules in cilia and flagella. Cytoplasmic dyneins were first identified in 
Caenorhabditis elegans, a nematode worm. Cytoplasmic dynein consists of a dimer of 
two heavy chains (500 kD) with several other tightly associated light chains (Figure 
16.14). Each heavy chain contains a large motor domain (380 kD) encompassing six 
AAA-+ domains (see Section 16.4) arranged as a hexamer. A 10-nm stalk composed 
of a coiled coil projects from the head, between the fourth and the fifth AAA+ do- 
mains. The stalk is the microtubule-binding domain. 

Myosin V is a multimeric protein that consists of 16 polypeptide chains. The 
structure is built around a dimer of heavy chains, each of which includes head, 
neck, and tail domains. The heavy chain head domain is virtually indistinguish- 
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able from the head domain of myosin II from skeletal muscle (see Figure 16.5), 
but the neck domain is three times longer than the myosin II neck helix and it 
contains six repeats of a calmodulin-binding domain. Myosin V is normally associ- 
ated with an essential light chain (similar to that of myosin II), together with sev- 
eral calmodulins. Adjoining the neck is a 30-nm-long coiled-coil domain. The tail 
domain of myosin V also binds a light chain and is adapted to bind specific or- 
ganelles and other cargoes. 


Dyneins Move Organelles in a Plus-to-Minus Direction; Kinesins, 
in a Minus-to-Plus Direction—Mostly 


The mechanisms of intracellular, microtubule-based transport of organelles and vesi- 
cles were first elucidated in studies of axons, the long projections of neurons that ex- 
tend geat distances from the body of the cell. In these cells, it was found that sub- 
cellular organelles and vesicles could travel at surprisingly fast rates—as great as 1000 
to 2000 nm/sec—in either direction. 

Cytosolic dyneins specifically move organelles and vesicles from the plus end of a 
microtubule to the minus end. Thus, dyneins move vesicles and organelles from the 
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FIGURE 16.15 (a) Rapid axonal transport along micro- 
tubules permits the exchange of material between the 
synaptic terminal and the body of the nerve cell. (b) Vesi- 
cles, multivesicular bodies, and mitochondria are carried 
through the axon by this mechanism. (Adapted from a 
drawing by Ronald Vale.) 
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LIRE Processivity of Motor Proteins 
Distance Processivity 
Traveled (average 
Rate of Before number of % Chance of 
Movement Step Size Dissociating steps before Dissociating 
Motor (nm/sec) (nm) (nm) dissociating) in One Step 
Kinesin 1 600 8 800-1200 100-120 csl 
Myosin V 1000 36 700-2100 20-60 ~2-5 
Dynein 600 24-32 1000 30-40 ~2-3 
(cytoplasmic) 














cell periphery toward the centrosome (or, in an axon, from the synaptic termini to- 
ward the cell body). Most kinesins, on the other hand, assist the movement of or- 
ganelles and vesicles from the minus end to the plus end of microtubules, resulting 
in outward movement of organelles and vesicles (Figure 16.15). Certain unconven- 
tional kinesins move in the opposite direction, transporting cargo in the plus-to-mi- 
nus direction on microtubules. These kinesins have their motor domain located at 
the C-terminus of the polypeptide (see Figure 16.14). 


Cytoskeletal Motors Are Highly Processive 


The motors that move organelles and other cellular cargo on microtubules and 
actin filaments must be processive, meaning that they must make many steps along 
their cellular journey without letting go of their filamentous highway. Dyneins, 
nonskeletal myosins, and most kinesins are processive motors (Table 16.2). Motors 
in all these classes can carry cargo over roughly similar distances (700 to 2100 nm) 
before dissociating.! The step size of kinesin 1 is smaller than those of myosin V and 
cytoplasmic dynein; thus, its overall processivity (the average number of steps made 
before dissociating) is necessarily higher. Moving at rates of 600 to 1000 nm/sec, 
these motors can carry their cargoes for a second or more before dissociating from 
their filaments. 


ATP Binding and Hydrolysis Drive Hand-over-Hand 
Movement of Kinesin 


Kinesin movement along microtubules is driven by the cycle of ATP binding and hy- 
drolysis. The molecular details are similar in some ways to those of the skeletal mus- 
cle myosin—actin motor but are quite different in other ways. Kinesins, like skeletal 
muscle myosin, contain switch 1 and switch 2 domains that open and close in re- 
sponse to ATP binding and hydrolysis. Together these switches act as a “y-phosphate 
sensor,” which can detect the presence or absence of the y-phosphate of an adenine 
nucleotide in the active site. The switch movements between the ATP-bound and the 
ADP-bound states thus induce conformation changes that are propagated through 
a relay helix to a neck linker that rotates, in ways similar to skeletal muscle myosin 
(Figure 16.16). Thus, just as in skeletal myosin, small movements of the y-phosphate sensor 
at the ATP site ave translated into piston-like movement of a relay helix and then into rotations 
of the neck linker that result in motor movement. 

Here the kinesin and myosin models diverge, however, because the dimer of 
kinesin heavy chains translates these ATP-induced conformation changes into a 
hand-over-hand movement of its motor domain heads along the microtubule fila- 
ment. Ronald Vale and Ronald Milligan have likened this movement of kinesin 
heads along a microtubule to a judo expert throwing an opponent with a forward 
swing of the arm. 

A model of kinesin motor movement is shown in Figure 16.17. Kinesin heads in 
solution (that is, not attached to a microtubule) contain tightly bound ADP. Bind- 


‘Compare these distances with the dimensions of typical cells in Table 1.2. 
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FIGURE 16.16 Ribbon structures of the myosin and kinesin motor domains and the conformational changes 
triggered by the relay helix. The upper panels represent the motor domains of myosin and kinesin, respectively, in 
the ATP- or ADP-P;-like state. Similar structural elements in the catalytic cores of the two domains are shown in 
blue, the relay helices are dark green, and the mechanical elements (neck linker for kinesin, lever arm domains for 
myosin) are yellow. The nucleotide is shown as a white space-filling model. The similarity of the conformation 
changes caused by the relay helix in going from the ATP/ADP-P;-bound state to the ADP-bound or nucleotide- 
free state is shown in the lower panels. In both cases, the mechanical elements of the protein shift their positions 
in response to relay helix motion. Note that the direction of mechanical element motion is nearly perpendicular to 
the relay helix motion. (Adapted from Vale, R. D., and Milligan, R. A., 2000. The way things move: Looking under the hood of 
molecular motor proteins. Science 288:88-95.) 


ing of one head of a kinesin multimer to a microtubule causes dissociation of ADP 
from that head. ATP binds rapidly, triggering the neck linker to rotate or ratchet 
forward, throwing the second head forward as well and bringing it near the next 
binding site on the microtubule, 8 nm farther along the filament. The trailing head 
then hydrolyzes ATP and releases inorganic phosphate (but retains ADP), inducing 
its neck linker to return to its original orientation relative to the head. Exchange 
of ADP for ATP on the forward head begins the cycle again. The structure of the 
kinesin—microtubule complex (Figure 16.18) shows the switch 2 helix of kinesin in 
intimate contact with the microtubule at the junction of the a- and -subunits of a 
tubulin dimer. 


The Conformation Change That Leads to Movement Is Different 
in Myosins and Dyneins 


The movement of myosin motors on cytoskeletal actin filaments is presumed to be 
similar to the myosin—actin interaction in skeletal muscle. Clearly, however, the dif- 
ferent structure of the dynein hexameric motor domain and its associated coiled- 
coil stalk (see Figure 16.14) must represent a different motor mechanism. ATP- 
dependent conformation changes in the ring of AAA+ modules must be translated 
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FIGURE 16.17 A model for the motility cycle of kinesin. 
The two heads of the kinesin dimer work together to 
move processively along a microtubule. Frame 1: Each 
kinesin head is bound to the tubulin surface. The heads 
are connected to the coiled coil by “neck linker” segments 
(orange and red). Frame 2: Conformation changes in the 
neck linkers flip the trailing head by 160°, over and be- 
yond the leading head and toward the next tubulin bind- 
ing site. Frame 3: The new leading head binds to a new 
site on the tubulin surface (with ADP dissociation), com- 
pleting an 80 A movement of the coiled coil and the 
kinesin’s cargo. During this time, the trailing head hydro- 
lyzes ATP to ADP and P;.Frame 4: ATP binds to the leading 
head, and P; dissociates from the trailing head, completing 
the cycle. (Adapted from Vale, R. and Milligan, R, 2000. The way 
things move: Looking under the hood of molecular motor proteins. 
Science 288:88-95.) 
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FIGURE 16.18 In the kinesin-microtubule complex, the 
switch 2 helix (yellow) of kinesin (left) lies in contact 
with the microtubule (right) at the subunit interface of a 
tubulin dimer (pdb id = 2HXH). 
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FIGURE 16.19 A mechanism for the dynein power stroke involves conformation changes in the head domain 
(a) that facilitate movement of the stalk along a microtubule (b). ATP binding to the motor domain promotes 
dissociation of dynein from the microtubule. Hydrolysis of ATP causes a conformation change that primes the 
structure for a power stroke. Microtubule movement is initiated by tight binding to the tip of the stalk, which 
promotes a conformation change in the head ring (the power stroke). Release of ADP and P; from the catalytic 
site causes tilting of the stalk at the end of the cycle. (Adapted from Oiwa, K., and Sakakibara, H., 2005. Recent progress 
in dynein structure and mechanism. Current Opinion in Cell Biology 17:98-103.) 


into movements of the tip of the coiled-coil stalk along a microtubule. A proposed 
mechanism for dynein movement (Figure 16.19) suggests that the events of ATP 
binding and hydrolysis and ADP and P; release at an AAA+ module swing a linker 
that joins the AAA+ domain and the dynein tail. 


16:4: How Do Molecular Motors Unwind DNA? 


The ability of proteins to move in controlled ways along nucleic acid chains is im- 
portant to many biological processes. For example, when DNA is to be replicated, 
the strands of the double helix must be unwound and separated to expose single- 
stranded DNA templates. Similarly, histone octamers (Figure 11.26) slide along 
DNA strands in chromatin remodeling, Holliday junctions (see Figure 28.22) move, 
and nucleic acids move in and out of viral capsids. The motor proteins that move 
directionally along nucleic acid strands and accomplish these many functions are 
called translocases. The translocases that unwind DNA or RNA duplex substrates 
are termed helicases. Thus, all helicases are translocases, but not vice versa. 
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TABLE 16.3 
Quanterary Direction of Representative 
Family Subunit Structure Structure Movement Motor 
SFIA Superfamily 1 Monomeric A* Rep 
SFIB B RecD 
I | Core domains 
A | Accessory domains 
DNA binding DNA binding Unknown 
SF2 Superfamily 2 Monomeric AorB NS3 
Protease Protein:protein? 
SF3 Superfamily 3 (BPV E1) Hexameric A BPV El 
Origin binding 
SF4 Superfamily 4 Hexameric B T7 gp4 
Primase 
SF5 Superfamily 5 Hexameric B Rho 
Origin-binding domain 
SF6 Superfamily 6 (MCM) Hexameric AorB FtsK 
a i 
Zn binding 








*A: helicase moves 3'—>5' on nucleic acid. B: helicase moves 5’-3’. 


All translocases and helicases are members of six protein “superfamilies” (Table 
16.3 and Figure 16.20), all of them related evolutionarily to RecA, a DNA-binding 
protein (pages 881-882). Motors of superfamily 1 (SF1) and superfamily 2 (SF2) 
consist of two RecA domains in a tandem repeat. Motors of SF3 through SF6 are 
built from single RecA domain peptides that associate to form hexamers and dode- 
camers. Each superfamily possesses characteristic conserved residues and sequence 
elements (Table 16.3), most of which are shared between several superfamilies. All 
members of a given superfamily move in the same direction on a DNA or RNA tem- 
plate (either 5’ to 3’ or 3’ to 5’). The hexameric motor proteins of the SF3 and SF6 
superfamilies are members of the ancient AAA+ ATPase family. (AAA stands for 
“ATPases associated with various cellular activities,” and the “+” sign refers to an ex- 
panded definition of the family characteristics.) 

Translocases and helicases, like other molecular motors, require energy for their 
function. The energy for movement along a nucleic acid strand, as well as for sepa- 
ration of the strands of a duplex (DNA or RNA), is provided by hydrolysis of ATP. 
Translocases and helicases move on nucleic acid strands at rates of a few base pairs 
to several thousand base pairs per second. These movements are carefully regulated 
by accessory proteins in nearly all cases. Translocases and helicases typically move 
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along the DNA or RNA lattice for long distances without dissociating. This is termed 

Gene wf ee processive movement, and helicases are said to have a high processivity. For example, 
the E. coli BCD helicase, which is involved in recombination processes, can unwind 

oa 33,000 base pairs before it dissociates from the DNA lattice. Processive movement is es- 


sential for helicases involved in DNA replication, where millions of base pairs must be 
replicated rapidly. 

Helicases have evolved at least two structural and functional strategies for achiev- 
ing high processivity. The hexameric helicases (of the SF3 through SF6 superfamilies) 
form ringlike structures that completely encircle at least one of the strands of a DNA 
duplex. The SF1 and SF2 helicases, notably Rep helicase from E. coli, are monomeric 
or homodimeric and move processively along the DNA helix by means of a “hand- 
over-hand” movement that is remarkably similar to that of kinesin’s movement along 
microtubules. A key feature of hand-over-hand movement of a dimeric motor protein 
along a polymer is that at least one of the motor subunits must be bound to the poly- 
mer at any moment. 


Negative Cooperativity Facilitates Hand-over-Hand Movement 


How does hand-over-hand movement of a motor protein along a polymer occur? 
Clues have come from the structures of Rep helicase and its complexes with DNA. 
The Rep helicase from £. coli is a 76-kD protein that is monomeric in the absence 
of DNA. Binding of Rep helicase to either single-stranded or double-stranded DNA 
induces dimerization, and the Rep dimer is the active species in unwinding DNA. 
Each subunit of the Rep dimer can bind either single-stranded (ss) or double- 
stranded (ds) DNA. However, the binding of Rep dimer subunits to DNA is negatively co- 
operative (see Chapter 15). Once the first Rep subunit is bound, the affinity of DNA 
for the second subunit is at least 10,000 times weaker than that for the first! This 
negative cooperativity provides an obvious advantage for hand-over-hand walking. 
When one “hand” has bound the polymer substrate, the other “hand” releases. A 
conformation change could then move the unbound “hand” one step farther 
along the polymer where it can bind again. 





FIGURE 16.20 (a) Translocase and helicase motors of But what would provide the energy for such a conformation change? ATP 
SF1 and SF2 are monomers that consist of two RecA hydrolysis is the driving force for Rep helicase movement along DNA, and the neg- 
domains in a tandem repeat (pdb id = 1QHG). ative cooperativity of Rep binding to DNA is regulated by nucleotide binding. In the 


(b) Motor peptides of SF3 through SF6 associate to 


absence of nucleotide, a Rep dimer is favored, in which only one subunit is bound 
form hexamers (as shown) or dodecamers 


to ssDNA. In Figure 16.21a, this state is represented as PS [a Rep dimer (Po) bound 


(pdb id = 1CRO). 
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FIGURE 16.21 (a) A hand-over-hand model for movement along (and unwinding of ) DNA by 
E.coli Rep helicase. The PS state consists of a Rep dimer bound to ssDNA. The PSD state in- 
volves one Rep monomer bound to ssDNA and the other bound to dsDNA. The P-S; state has 
ssDNA bound to each Rep monomer. ATP binding and hydrolysis control the interconversion of 
these states and walking along the DNA substrate. (b) Crystal structure of the E. coli Rep helicase 
monomer with bound ssDNA (dark blue, ball and stick) and ADP (red). The monomer consists of 
four domains designated 1A (residues 1-84 and 196-276), 1B (residues 85-195), 2A (residues 
277-373 and 543-670), and 2B (residues 374-542). The open (purple) and closed (green) confor- 
mations of the 2B domain are superimposed in this figure (pdb id = 1UAA). (From Korolev, S., 
Hsieh, J., Gauss, G., Lohman, T. L., and Waksman, G., 1997. Major domain swiveling revealed by the crystal struc- 
tures of complexes of E.coli Rep helicase bound to single-stranded DNA and ADP. Cell 90:635-647. Reprinted 
by permission of Cell Press.) 
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to ssDNA (S)]. Timothy Lohman and his colleagues at Washington University in 
St. Louis have shown that binding of ATP analogs induces formation of a complex 
of the Rep dimer with both ssDNA and dsDNA, one to each Rep subunit (shown as 
PSD in Figure 16.21a). In their model, unwinding of the dsDNA and ATP hydroly- 
sis occur at this point, leaving a PəSo state in which both Rep subunits are bound to 
ssDNA. Dissociation of ADP and P; leave the PoS state again (Figure 16.21a). 

Work by Lohman and his colleagues has shown that coupling of ATP hydroly- 
sis and hand-over-hand movement of Rep over the DNA involves the existence of 
the Rep dimer in an asymmetric state. A crystal structure of the Rep dimer in com- 
plex with ssDNA and ADP shows that the two Rep monomers are in different con- 
formations (Figure 16.21b). The two conformations differ by a 130° rotation about 
a hinge region between two subdomains within the monomer subunit. The hand- 
over-hand walking of the Rep dimer along the DNA surface may involve alterna- 
tion of each subunit between these two conformations, with coordination of the 
movements by nucleotide binding and hydrolysis. 


Papillomavirus E1 Helicase Moves along DNA on a Spiral Staircase 


Papillomaviruses are tumor viruses that cause both cancerous and benign lesions in 
a host. Replication of papillomaviral DNA within a host cell requires the multifunc- 
tional 605-residue viral El protein. Monomers of El assemble at a replication ori- 
gin on DNA and form a pair of hexamers that wrap around a single strand of DNA. 
These assemblies are helicases that operate bidirectionally in the replication of 
viral DNA. The N-terminal half of the El protein includes a regulatory domain and 
a sequence-specific DNA-binding domain, whereas the helicase activity is located in 
the C-terminal half of the protein. The C-terminal helicase domain (Figure 16.22) 
includes a segment involved in oligomer formation (residues 300 to 378) and an 
AAA+ domain (residues 379 to 605). 

AAA+ domains are found in proteins of many functions, including motor activ- 
ity by dyneins (see Section 16.3) and helicases, protein degradation by proteasomes 
(see Chapter 31), and disassembly of SNARE complexes (see Chapter 9). This ubiq- 
uitous module consists of two subdomains: an N-terminal segment known as an 
a/B Rossman fold, and a C-terminal a-helical domain (Figure 16.23). The Rossman 
fold is wedge-shaped and has a -sheet of parallel strands in a 65-61-64-63-62 pat- 
tern. Key features of this fold include a Lys residue in the Walker A motif, an 
Asp-Asp or Asp-Glu pair in the Walker B motif, and a crucial Arg residue in a struc- 
ture called an arginine finger. These three motifs are essential for ATP binding and 
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FIGURE 16.22 (a) The papillomavirus E1 protein is a 605- 
residue monomer that forms hexameric assemblies at 
specific sites on single-stranded DNA (pdb id = 2V9P). 
(b) The C-terminal helicase domain shown here 
includes an oligomerization domain (magenta) and the 
AAA+ domain (blue). 


FIGURE 16.23 The AAA+ domain is composed of an 
N-terminal, wedge-shaped Rossman fold and a 
C-terminal a-helical domain (upper left). The P-loop 
(red), the Walker A motif (purple), the Walker B motif 
(yellow), and the arginine finger (blue) are shown. The 
ATP-binding sites lie between subunits of the hexamer. 
Each ATP site includes the arginine finger of one sub- 
unit and the Walker A and Walker B motifs of the adja- 
cent subunit (edb id = 1D2N). 
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FIGURE 16.24 A view of the E1 helicase along the 
ssDNA axis, showing the DNA-binding hairpin loops 
from each monomer (colored) interacting with the 
phosphates of DNA. DNA is shown as a ball-and-stick 
model in the center of the structure (pdb id = 2GXA). 


FIGURE 16.25 The hairpin loops of each subunit in the 
E1 helicase interact with two adjacent nucleotides in 
the DNA chain. Interactions include an ionic bond 
between Lys? (yellow) and a DNA phosphate oxygen, a 
hydrogen bond between His?’ and the phosphate of 
an adjacent nucleotide, and van der Waals interactions 
between the aromatic rings of Phe** (purple) and His°°” 
(olive) and the aliphatic chain of Lys°®, with the sugar 
linking the two phosphates (pdb id = 2GXA). 





hydrolysis. In an AAA+ hexamer, the ATP-binding sites lie at the interface between 
any two subunits, involving the arginine finger of any given subunit and the Walker 
A and Walker B motifs of the adjacent subunit. 

The structure of a large fragment (residues 306 to 577) of the papillomavirus E1 
protein bound to a segment of ssDNA (Figure 16.24) reveals the remarkable mech- 
anism by which this helicase traverses a DNA chain. The oligomerization domains 
form a symmetric hexamer, but the six AAA+ domains each display a unique con- 
formation. The DNA strand is bound in the center pore of the AAA+ hexamer, with 
six nucleotides of the DNA chain each bound to residues from each of the protein 
subunits. The crucial nucleotide-binding residues include Lys°°’ and His°” on a 
hairpin loop and Phe***, all of which face the center pore of the protein (Figure 
16.25). Lys*' interacts with one DNA phosphate oxygen, and His*” forms a hydro- 
gen bond with the phosphate of an adjacent nucleotide in the DNA chain. The 
aliphatic portion of Lys* and the aromatic groups of Phe**! and His?” share van 
der Waals interactions with the DNA sugar moiety linking these two phosphates. The 
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FIGURE 16.26 The hairpin loops of the E1 helicase hexamer are arranged in a spiral staircase that winds around 
the DNA strand (a) side view; (b) axial view (pdb id = 2GXA). As the helicase moves along the strand, the hair- 
pin loop of one protein monomer binds each nucleotide as it enters the central cavity of the helicase. The loop 
adopts six conformations (a) as the helicase moves along the DNA, preserving the loop—nucleotide interaction 
until the nucleotide exits the cavity. The released protein loop then returns to the other end of the cavity to 
bind a new, incoming nucleotide. DNA is shown as a stick structure. His? of each hairpin loop is shown in 
space-filling mode. 


hairpin loops of the six protein subunits form a spiral staircase, following the ssDNA as it 
threads through the central pore of the hexamer (Figure 16.26). 

Eric Enemark and Leemor Joshua-Tor have suggested that a central hairpin 
loop from one of the AAA+ subunits coordinates each DNA nucleotide as it enters 
the helicase pore. Then, as each AAA+ domain proceeds through the intermedi- 
ate states of ATP binding and hydrolysis, its hairpin loop steps down through the 
six conformations of the staircase, maintaining continuous contact with its nu- 
cleotide, as it escorts it through the pore, finally releasing the nucleotide as it ex- 
its the pore. Following release, the hairpin moves back to the top of the staircase, 
picks up the next available nucleotide, and begins another journey down the stair- 
case. For one full cycle of the hexamer, each subunit hydrolyzes one ATP, releases 
one ADP, and translocates one nucleotide through the central pore. A full cycle 
thus translocates six nucleotides with associated hydrolysis of six ATPs and release 
of six ADPs. 


1625» How Do Bacterial Flagella Use a Proton Gradient 
to Drive Rotation? 


Bacterial cells swim and move by rotating their flagella. The flagella of E. coli are he- 
lical filaments up to 15,000 nm (15 um) in length and 15 nm in diameter. The di- 
rection of rotation of these filaments affects the movements of the cell. When the 
half-dozen flagella on the surface of the bacterial cell rotate in a counterclockwise 
(CCW) direction, they twist and bundle together in a left-handed helical structure 
and rotate in a concerted fashion, propelling the cell through the medium. Every 
few seconds, the flagellar motor reverses, the helical bundle of filaments (now turn- 
ing clockwise, or CW) unwinds into a jumble, and the bacterium somersaults or 
tumbles. Alternating between CCW and CW rotations, the bacterium can move to- 
ward food sources, such as amino acids and sugars. 

The rotations of bacterial flagellar filaments are the result of the rotation of 
motor protein complexes in the bacterial plasma membrane. 
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The Flagellar Rotor Is a Complex Structure 


The flagellum is built from at least 25 proteins and comprises three parts: a rotary 
motor anchored in the bacterial inner membrane, a long filament that serves as a 
helical propellor, and a “hook” that functions as a universal joint that connects the 
motor with the filament (Figure 16.27). The rotary motor includes several rings of 
protein subunits, including the C ring, the MS ring, the P ring, and the L ring. The 
MS ring is built from 26 copies of the protein FliF. The C ring is attached to the MS 
ring and includes three “rotor” proteins—FliG, FliM, and FliN—involved in rota- 
tion of the motor. The C ring includes 26 copies of FliG, 34 copies of FliM, and 
34 X 4 = 136 copies of FIiN. The stationary portion of the motor—the “stator”—is 
formed from the proteins motA and motB. Eight motA,—motB» complexes are em- 
bedded in the bacterial inner membrane around the MS ring. 


Gradients of H* and Na* Drive Flagellar Rotors 


What energy source drives the flagellar motor? Gradients of protons and Na* ions 
exist across bacterial inner membranes, typically with more H* and Na* outside 
the cell. In E. coli, spontaneous inward flow of protons through the motA—motB 
complexes drives the rotation of the motor (Figure 16.28). In Vibrio cholerae, inward 
Na* ion flow powers the motor. Flagellar motors are thus energy conversion de- 
vices. In E. coli, each motA—motB complex passes 70 H* per revolution of the mo- 
tor. With a full complement of eight motA—motB complexes, a motor conducts 
about 560 protons per revolution. The H*-driven flagellar rotors reach top rota- 
tional speeds of about 360 Hz (corresponding to 21,600 rpm). Thus, the overall 
rate of proton flow for the motor is approximately 200,000 H*/sec! Flagellar mo- 
tors driven by Na* ions are even faster, with rotational rates of 1700 Hz (100,000 
rpm) observed in Vibrio. 

The motA—motB complexes work with FliG in the C ring to transfer protons 
across the membrane. FliG contains 335 residues, and most of the FliG protein 
structure (residues 104 to 335) consists of two compact domains joined by an 
a-helix (Figure 16.28). A ridge on the C-terminal domain contains five charged 
residues that interact with motA and are important for motor rotation. Asp”? of 
motB is essential for rotation of the motor and is probably involved in proton trans- 
fer. David Blair has proposed a model for creation of two membrane channels from 
the transmembrane segments of the motA,-motBy complex. Blair has suggested 
that each encounter of a motA-motB complex with a FliG subunit as the motor 
turns results in movement of one proton through each of these channels. The pas- 
sage of about 70 H* through each motA—motB complex in one revolution of the 


FIGURE 16.27 A model of the E.coli flagellar motor. The motor 
is anchored by interactions of stationary motA and motB 
proteins in the M and S rings with the inner membrane. 
Spontaneous flow of protons through the motA-motB com- 
plexes and into the cell drives the rotation of the motor. Flow 
rates of 200,000 protons per second drive the motor at 
speeds approaching 22,000 rpm. (Adapted from Thomas, D.R., 
Morgan, D. G, Francis, N. R., and DeRosier, D. J., 2007. Bit by bit the struc- 
ture of the complete flagellar hook/basal body complex. Microscopy 
and Microanalysis 13:34-35. Image provided by David J. DeRosier, 





Brandeis University.) 
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motor (which would involve encounters with 34 FliG subunits) is consistent with 
this suggestion (70/34 = ~2). 


The Flagellar Rotor Self-Assembles in a Spontaneous Process 


Flagellar rotors are true masterpieces of biological self-assembly. The ring of FliF 
subunits, within the MS ring, is the first to assemble in the plasma membrane. Other 
proteins then attach to this ring one after another, from the base to the tip, to con- 
struct the motor structure. Once the motor has formed, the flexible hook and the 
flagellar filament are assembled. Precise recognition of the existing template struc- 
ture allows this highly ordered self-assembly process to proceed without error. The 
flagellar filament is made from 20,000 to 30,000 copies of flagellin polymerized into 
a hollow helical tube structure. Each turn of the helical filament contains about 
5000 flagellin subunits and is about 2300 nm long. A complete flagellum can have 
up to six full helical turns. Flagellin molecules are transported through the long, 
narrow, central channel of the motor and flagellum from the cell interior to the far 
end of the flagellum, where they self-assemble with the help of a pentameric com- 
plex of FliD, a capping protein (see Figure 16.27). The FliD complex has a plate 
and five leg domains. It rotates in a stepping fashion at the end of the filament, ex- 
posing one binding site at a time and guiding the binding of newly arriving flagellin 
molecules in a helical pattern. 


Flagellar Filaments Are Composed of Protofilaments of Flagellin 


Each cylindrical flagellar filament is composed of 11 fibrils or protofilaments that 
form the cylinder, with each fibril lying at a slight tilt to the cylinder axis (Figure 
16.29a). An end-on view of the filament shows 11 subunits, each representing the 
end of a protofilament (Figure 16.29b). The flagellin protein of Salmonella typhi- 


FIGURE 16.28 Interactions between the stationary 
motA-motB complexes and the rotating FliG ring drive 
the flagellar motor. Proton flow through the motA-motB 
complexes is presumably coupled to conformation 
changes that alter ionic interactions between charged 
residues at the motA—motB and FIiG interface, driving 
rotation of the FliG ring. Other conserved features 
include a hydrophobic patch (light green), a Gly-Gly 
motif (purple), and a EHPQR motif (blue, in the middle 
domain (pdb id = 1LKV). 
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FIGURE 16.29 The E coli flagellum is composed of 11 protofilaments that run the length of the flagellar fila- 
ment. The filament is shown in cross section (a) and perpendicular to the filament (b). The protofilaments are 
long polymers of the flagellin protein (c), which consists of two a-helical domains (DO and D1) that lie at a 
slight tilt to the filament axis and two B-sheet domains (D2 and D3) that extend outward from the filament. 
The N- and C-termini of the polypeptide are indicated (pdb id = 1UCU). (Parts (a) and (b) courtesy of Keiichi Namba, 
Osaka University, Japan.) 


murium contains 494 residues and consists of four domains, denoted DO through D3 
(Figure 16.29c). DO and D1 are composed of a-helices, whereas D2 and D3 consist 
primarily of 6-strands. The N-terminus of the peptide chain lies at the base of DO. 
The peptide runs from DO to D3 and then reverses and returns to DO, where the 
N- and C-termini are juxtaposed. The structure resembles a Greek capital gamma 
(I), with a height of 140 A and a width of 110 A. Each flagellin protein is arranged 
with DO inside the filament and D3 facing the outside. The central pore, 20 Ain 
diameter, is lined by the a-helices of DO. 


Motor Reversal Involves Conformation Switching of Motor 
and Filament Proteins 


The flagellar motor reverses direction every few seconds so that the bacterium 
can change its swimming direction to seek better environments. Motor reversal 
involves conformation changes both in motor proteins and also in the filament 
itself. In the motor structure, the rotor proteins FliG, FliM, and FliN work to- 
gether to control direction changes of the motor, and they are known collectively 
as the switch complex. FliN appears to lie at the base of the C ring, FIG lies at 
the top of the C ring, and FliM resides in the middle, contacting both FliN and 
FliG (Figure 16.30). 

Reversal of the flagellar motor causes the long filament to switch from a left- 
handed helical structure to a right-handed helical form. This makes the bundle of 
flagella fall apart, causing the bacterium to tumble. This left-to-right switch in the 
filament is caused by a conformational change that occurs in the flagellin subunits 
in some protofilaments. Interestingly, the driving force for these conformation 
changes is probably the torque applied to DO and D1 of flagellin subunits along the 
filament when the motor itself reverses. 


rotor consists of the rotor proteins FliG, FliM, and FIiN. Interactions between these 
three proteins are presumed to control the direction of the rotor. Direction changes 
initiated here are communicated by protein conformation changes across the motor 


complex and throughout the length of the filament. Self-association of FliM subunits is 
mediated by hydrophilic residues of the a1 helix (red) on one subunit and on a short helix 
and loop on the adjacent subunit. Juxtaposed FIiN subunits in the ring form a hydropho- 
bic cleft (yellow). (FliG: pdb id = 1LKV; FliM: pdb id = 2HP7; FIIN: pdb id = 1YAB.) (Image on 


left courtesy of David J. DeRosier, Brandeis University.) 





FIGURE 16.30 The switch complex that controls direction changes by the flagellar \, 
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SUMMARY 


16.1 What Is a Molecular Motor? Motor proteins, also known as mol- 
ecular motors, use chemical energy (ATP) to orchestrate different 
movements, transforming ATP energy into the mechanical energy of 
motion. In all cases, ATP hydrolysis is presumed to drive and control 
protein conformational changes that result in sliding or walking move- 
ments of one molecule relative to another. To carry out directed move- 
ments, molecular motors must be able to associate and dissociate 


reversibly with a polymeric protein array, a surface, or substructure in 
the cell. ATP hydrolysis drives the process by which the motor protein 
ratchets along the protein array or surface. Molecular motors may be 
linear or rotating. Linear motors crawl or creep along a polymer lattice, 
whereas rotating motors consist of a rotating element (the “rotor”) and 
a stationary element (the “stator”), in a fashion much like a simple elec- 
trical motor. 
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16.2 What Is the Molecular Mechanism of Muscle Contraction? Exam- 
ination of myofibrils in the electron microscope reveals a banded or stri- 
ated structure. The so-called H zone shows a regular, hexagonally 
arranged array of thick filaments, whereas the I band shows a regular, 
hexagonal array of thin filaments. In the dark regions at the ends of each 
A band, the thin and thick filaments interdigitate. The thin filaments are 
composed primarily of three proteins called actin, troponin, and tropo- 
myosin. The thick filaments consist mainly of a protein called myosin. 
The thin and thick filaments are joined by cross-bridges. These cross- 
bridges are actually extensions of the myosin molecules, and muscle con- 
traction is accomplished by the sliding of the cross-bridges along the thin 
filaments, a mechanical movement driven by the free energy of ATP 
hydrolysis. 

Myosin, the principal component of muscle thick filaments, is a large 
protein consisting of six polypeptides, including light chains and heavy 
chains. The heavy chains consist of globular amino-terminal myosin 
heads, joined to long a-helical carboxy-terminal segments, the tails. 
These tails are intertwined to form a left-handed coiled coil approxi- 
mately 2 nm in diameter and 130 to 150 nm long. The myosin heads 
exhibit ATPase activity, and hydrolysis of ATP by the myosin heads drives 
muscle contraction. 

The free energy of ATP hydrolysis is translated into a conformation 
change in the myosin head, so dissociation of myosin and actin, hydro- 
lysis of ATP, and rebinding of myosin and actin occur with stepwise 
movement of the myosin S1 head along the actin filament. The confor- 
mation change in the myosin head is driven by the hydrolysis of ATP. 


16.3 What Are the Molecular Motors That Orchestrate the Mechano- 
chemistry of Microtubules? Microtubules are hollow, cylindrical struc- 
tures, approximately 30 nm in diameter, formed from tubulin, a dimeric 
protein composed of two similar 55-kD subunits known as @-tubulin and 
6-tubulin. Tubulin dimers polymerize to form microtubules, which are es- 
sentially helical structures, with 13 tubulin monomer “residues” per turn. 
Microtubules are, in fact, a significant part of the cytoskeleton, a sort of in- 
tracellular scaffold formed of microtubules, intermediate filaments, and 
microfilaments. In most cells, microtubules are oriented with their minus 
ends toward the centrosome and their plus ends toward the cell periphery. 
This consistent orientation is important for mechanisms of intracellular 


transport. Microtubules are also the fundamental building blocks of eu- 
karyotic cilia and flagella. Microtubules also mediate the intracellular mo- 
tion of organelles and vesicles. 


16.4 How Do Molecular Motors Unwind DNA? When DNA is to be 
replicated or repaired, the strands of the double helix must be unwound 
and separated to form single-stranded DNA intermediates. This separa- 
tion is carried out by molecular motors known as DNA helicases that 
move along the length of the DNA lattice, sequentially destabilizing the 
hydrogen bonds between complementary base pairs. The movement 
along the lattice and the separation of the DNA strands are coupled to 
the hydrolysis of nucleoside 5’-triphosphates. The Æ. coli BCD helicase, 
which is involved in recombination processes, can unwind 33,000 base 
pairs before it dissociates from the DNA lattice. Processive movement is 
essential for helicases involved in DNA replication, where millions of 
base pairs must be replicated rapidly. Certain hexameric helicases form 
ringlike structures that completely encircle at least one of the strands of 
a DNA duplex. Other helicases, notably Rep helicase from Æ. coli, are 
homodimeric and move processively along the DNA helix by means of a 
“hand-over-hand” movement that is remarkably similar to that of ki- 
nesin’s movement along microtubules. 


16.5 How Do Bacterial Flagella Use a Proton Gradient to Drive Rotation? 
Bacterial cells swim and move by rotating their flagella. The direction of 
rotation of these flagella affects the movements of the cell. When the half- 
dozen flagella on the surface of the bacterial cell rotate in a counter- 
clockwise direction, they twist and bundle together and rotate in a con- 
certed fashion, propelling the cell through the medium. The rotations of 
bacterial flagellar filaments are the result of the rotation of motor protein 
complexes in the bacterial plasma membrane. The flagellar motor con- 
sists of multiple rings (including the MS ring and the C ring). The rings 
are surrounded by a circular array of membrane proteins. In all, at least 
40 genes appear to code for proteins involved in this magnificent assem- 
bly. One of these, the motB protein, lies on the edge of the M ring, where 
it interacts with the motA protein, located in the membrane protein array 
and facing the M ring. In contrast to the many other motor proteins de- 
scribed in this chapter, a proton gradient, not ATP hydrolysis, drives the 
flagellar motor. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. The cheetah is generally regarded as nature’s fastest mammal, but 
another amazing athlete in the animal kingdom (and almost as fast 
as the cheetah) is the pronghorn antelope, which roams the plains 
of Wyoming. Whereas the cheetah can maintain its top speed of 
70 mph for only a few seconds, the pronghorn antelope can run at 
60 mph for about an hour! (It is thought to have evolved to do so 
in order to elude now-extinct ancestral cheetahs that lived in North 
America.) What differences would you expect in the muscle struc- 
ture and anatomy of pronghorn antelopes that could account for 
their remarkable speed and endurance? 

2. An ATP analog, B,y-methylene-ATP, in which a —CH»)— group re- 
places the oxygen atom between the £- and y-phosphorus atoms, is a 
potent inhibitor of muscle contraction. At which step in the contrac- 
tion cycle would you expect B,y-methylene-ATP to block contraction? 

3. ATP stores in muscle are augmented or supplemented by stores of 
phosphocreatine. During periods of contraction, phosphocreatine 
is hydrolyzed to drive the synthesis of needed ATP in the creatine 
kinase reaction: 


Phosphocreatine + ADP——> creatine + ATP 


Muscle cells contain two different isozymes of creatine kinase, one 
in the mitochondria and one in the sarcoplasm. Explain. 


4. Rigor is a muscle condition in which muscle fibers, depleted of ATP 
and phosphocreatine, develop a state of extreme rigidity and can- 
not be easily extended. (In death, this state is called rigor mortis, the 
rigor of death.) From what you have learned about muscle contrac- 
tion, explain the state of rigor in molecular terms. 

5. Skeletal muscle can generate approximately 3 to 4 kg of tension or 
force per square centimeter of cross-sectional area. This number is 
roughly the same for all mammals. Because many human muscles 
have large cross-sectional areas, the force that these muscles can 
(and must) generate is prodigious. The gluteus maximus (on which 
you are probably sitting as you read this) can generate a tension of 
1200 kg! Estimate the cross-sectional area of all of the muscles in 
your body and the total force that your skeletal muscles could gen- 
erate if they all contracted at once. 


6. Calculate a diameter for a tubulin monomer, assuming that the mono- 
mer MW is 55,000, that the monomer is spherical, and that the den- 
sity of the protein monomer is 1.3 g/mL. How does the number that 
you calculate compare to the dimension portrayed in Figure 16.12? 


7. Use the number you obtained in problem 6 to calculate how many 
tubulin monomers would be found in a microtubule that stretched 
across the length of a liver cell. (See Table 1.2 for the diameter of a 
liver cell.) 

8. The giant axon of the squid may be up to 4 inches in length. Use 
the value cited in this chapter for the rate of movement of vesicles 


and organelles across axons to determine the time required for a 
vesicle to traverse the length of this axon. 

9. As noted in this chapter, the myosin molecules in thick filaments of 
muscle are offset by approximately 14 nm. To how many residues of 
a coiled-coil structure does this correspond? 

10. Use the equations of Chapter 9 to determine the free energy dif- 
ference represented by a Ca®* gradient across the sarcoplasmic 
reticulum membrane if the luminal (inside) concentration of 
Ca?* is 1 mM and the concentration of Ca?* in the solution 
bathing the muscle fibers is 1 uM. 

11. Use the equations of Chapter 3 to determine the free energy of hy- 
drolysis of ATP by the sarcoplasmic reticulum Ca-ATPase if the con- 
centration of ATP is 3 mM, the concentration of ADP is 1 mM, and 
the concentration of P; is 2 mM. 

12. Under the conditions described in problems 10 and 11, what is the 
maximum number of Ca?* ions that could be transported per ATP 
hydrolyzed by the Ca-ATPase? 

13. For each of the motor proteins in Table 16.2, calculate the force ex- 
erted over the step size given, assuming that the free energy of hy- 
drolysis of ATP under cellular conditions is —50 kJ/mol. 

14. When you go to the gym to work out, you not only exercise many 
muscles but also involve many myosins (and actins) in any given ex- 
ercise activity. Suppose you lift a 10-kg weight a total distance of 
0.4 m. Using the data in Table 16.2 for myosin, calculate the mini- 
mum number of myosin heads required to lift this weight and the 
number of sliding steps these myosins must make along their asso- 
ciated actin filaments. 

15. In which of the following tissues would you expect to find smooth 
muscle? 

. Arteries 

. Stomach 

. Urinary bladder 

. Diaphragm 

. Uterus 

. The gums in your mouth 


moaneawe 


16. When an action potential (nerve impulse) arrives at a muscle mem- 
brane (sarcolemma), in what order do the following events occur? 
a. Release of Ca** ions from the sarcoplasmic reticulum 
b. Hydrolysis of ATP, with release of energy 
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c. Detachment of myosin from actin 
d. Sliding of myosin along actin filament 
e. Opening of switch 1 and switch 2 on myosin head 


17. (Essay question.) You are invited by the National Science Founda- 
tion to attend a scientific meeting to set the agenda for funding of 
basic research related to molecular motors for the next 10 years. 
Only basic research will be funded, ruling out studies on human 
subjects. You are asked to suggest the research area most worthy of 
scientific research. Your presentation must include (1) a brief back- 
ground on what we currently know about the subject; (2) identifi- 
cation of a key research topic about which more needs to be known; 
and (3) a justification of why additional knowledge in this area is 
critical for advancing the field (that is, why investigations in this 
area are especially important). You are not being asked to provide 
the methods or experiments that might be used to address the 
problem—only the concept. Base your presentation on what you 
have learned in this chapter (you may consult and include refer- 
ences from the Further Reading section), and limit your presenta- 
tion to 300 words. 


Preparing for the MCAT Exam 


18. Consult Figure 16.17 and use the data in problem 8 to determine 
how many steps a kinesin motor must take to traverse the length of 
the squid giant axon. 


19. When athletes overexert themselves on hot days, they often suffer 

immobility from painful muscle cramps. Which of the following is 

a reasonable hypothesis to explain such cramps? 

a. Muscle cells do not have enough ATP for normal muscle relax- 
ation. 

b. Excessive sweating has affected the salt balance within the muscles. 

c. Prolonged contractions have temporarily interrupted blood flow 
to parts of the muscle. 

d. All of the above. 


20. Duchenne muscular dystrophy is a sex-linked recessive disorder 
associated with severe deterioration of muscle tissue. The gene for 
the disease: 

a. is inherited by males from their mothers. 

b. should be more common in females than in males. 
c. both a and b. 

d. neither a nor b. 
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Metabolism: An Overview 


ESSENTIAL QUESTION 


The word metabolism derives from the Greek word for “change.” Metabolism repre- 
sents the sum of the chemical changes that convert nutrients, the “raw materials” 
necessary to nourish living organisms, into energy and the chemically complex 
finished products of cells. Metabolism consists of literally hundreds of enzymatic 
reactions organized into discrete pathways. These pathways proceed in a stepwise 
fashion, transforming substrates into end products through many specific chemical 
intermediates. Metabolism is sometimes referred to as intermediary metabolism to 
reflect this aspect of the process. 


What are the anabolic and catabolic processes that satisfy the metabolic needs 
of the cell? 


tæl Is Metabolism Similar in Different Organisms? 


One of the great unifying principles of modern biology is that organisms show marked 
similarity in their major pathways of metabolism. Given the almost unlimited possibil- 
ities within organic chemistry, this generality would appear most unlikely. Yet it’s true, 
and it provides strong evidence that all life has descended from a common ancestral 
form. All forms of nutrition and almost all metabolic pathways evolved in early prokary- 
otes prior to the appearance of eukaryotes 1 billion years ago. For example, glycolysis, 
the metabolic pathway by which energy is released from glucose and captured in the 
form of ATP under anaerobic conditions, is common to almost every cell. It is believed 
to be the most ancient of metabolic pathways, having arisen prior to the appearance 
of oxygen in abundance in the atmosphere. All organisms, even those that can syn- 
thesize their own glucose, are capable of glucose degradation and ATP synthesis via gly- 
colysis. Other prominent pathways are also virtually ubiquitous among organisms. 


Living Things Exhibit Metabolic Diversity 


Although most cells have the same basic set of central metabolic pathways, different 
cells (and, by extension, different organisms) are characterized by the alternative 
pathways they might express. These pathways offer a wide diversity of metabolic pos- 
sibilities. For instance, organisms are often classified according to the major meta- 
bolic pathways they exploit to obtain carbon or energy. Classification based on 
carbon requirements defines two major groups: autotrophs and heterotrophs. 
Autotrophs are organisms that can use just carbon dioxide as their sole source of 
carbon. Heterotrophs require an organic form of carbon, such as glucose, in order 
to synthesize other essential carbon compounds. 

Classification based on energy sources also gives two groups: phototrophs and 
chemotrophs. Phototrophs are photosynthetic organisms, which use light as a source of 
energy. Chemotrophs use organic compounds such as glucose or, in some instances, 
oxidizable inorganic substances such as Fe?*, NO.~, NH,*, or elemental sulfur as sole 
sources of energy. Typically, the energy is extracted through oxidation-reduction re- 
actions. Based on these characteristics, every organism falls into one of four categories 
(Table 17.1). 


Metabolic Diversity Among the Five Kingdoms Prokaryotes (the kingdom 
Monera—archaea and bacteria) show a greater metabolic diversity than all the 
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Anise swallowtail butterfly (Papilio zelicans) with its 
pupal case. Metamorphosis of butterflies is a dra- 
matic example of metabolic change. 





Allis flux, nothing stays still. 
Nothing endures but change. 


Heraclitus (c. 540-c. 480 B.C.) 
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EVES VA Metabolic Classi on of Orc According to Their Carbon an 


Classification Carbon Source Energy Source Electron Donors 


Examples 





Photoautotrophs CO» Light H20, HS, S, other inorganic 


compounds 


Green plants, algae, cyanobacteria, 
photosynthetic bacteria 
Photoheterotrophs Organic 
compounds 


CO, 


Light Organic compounds Nonsulfur purple bacteria 
Oxidation-reduction 


reactions 


Chemoautotrophs Inorganic compounds: Ho, HəS, 
NH,"*, NO, , Fe?*, Mn?+ 
Organic compounds 


(e.g., glucose) 


Nitrifying bacteria; hydrogen, sulfur, 
and iron bacteria 
Oxidation—reduction 
reactions 


Chemoheterotrophs Organic 


compounds 


All animals, most microorganisms, 
nonphotosynthetic plant tissue such as 
roots, photosynthetic cells in the dark 








four eukaryotic kingdoms (Protoctista [previously called Protozoa], Fungi, Plants, 
and Animals) put together. Prokaryotes are variously chemoheterotrophic, pho- 
toautotrophic, photoheterotrophic, or chemoautotrophic. No protoctista are 
chemoautotrophs; fungi and animals are exclusively chemoheterotrophs; plants 
are characteristically photoautotrophs, although some are heterotrophic in their 
mode of carbon acquisition. 


Oxygen Is Essential to Life for Aerobes 


A further metabolic distinction among organisms is whether or not they can use 
oxygen as an electron acceptor in energy-producing pathways. Those that can are 
called aerobes or aerobic organisms; others, termed anaerobes, can subsist without 
Os. Organisms for which Og is obligatory for life are called obligate aerobes; 
humans are an example. Some species, the so-called facultative anaerobes, can 
adapt to anaerobic conditions by substituting other electron acceptors for Os in 
their energy-producing pathways; Escherichia coli is an example. Yet others cannot 
use oxygen at all and are even poisoned by it; these are the obligate anaerobes. 
Clostridium botulinum, the bacterium that produces botulin toxin, is representative. 


The Flow of Energy in the Biosphere and the Carbon and Oxygen 


Photoautotrophic Heterotrophic : 
cells cells Cycles Are Intimately Related 
` HO a The primary source of energy for life is the sun. Photoautotrophs utilize light energy 


2 
to drive the synthesis of organic molecules, such as carbohydrates, from atmospheric 


CO» and water (Figure 17.1). Heterotrophic cells then use these organic products of 
photosynthetic cells both as fuels and as building blocks, or precursors, for the 
biosynthesis of their own unique complement of biomolecules. Ultimately, CO» is 


co, 


FIGURE 17.1 The flow of energy in the biosphere is cou- 
pled primarily to the carbon and oxygen cycles. 


A DEEPER LOOK 


Calcium Carbonate—A Biological Sink for CO, 





A major biological sink for CO» that is often overlooked is the 
calcium carbonate shells of corals, molluscs, and crustacea. 
These invertebrate animals deposit CaCO; in the form of pro- 
tective exoskeletons. In some invertebrates, such as the sclerac- 
tinians (hard corals) of tropical seas, photosynthetic dinoflagel- 
lates (kingdom Protoctista) known as zooxanthellae live within the 


Ca?t + 2 HCO; —=CaCOx) + HCO; 
HCO; =— H,O + CO, 


animal cells as endosymbionts. These phototrophic cells use 
light to drive the resynthesis of organic molecules from CO» re- 
leased (as bicarbonate ion) by the animal’s metabolic activity. In 
the presence of Ca?*, the photosynthetic CO, fixation “pulls” the 
deposition of CaCOs, as summarized in the following coupled 
reactions: 





HO + CO.—-> carbohydrate + O» 








172 


the end product of heterotrophic carbon metabolism, and CO, is returned to the at- 
mosphere for reuse by the photoautotrophs. In effect, solar energy is converted to 
the chemical energy of organic molecules by photoautotrophs, and heterotrophs 
recover this energy by metabolizing the organic substances. The flow of energy in the 
biosphere is thus conveyed within the carbon cycle, and the impetus driving the 
cycle is light energy. 


‘1722s What Can Be Learned from Metabolic Maps? 


Metabolic maps (Figure 17.2) portray the principal reactions of the intermediary 
metabolism of carbohydrates, lipids, amino acids, nucleotides, and their deriva- 
tives. These maps are very complex at first glance and seem to be virtually impos- 
sible to learn easily. Despite their appearance, these maps become easy to follow 
once the major metabolic routes are known and their functions are understood. 
The underlying order of metabolism and the important interrelationships between 
the various pathways then appear as simple patterns against the seemingly compli- 
cated background. 


The Metabolic Map Can Be Viewed as a Set of Dots and Lines 


One interesting transformation of the intermediary metabolism map is to represent 
each intermediate as a black dot and each enzyme as a line (Figure 17.3). Then, the 
more than 1000 different enzymes and substrates are represented by just two symbols. 
This chart has about 520 dots (intermediates). Table 17.2 lists the numbers of dots that 
have one or two or more lines (enzymes) associated with them. Thus, this table classi- 
fies intermediates by the number of enzymes that act upon them. A dot connected to 
just a single line must be either a nutrient, a storage form, an end product, or an ex- 
cretory product of metabolism. Also, because many pathways tend to proceed in only 
one direction (that is, they are essentially irreversible under physiological conditions), 
a dot connected to just two lines is probably an intermediate in only one pathway and 
has only one fate in metabolism. If three lines are connected to a dot, that intermedi- 
ate has at least two possible metabolic fates; four lines, three fates; and so on. Note that 
about 80% of the intermediates connect to only one or two lines and thus have only a 
single role in the cell. However, intermediates at branch points are subject to a variety 
of fates. In such instances, the pathway followed is an important regulatory choice. In- 
deed, whether any substrate is routed down a particular metabolic pathway is the con- 
sequence of a regulatory decision made in response to the cell’s (or organism’s) mo- 
mentary requirements for energy or nutrition. The regulation of metabolism is an 
interesting and important subject to which we will return often. 


Alternative Models Can Provide New Insights into Pathways 


Alternative mappings of metabolic reactions have been postulated for several rea- 
sons. First and most obviously, the sheer complexity of pathways has prompted 
biochemists to seek simpler portrayals of an organism’s chemistry. Second, tradi- 
tional metabolite-focused maps (Figure 17.4a) do not provide insight into the spa- 
tial and temporal organization of the metabolites and the enzymes that intercon- 
vert them. Even more significantly, the rise of genomics (the study of the whole 
genomes of organisms), transcriptomics (the study of global messenger RNA ex- 
pression), and proteomics (the study of the totality of proteins) has provoked 
fresh conceptions of biological order and function. For example, Juliet Gerrard 
has proposed that metabolic maps be recast in protein-centric presentations (Figure 
17.4b). In such maps, the metabolites and the enzymes that interconvert them are 
transposed, revealing a new emphasis—the metabolites are “signals” in a cellular 
network of proteins. 

Protein-centric maps may be condensed and simplified by realizing that some 
pathway enzymes are clustered in multienzyme complexes and that metabolites are 
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CENGAGENOW” ANIMATED FIGURE 17.2 A metabolic map, indicating the reactions of intermediary metabo- 
lism and the enzymes that catalyze them. More than 500 different chemical intermediates, or metabolites, and 
a greater number of enzymes are represented here. (Source: From Donald Nicholson, Map #22, © International Union of 
Biochemistry and Molecular Biology.) See this figure animated at www.cengage.com/login. 
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sing pathways known as glycolysis and the citric acid cycle. (Adapted from Alberts, B., et al., 1989. Molecular Bio 
ed.N 


FIGURE 17.3 The metabolic map as a set of dots and lines. The heavy dots and lines trace the central ener 
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FIGURE 17.4 (a) The traditional view of a metabolic pathway is 
metabolite-centric. (b) Gerrard has proposed that a protein-centric 
view is more informative for some purposes. (c) A simplified version 
of the protein-centric view where proteins in the pathway form 
multifunctional complexes. 


literally passed from enzyme to enzyme within such clusters (Figure 17.4c). The re- 
sult is a simplified representation of metabolic networks, containing only the es- 
sential signaling information. Metabolic maps are representations of large amounts 
of information. Conceptualizing them in different formats enables biochemists to 
analyze vast amounts of information in new and insightful ways. 


Multienzyme Systems May Take Different Forms 


The individual metabolic pathways of anabolism and catabolism consist of se- 
quential enzymatic steps (Figure 17.5). Several types of organization are possible. 
The enzymes of some multienzyme systems may exist as physically separate, soluble 
entities, with diffusing intermediates (Figure 17.5a). In other instances, the en- 
zymes of a pathway are collected to form a discrete multienzyme complex, and the 
substrate is sequentially modified as it is passed along from enzyme to enzyme (Fig- 
ure 17.5b). This type of organization has the advantage that intermediates are not 
lost or diluted by diffusion. In a third pattern of organization, the enzymes common 
to a pathway reside together as a membrane-bound system (Figure 17.5c). In this case, 
the enzyme participants (and perhaps the substrates as well) must diffuse in just the 
two dimensions of the membrane to interact with their neighbors. 

As research reveals the ultrastructural organization of the cell in ever greater de- 
tail, more and more of the so-called soluble enzyme systems are found to be physi- 
cally united into functional complexes. Thus, in many (perhaps all) metabolic path- 
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ways, the consecutively acting enzymes are associated into stable multienzyme com- 
plexes that are sometimes referred to as metabolons, a word meaning “units of 
metabolism.” 


17238 How Do Anabolic and Catabolic Processes Form 
the Core of Metabolic Pathways? 


Metabolism serves two fundamentally different purposes: the generation of energy 
to drive vital functions and the synthesis of biological molecules. To achieve these 
ends, metabolism consists largely of two contrasting processes: catabolism and an- 
abolism. Catabolic pathways are characteristically energy yielding, whereas anabolic path- 
ways are energy requiring. Catabolism involves the oxidative degradation of complex 
nutrient molecules (carbohydrates, lipids, and proteins) obtained either from the 
environment or from cellular reserves. The breakdown of these molecules by ca- 
tabolism leads to the formation of simpler molecules such as lactic acid, ethanol, 
carbon dioxide, urea, or ammonia. Catabolic reactions are usually exergonic, and 
often the chemical energy released is captured in the form of ATP (see Chapter 3). 
Because catabolism is oxidative for the most part, part of the chemical energy may 
be conserved as energy-rich electrons transferred to the coenzymes NAD* and 
NADP*. These two reduced coenzymes have very different metabolic roles: NAD+ 
reduction is part of catabolism; NADPH oxidation is an important aspect of 
anabolism. The energy released upon oxidation of NADH is coupled to the phos- 
phorylation of ADP in aerobic cells, and so NADH oxidation back to NAD* serves 
to generate more ATP; in contrast, NADPH is the source of the reducing power 
needed to drive reductive biosynthetic reactions. 

Thermodynamic considerations demand that the energy necessary for biosyn- 
thesis of any substance exceed the energy available from its catabolism. Otherwise, 


FIGURE 17.5 Schematic representation of types of multi- 
enzyme systems carrying out a metabolic pathway: 

(a) Physically separate, soluble enzymes with diffusing 
intermediates. (b) A multienzyme complex. Substrate 
enters the complex and becomes bound and then se- 
quentially modified by enzymes E; to E; before product 
is released. No intermediates are free to diffuse away. 

(c) Amembrane-bound multienzyme system. 
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FIGURE 17.6 Energy relationships between the path- 
ways of catabolism and anabolism. Oxidative, exergonic 
pathways of catabolism release free energy and reduc- 
ing power that are captured in the form of ATP and 
NADPH, respectively. Anabolic processes are endergonic, 
consuming chemical energy in the form of ATP and 
using NADPH as a source of high-energy electrons for 
reductive purposes. 


organisms could achieve the status of perpetual motion machines: A few molecules 
of substrate whose catabolism yielded more ATP than required for its resynthesis 
would allow the cell to cycle this substance and harvest an endless supply of energy. 


Anabolism Is Biosynthesis 


Anabolism is a synthetic process in which the varied and complex biomolecules (pro- 
teins, nucleic acids, polysaccharides, and lipids) are assembled from simpler precur- 
sors. Such biosynthesis involves the formation of new covalent bonds, and an input 
of chemical energy is necessary to drive such endergonic processes. The ATP gener- 
ated by catabolism provides this energy. Furthermore, NADPH is an excellent donor 
of high-energy electrons for the reductive reactions of anabolism. Despite their di- 
vergent roles, anabolism and catabolism are interrelated in that the products of one 
provide the substrates of the other (Figure 17.6). Many metabolic intermediates are 
shared between the two processes, and the precursors needed by anabolic pathways 
are found among the products of catabolism. 


Anabolism and Catabolism Are Not Mutually Exclusive 


Interestingly, anabolism and catabolism occur simultaneously in the cell. The con- 
flicting demands of concomitant catabolism and anabolism are managed by cells 
in two ways. First, the cell maintains tight and separate regulation of both catabo- 
lism and anabolism, so metabolic needs are served in an immediate and orderly 
fashion. Second, competing metabolic pathways are often localized within differ- 
ent cellular compartments. Isolating opposing activities within distinct compart- 
ments, such as separate organelles, avoids interference between them. For exam- 
ple, the enzymes responsible for catabolism of fatty acids, the fatty acid oxidation 
pathway, are localized within mitochondria. In contrast, fatty acid biosynthesis takes 
place in the cytosol. In subsequent chapters, we shall see that the particular mole- 
cular interactions responsible for the regulation of metabolism become important 
for an understanding and appreciation of metabolic biochemistry. 


The Pathways of Catabolism Converge to a Few End Products 


If we survey the catabolism of the principal energy-yielding nutrients (carbohy- 
drates, lipids, and proteins) in a typical heterotrophic cell, we see that the degra- 
dation of these substances involves a succession of enzymatic reactions. In the pres- 
ence of oxygen (aerobic catabolism), these molecules are degraded ultimately to 
carbon dioxide, water, and ammonia. Aerobic catabolism consists of three distinct 
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stages. In stage 1, the nutrient macromolecules are broken down into their respec- 
tive building blocks. Despite the great diversity of macromolecules, these building 
blocks represent a rather limited number of products. Proteins yield up their 
20 component amino acids, polysaccharides give rise to carbohydrate units that are 
convertible to glucose, and lipids are broken down into glycerol and fatty acids 
(Figure 17.7). 

In stage 2, the collection of product building blocks generated in stage 1 is further 
degraded to yield an even more limited set of simpler metabolic intermediates. The 








co, 











FIGURE 17.7 The three stages of catabolism. 

Stage 1: Proteins, polysaccharides, and lipids are broken 
down into their component building blocks, which are 
relatively few in number. Stage 2: The various building 
blocks are degraded into the common product, the 
acetyl groups of acetyl-CoA. Stage 3: Catabolism con- 
verges to three principal end products: water, carbon 
dioxide, and ammonia. 
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Amphi is from the Greek for “on both sides.” 


deamination of amino acids leaves a-keto acid carbon skeletons. Several of these 
a-keto acids are citric acid cycle intermediates and are fed directly into stage 3 catab- 
olism via this cycle. Others are converted either to the three-carbon a-keto acid pyru- 
vate or to the acetyl groups of acetyl-coenzyme A (acetyl-CoA). Glucose and the glycerol 
from lipids also generate pyruvate, whereas the fatty acids are broken into two-carbon 
units that appear as acetyl-CoA. Because pyruvate also gives rise to acetyl-CoA, we see 
that the degradation of macromolecular nutrients converges to a common end prod- 
uct, acetyl-CoA (Figure 17.7). 

The combustion of the acetyl groups of acetyl-CoA by the citric acid cycle and ox- 
idative phosphorylation to produce CO, and H20 represents stage 3 of catabolism. The 
end products of the citric acid cycle, CO, and HO, are the ultimate waste products 
of aerobic catabolism. As we shall see in Chapter 19, the oxidation of acetyl-CoA dur- 
ing stage 3 metabolism generates most of the energy produced by the cell. 


Anabolic Pathways Diverge, Synthesizing an Astounding Variety 
of Biomolecules from a Limited Set of Building Blocks 


A rather limited collection of simple precursor molecules is sufficient to provide for 
the biosynthesis of virtually any cellular constituent, be it protein, nucleic acid, 
lipid, or polysaccharide. All of these substances are constructed from appropriate 
building blocks via the pathways of anabolism. In turn, the building blocks (amino 
acids, nucleotides, sugars, and fatty acids) can be generated from metabolites in the 
cell. For example, amino acids can be formed by amination of the corresponding 
a-keto acid carbon skeletons, and pyruvate can be converted to hexoses for poly- 
saccharide biosynthesis. 


Amphibolic Intermediates Play Dual Roles 


Certain of the central pathways of intermediary metabolism, such as the citric acid 
cycle, and many metabolites of other pathways have dual purposes—they serve in both 
catabolism and anabolism. This dual nature is reflected in the designation of such 
pathways as amphibolic rather than solely catabolic or anabolic. In any event, in con- 
trast to catabolism—which converges to the common intermediate, acetylCoA—the 
pathways of anabolism diverge from a small group of simple metabolic intermediates 
to yield a spectacular variety of cellular constituents. 


Corresponding Pathways of Catabolism and Anabolism Differ 
in Important Ways 


The anabolic pathway for synthesis of a given end product usually does not precisely 
match the pathway used for catabolism of the same substance. Some of the interme- 
diates may be common to steps in both pathways, but different enzymatic reactions 
and unique metabolites characterize other steps. A good example of these differences 
is found in a comparison of the catabolism of glucose to pyruvic acid by the pathway 
of glycolysis and the biosynthesis of glucose from pyruvate by the pathway called glu- 
coneogenesis. The glycolytic pathway from glucose to pyruvate consists of 10 enzymes. 
Although it may seem efficient for glucose synthesis from pyruvate to proceed by a re- 
versal of all 10 steps, gluconeogenesis uses only seven of the glycolytic enzymes in re- 
verse, replacing those remaining with four enzymes specific to glucose biosynthesis. 
In similar fashion, the pathway responsible for degrading proteins to amino acids dif- 
fers from the protein synthesis system, and the oxidative degradation of fatty acids to 
two-carbon acetyl-CoA groups does not follow the same reaction path as the biosyn- 
thesis of fatty acids from acetyl-CoA. 


Metabolic Regulation Requires Different Pathways for Oppositely Directed 
Metabolic Sequences A second reason for different pathways serving in opposite 
metabolic directions is that such pathways must be independently regulated. If ca- 
tabolism and anabolism passed along the same set of metabolic tracks, equilibrium 
considerations would dictate that slowing the traffic in one direction by inhibiting 
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a particular enzymatic reaction would necessarily slow traffic in the opposite direc- 
tion. Independent regulation of anabolism and catabolism can be accomplished 
only if these two contrasting processes move along different routes or, in the case of 
shared pathways, the rate-limiting steps serving as the points of regulation are cat- 
alyzed by enzymes that are unique to each opposing sequence (Figure 17.8). 


ATP Serves in a Cellular Energy Cycle 


We saw in Chapter 3 that ATP is the energy currency of cells. In phototrophs, ATP 
is one of the two energy-rich primary products resulting from the transformation of 
light energy into chemical energy. (The other is NADPH; see the following discus- 
sion.) In heterotrophs, the pathways of catabolism have as their major purpose the 
release of free energy that can be captured in the form of energy-rich phosphoric 
anhydride bonds in ATP. In turn, ATP provides the energy that drives the manifold 
activities of all living cells—the synthesis of complex biomolecules, the osmotic 
work involved in transporting substances into cells, the work of cell motility, and the 
work of muscle contraction. These diverse activities are all powered by energy re- 
leased in the hydrolysis of ATP to ADP and P;. Thus, there is an energy cycle in cells 
where ATP serves as the vessel carrying energy from photosynthesis or catabolism to 
the energy-requiring processes unique to living cells (Figure 17.9). 
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FIGURE 17.8 Parallel pathways of catabolism and an- 
abolism must differ in at least one metabolic step in 
order that they can be regulated independently. Shown 
here are two possible arrangements of opposing cata- 
bolic and anabolic sequences between A and P. (a) The 
parallel sequences proceed via independent routes. 

(b) Only one reaction has two different enzymes, a cata- 
bolic one (E3) and its anabolic counterpart (Es). These 
provide sites for regulation. 


FIGURE 17.9 The ATP cycle in cells. ATP is formed via 

photosynthesis in phototrophic cells or catabolism in 
heterotrophic cells. Energy-requiring cellular activities 
are powered by ATP hydrolysis, liberating ADP and P.. 
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FIGURE 17.10 Comparison of the state of reduction 
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—COOH (carboxyls) > CO, (carbon dioxide, the final reduced reduced 
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NAD* Collects Electrons Released in Catabolism 


The substrates of catabolism—proteins, carbohydrates, and lipids—are good sources 
of chemical energy because the carbon atoms in these molecules are in a relatively 
reduced state (Figure 17.10). In the oxidative reactions of catabolism, reducing 
equivalents are released from these substrates, often in the form of hydride ions (a 
proton coupled with two electrons, H:~). These hydride ions are transferred in en- 
zymatic dehydrogenase reactions from the substrates to NAD+ molecules, reducing 
them to NADH. A second proton accompanies these reactions, appearing in the 
overall equation as H* (Figure 17.11). In turn, NADH is oxidized back to NAD* 
when it transfers its reducing equivalents to electron acceptor systems that are part 
of the metabolic apparatus of the mitochondria. The ultimate oxidizing agent (e~ ac- 
ceptor) is Oo, becoming reduced to HO. 

Oxidation reactions are exergonic, and the energy released is coupled with the 
formation of ATP in a process called oxidative phosphorylation. The NAD*-NADH 
system can be viewed as a shuttle that carries the electrons released from catabolic 
substrates to the mitochondria, where they are transferred to Os, the ultimate elec- 
tron acceptor in catabolism. In the process, the free energy released is trapped in 
ATP. The NADH cycle is an important player in the transformation of the chemical 
energy of carbon compounds into the chemical energy of phosphoric anhydride 
bonds. Such transformations of energy from one form to another are referred to as 
energy transduction. Oxidative phosphorylation is one cellular mechanism for en- 
ergy transduction. Chapter 20 is devoted to electron transport reactions and oxida- 
tive phosphorylation. 
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FIGURE 17.11 Hydrogen and electrons released in the course of oxidative catabolism are transferred as hydride 
ions to the pyridine nucleotide, NAD*, to form NADH + H* in dehydrogenase reactions of the type 


AH, + NAD*t —> A + NADH + H+ 


The reaction shown is catalyzed by alcohol dehydrogenase. 
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NADPH Provides the Reducing Power for Anabolic Processes 


Whereas catabolism is fundamentally an oxidative process, anabolism is, by its con- 
trasting nature, reductive. The biosynthesis of the complex constituents of the cell 
begins at the level of intermediates derived from the degradative pathways of catab- 
olism; or, less commonly, biosynthesis begins with oxidized substances available in 
the inanimate environment, such as carbon dioxide. When the hydrocarbon chains 
of fatty acids are assembled from acetyl-CoA units, activated hydrogens are needed 
to reduce the carbonyl (C=O) carbon of acetyl-CoA into a —CH»— at every other 
position along the chain. When glucose is synthesized from CO, during photosyn- 
thesis in plants, reducing power is required. These reducing equivalents are pro- 
vided by NADPH, the usual source of high-energy hydrogens for reductive biosyn- 
thesis. NADPH is generated when NADP* is reduced with electrons in the form of 
hydride ions. In heterotrophic organisms, these electrons are removed from fuel 
molecules by NADP*-specific dehydrogenases. In these organisms, NADPH can be 
viewed as the carrier of electrons from catabolic reactions to anabolic reactions 
(Figure 17.12). In photosynthetic organisms, the energy of light is used to pull elec- 
trons from water and transfer them to NADP*; Os is a by-product of this process. 


Coenzymes and Vitamins Provide Unique Chemistry 
and Essential Nutrients to Pathways 


In addition to NAD* and NADPH, a variety of other small molecules are essential 
to metabolism. Some of these are essential nutrients called vitamins. (The name was 
coined by Kazimierz Funk, who discovered thiamine as a cure for beriberi in 1912 
and termed it a “vital amine.” He later proposed that other diseases might be cured 
by “vitamins.”) 

Vitamins are required in the diet, usually in trace amounts, because they cannot 
be synthesized by the organism itself. The requirement for any given vitamin de- 
pends on the organism. Not all vitamins are required by all organisms. Vitamins re- 
quired in the human diet are listed in Table 17.3. These important substances are 
traditionally distinguished as being either water soluble or fat soluble. 

Except for vitamin C (ascorbic acid), the water-soluble vitamins are all compo- 
nents or precursors of important biological substances known as coenzymes. These 
are low-molecular-weight molecules that bring unique chemical functionality to 
certain enzyme reactions. Coenzymes may also act as carriers of specific functional 
groups, such as methyl groups and acyl groups. The side chains of the common 
amino acids provide only a limited range of chemical reactivities and carrier prop- 
erties. Coenzymes, acting in concert with appropriate enzymes, provide a broader 
range of catalytic properties for the reactions of metabolism. Coenzymes are typi- 
cally modified by these reactions and are then converted back to their original 
forms by other enzymes, so small amounts of these substances can be used repeat- 
edly. The coenzymes derived from the water-soluble vitamins are listed in Table 
17.3. Each will be discussed in the context of the chemistry they provide to specific 
pathways in Chapters 18 through 27. The fat-soluble vitamins are not directly re- 
lated to coenzymes, but they play essential roles in a variety of critical biological 
processes, including vision, maintenance of bone structure, and blood coagula- 
tion. The mechanisms of action of fat-soluble vitamins are not as well understood 
as their water-soluble counterparts, but modern research efforts are gradually clos- 
ing this gap. 
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Armed with the knowledge that metabolism is organized into pathways of successive 
reactions, we can appreciate by hindsight the techniques employed by early bio- 
chemists to reveal their sequence. A major intellectual advance took place at the 
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FIGURE 17.12 Transfer of reducing equivalents from 
catabolism to anabolism via the NADPH cycle. 
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eee V 
Discussed 
Vitamin Coenzyme Form Function in Chapter 
Water-Soluble 
Thiamine (vitamin Bı) Thiamine pyrophosphate Decarboxylation of a-keto acids and formation and 19, 22 
cleavage of a-hydroxyketones 
Niacin (nicotinic acid) Nicotinamide adenine Hydride transfer 18-27 
dinucleotide (NAD*) 
Nicotinamide adenine dinucleo- Hydride transfer 21, 22, 24-26 
tide phosphate (NADP*) 
Riboflavin (vitamin Bo) Flavin adenine dinucleotide One- and two-electron transfer 19, 20, 23, 26 
(FAD) 
Flavin mononucleotide (FMN) One- and two-electron transfer 20 
Pantothenic acid Coenzyme A Activation of acyl groups for transfer by nucleophilic 19, 23, 24, 27 
attack, and activation of the a-hydrogen of the acyl 
group for abstraction as a proton 
Pyridoxal, pyridoxine, Pyridoxal phosphate Formation of stable Schiff base (aldimine) adducts 25 
pyridoxamine (vitamin Bg) with a-amino groups of amino acids; serving as an 
electron sink to stabilize reaction intermediates 
Cobalamin (vitamin Bj») 5'-Deoxyadenosylcobalamin Intramolecular rearrangement, reduction of 23 
ribonucleotides to deoxyribonucleotides, and 
methyl group transfer 
Methylcobalamin 
Biotin Biotin—lysine complexes Carrier of carboxyl groups in carboxylation 22, 24 
(biocytin) reactions 
Lipoic acid Lipoyl-lysine complexes Coupling acyl group transfer and electron transfer 19 
(lipoamide) during oxidation and decarboxylation of a-keto acids 
Folic acid Tetrahydrofolate Acceptor and donor of 1-C units for all oxidation 25, 26 
levels of carbon except that of COs 
Fat-Soluble 
Retinol (vitamin A) 
Retinal (vitamin A) 
Retinoic acid (vitamin A) 
Ergocalciferol (vitamin Də) 
Cholecalciferol (vitamin Ds) 
a-Tocopherol (vitamin E) 
Menaquinone (vitamin K) 











end of the 19th century when Eduard Buchner showed that the fermentation of glu- 
cose to yield ethanol and carbon dioxide can occur in extracts of broken yeast cells. 
Until this discovery, many thought that metabolism was a vital property, unique to 
intact cells; even the eminent microbiologist Louis Pasteur, who contributed so 
much to our understanding of fermentation, was a vitalist, one of those who be- 
lieved that the processes of living substance transcend the laws of chemistry and 
physics. After Buchner’s revelation, biochemists searched for intermediates in the 
transformation of glucose and soon learned that inorganic phosphate was essential 
to glucose breakdown. This observation gradually led to the discovery of a variety of 
phosphorylated organic compounds that serve as intermediates along the fermen- 
tative pathway. 

An important tool for elucidating the steps in the pathway was the use of meta- 
bolic inhibitors. Adding an enzyme inhibitor to a cell-free extract caused an accumu- 
lation of intermediates in the pathway prior to the point of inhibition (Figure 
17.13). Each inhibitor was specific for a particular site in the sequence of metabolic 
events. As the arsenal of inhibitors was expanded, the individual steps in metabo- 
lism were revealed. 
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FIGURE 17.13 The use of inhibitors to reveal the se- 
quence of reactions in a metabolic pathway. (a) Control: 
Under normal conditions, the steady-state concentra- 
tions of a series of intermediates will be determined by 
the relative activities of the enzymes in the pathway. 

(b) Plus inhibitor: In the presence of an inhibitor (in this 
case, an inhibitor of enzyme 4), intermediates upstream 
of the metabolic block (B, C, and D) accumulate, reveal- 
ing themselves as intermediates in the pathway. The 
concentration of intermediates lying downstream (E and 
F) will fall. 


Mutations Create Specific Metabolic Blocks 


Genetics provides an approach to the identification of intermediate steps in me- 
tabolism that is somewhat analogous to inhibition. Mutation in a gene encoding an 
enzyme often results in an inability to synthesize the enzyme in an active form. Such 
a defect leads to a block in the metabolic pathway at the point where the enzyme 
acts, and the enzyme’s substrate accumulates. Such genetic disorders are lethal if 
the end product of the pathway is essential or if the accumulated intermediates have 
toxic effects. In microorganisms, however, it is often possible to manipulate the 
growth medium so that essential end products are provided. Then the biochemical 
consequences of the mutation can be investigated. Studies on mutations in genes of 
the filamentous fungus Neurospora crassa led G. W. Beadle and E. L. Tatum to hy- 
pothesize in 1941 that genes are units of heredity that encode enzymes (a principle 
referred to as the “one gene—one enzyme” hypothesis). 


Isotopic Tracers Can Be Used as Metabolic Probes 


Another widely used approach to the elucidation of metabolic sequences is to “feed” 
cells a substrate or metabolic intermediate labeled with a particular isotopic form of 
an element that can be traced. Two sorts of isotopes are useful in this regard: radioac- 
tive isotopes, such as !C, and stable “heavy” isotopes, such as !8O or !°N (Table 17.4). 


TABLE 17.4 | 









Isotope Type Radiation Type Half-Life Relative Abundance* 
°H Stable 0.0154% 
°H Radioactive B= 12.1 years 

BG Stable 1.1% 

G Radioactive B- 5700 years 

5N Stable 0.365% 
180 Stable 0.204% 
23Na Radioactive BLY 15 hours 

3P Radioactive 6- 14.3 days 

35 Radioactive 6- 87.1 days 

36C] Radioactive B- 310,000 years 

BRK Radioactive B- 12.5 hours 

Ca Radioactive B- 152 days 

59Fe Radioactive BLY 45 days 

1317 Radioactive B. y 8 days 








*The relative natural abundance of a stable isotope is important because, in tracer studies, the amount of stable isotope 
is typically expressed in terms of atoms percent excess over the natural abundance of the isotope. 
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FIGURE 17.14 One of the earliest experiments using a ra- 
dioactive isotope as a metabolic tracer. Cells of Chlorella 
(a green alga) synthesizing carbohydrate from carbon 
dioxide were exposed briefly (5 sec) to '*C-labeled CO). 
The products of CO) incorporation were then quickly iso- 
lated from the cells, separated by two-dimensional paper 
chromatography, and observed via autoradiographic ex- 
posure of the chromatogram. Such experiments identi- 
fied radioactive 3-phosphoglycerate (PGA) as the pri- 
mary product of CO, fixation. The 3-phosphoglycerate 
was labeled in the 1-position (in its carboxyl group). Ra- 
dioactive compounds arising from the conversion of 
3-phosphoglycerate to other metabolic intermediates 
included phosphoenolpyruvate (PEP), malic acid, 

triose phosphate, alanine, and sugar phosphates and 
diphosphates. 
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Because the chemical behavior of isotopically labeled compounds is rarely distin- 
guishable from that of their unlabeled counterparts, isotopes provide reliable “tags” 
for observing metabolic changes. The metabolic fate of a radioactively labeled sub- 
stance can be traced by determining the presence and position of the radioactive 
atoms in intermediates derived from the labeled compound (Figure 17.14). 


Heavy Isotopes Heavy isotopes endow the compounds in which they appear with 
slightly greater masses than their unlabeled counterparts. These compounds can be 
separated and quantitated by mass spectrometry (or density gradient centrifuga- 
tion, if they are macromolecules). For example, !*O was used in separate experi- 
ments as a tracer of the fate of the oxygen atoms in water and carbon dioxide to de- 
termine whether the atmospheric oxygen produced in photosynthesis arose from 
H0O, COs, or both: 


CO», F HO —> (CH0) + Oo 


If '8O-labeled COs was presented to a green plant carrying out photosynthesis, 
none of the '8O was found in O». Curiously, it was recovered as H»!8O. In contrast, 
when plants fixing CO, were equilibrated with Ho!8O, 18O, was evolved. These lat- 
ter labeling experiments established that photosynthesis is best described by the 
equation 


C10; + 2 HO —> (CHO) + 18O, + HO 


That is, in the process of photosynthesis, the two oxygen atoms in O come from 
two HO molecules. One O is lost from CO» and appears in HO, and the other 
O of CO» is retained in the carbohydrate product. Two of the four H atoms are 
accounted for in (CH2O), and two reduce the O lost from CO» to H0O. 


NMR Spectroscopy Is a Noninvasive Metabolic Probe 


A technology analogous to isotopic tracers is provided by nuclear magnetic 
resonance (NMR) spectroscopy. The atomic nuclei of certain isotopes, such as the 
naturally occurring isotope of phosphorus, *!P, have magnetic moments. The reso- 
nance frequency of a magnetic moment is influenced by the local chemical envi- 
ronment. That is, the NMR signal of the nucleus is influenced in an identifiable 
way by the chemical nature of its neighboring atoms in the compound. In many 
ways, these nuclei are ideal tracers because their signals contain a great deal of 
structural information about the environment around the atom and thus the 
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nature of the compound containing the atom. Transformations of substrates and 
metabolic intermediates labeled with magnetic nuclei can be traced by following 
changes in NMR spectra. Furthermore, NMR spectroscopy is a noninvasive proce- 
dure. Whole-body NMR spectrometers are being used today in hospitals to directly 
observe the metabolism (and clinical condition) of living subjects (Figure 17.15). 
NMR promises to be a revolutionary tool for clinical diagnosis and for the investi- 
gation of metabolism in situ (literally “in site,” meaning, in this case, “where and 
as it happens”). 


Metabolic Pathways Are Compartmentalized Within Cells 


Although the interior of a prokaryotic cell is not subdivided into compartments by 
internal membranes, the cell still shows some segregation of metabolism. For ex- 
ample, certain metabolic pathways, such as phospholipid synthesis and oxidative 
phosphorylation, are localized in the plasma membrane. Protein biosynthesis is car- 
ried out on ribosomes. 

In contrast, eukaryotic cells are extensively compartmentalized by an endo- 
membrane system. Each of these cells has a true nucleus bounded by a double 
membrane called the nuclear envelope. The nuclear envelope is continuous with 
the endomembrane system, which is composed of differentiated regions: the en- 
doplasmic reticulum; the Golgi complex; various membrane-bounded vesicles 
such as lysosomes, vacuoles, and microbodies; and, ultimately, the plasma mem- 
brane itself. Eukaryotic cells also possess mitochondria and, if they are photo- 
synthetic, chloroplasts. Disruption of the cell membrane and fractionation of the 
cell contents into the component organelles have allowed an analysis of their re- 
spective functions (Figure 17.16). Each compartment is dedicated to specialized 
metabolic functions, and the enzymes appropriate to these specialized functions 
are confined together within the organelle. In many instances, the enzymes of a 
metabolic sequence occur together within the organellar membrane. Thus, the 
flow of metabolic intermediates in the cell is spatially as well as chemically segregated. For 
example, the 10 enzymes of glycolysis are found in the cytosol, but pyruvate, the 
product of glycolysis, is fed into the mitochondria. These organelles contain the 
citric acid cycle enzymes, which oxidize pyruvate to CO». The great amount of 
energy released in the process is captured by the oxidative phosphorylation 
system of mitochondrial membranes and used to drive the formation of ATP 
(Figure 17.17). 


FIGURE 17.15 With NMR spectroscopy, one can observe 
the metabolism of a living subject in real time. These 
NMR spectra show the changes in ATP, creatine-P (phos- 
phocreatine), and P; levels in the forearm muscle of a 
human subjected to 19 minutes of exercise. Note that 
the three P atoms of ATP (a, B,and y) have different 
chemical shifts, reflecting their different chemical 
environments. 
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FIGURE 17.16 Fractionation of a cell extract by differential centrifugation. It is possible to separate organelles 
and subcellular particles in a centrifuge because their inherent size and density differences give them different 
rates of sedimentation in an applied centrifugal field. Nuclei are pelleted in relatively weak centrifugal fields and 
mitochondria in somewhat stronger fields, whereas very strong centrifugal fields are necessary to pellet ribo- 
somes and fragments of the endomembrane system. 
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FIGURE 17.17 Compartmentalization of glycolysis, the citric acid cycle, and oxidative phosphorylation. 
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Rapid advances in chemical analysis have made it possible to carry out comprehen- 
sive studies of the many metabolites in a living organism. The metabolome is the 
complete set of low-molecular-weight molecules present in an organism or excreted 
by it under a given set of physiological conditions. Metabolomics is the systematic 
identification and quantitation of all these metabolites in a given organism or sam- 
ple. It is quite remarkable that biochemists can foresee the rise of a true systems 
biology, where comprehensive information sets from the genome, the transcrip- 
tome, the proteome, and the metabolome will combine to provide incisive descrip- 
tions of biological systems and detailed understanding of many human diseases. 

Even simple organisms present daunting challenges for metabolomic analyses. 
There are more than 500 metabolites represented in Figure 17.2, but far more ex- 
ist in a typical cell. For example, the 40 or so fatty acids occurring in a cell can alone 
account for thousands of different metabolites. (Triglycerides, with three fatty acids 
esterified to a glycerol backbone, could account for 40 X 40 X 40 = 64,000 species 
by themselves!) The Human Metabolomics Database (www.hmdb.ca) provides data 
on more than 2500 metabolites known in cells of the human body and human body 
fluids (blood, urine, and so on). Metabolomic measurements must be able to re- 
solve and discriminate this array of small molecules. Moreover, concentrations of 
metabolites vary widely, from 107? M (for many hormones) to 0.1 M (for Na* ions). 
Comprehensive metabolomic analyses involve processing of many samples, so the 
time and cost required per sample must be as low as possible. 

Mass spectrometry (MS) and nuclear magnetic resonance (NMR) are both pow- 
erful techniques for metabolomic analysis. Mass spectrometry offers unmatched sen- 
sitivity for detection of metabolites at low concentrations (Figure 17.18), and NMR 
spectroscopy can provide remarkable resolution and discrimination of metabolites 
in complex mixtures (Figure 17.19). Combination of these techniques with a variety 
of chromatographic separation protocols (Figure 17.20) makes it possible to analyze 
thousands of metabolites in biological samples rapidly and at low cost. 
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FIGURE 17.18 Mass spectrometry offers remarkable 
sensitivity for metabolomic analyses. Shown here are 
desorption electrospray ionization mass spectra for 
urine samples from individuals with inborn errors of 
metabolism. PKU = phenylketonuria; HCY = homo- 
cystinuria; MSUD = maple syrup urine disease. (Adapted 
from Pan, Z., Gu, H, et al., 2007. Principal component analysis of 
urine metabolites by NMR and DESI-MS in patients with inborn 
errors of metabolism. Analytical and Bioanalytical Chemistry 
387:539-549.) 
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FIGURE 17.19 (a) One-dimensional 'H NMR 
spectrum of an equimolar mixture of 


26 small-molecule standards. (b) Two- ois : iff ~ WOS a Threonine, Valine 
dimensional NMR spectrum of the same AA Sucrose Glucose, Maltose 
mixture (red) overlaid onto a spectrum of Fructose, Maltose ; A Glucose, Maltose, Sucrose 
aqueous whole-plant extract from s p Glucose, Maltose 

Arabidopsis thaliana, a model organism Sucrose Glucose 

for the study of plant molecular biology 5. 6c ae a oe i Mene 

and biochemistry. (Adapted from Lewis, l., Vi ’ è i Su Arabidopsis extract 
Schommer, S, et al., 2007. Method for determining Sucrose, 3 


"as Mixture of standards 
molar concentrations of metabolites in complex d 


solutions from two-dimensional 1H-1°C NMR 
spectra. Analytical Chemistry 79:9385-9390.) 





1004 


FIGURE 17.20 The combination of mass spectrometry 
and gas chromatography makes it possible to separate 
and identify hundreds of metabolites. Shown is an ion 
chromatogram of a human urine sample, with 1582 
separately identified peaks. (Adapted from Dettmer, K, and o Anaad LAA MAA pa e ieee ee 
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17.6 What Food Substances Form the Basis of Human Nutrition? 


1/265 What Food Substances Form the Basis 
of Human Nutrition? 


The use of foods by organisms is termed nutrition. The ability of an organism to use a 
particular food material depends upon its chemical composition and upon the meta- 
bolic pathways available to the organism. In addition to essential fiber, food includes 
the macronutrients—protein, carbohydrate, and lipid—and the micronutrients— 
including vitamins and minerals. 


Humans Require Protein 


Humans must consume protein in order to make new proteins. Dietary protein is a 
rich source of nitrogen, and certain amino acids—the so-called essential amino 
acids—cannot be synthesized by humans (and various animals) and can be ob- 
tained only in the diet. The average adult in the United States consumes far more 
protein than required for synthesis of essential proteins. Excess dietary protein is 
then merely a source of metabolic energy. Some of the amino acids (termed gluco- 
genic) can be converted into glucose, whereas others, the ketogenic amino acids, 
can be converted to fatty acids and/or keto acids. If fat and carbohydrate are al- 
ready adequate for the energy needs of the individual, then both kinds of amino 
acids will be converted to triacylglycerol and stored in adipose tissue. 

An individual’s protein undergoes a constant process of degradation and resyn- 
thesis. Together with dietary protein, this recycled protein material participates in 
a nitrogen equilibrium, or nitrogen balance. A positive nitrogen balance occurs 
whenever there is a net increase in the organism’s protein content, such as during 
periods of growth. A negative nitrogen balance exists when dietary intake of 
nitrogen is insufficient to meet the demands for new protein synthesis. 


Carbohydrates Provide Metabolic Energy 


The principal purpose of carbohydrate in the diet is production of metabolic energy. 
Simple sugars are metabolized in the glycolytic pathway (see Chapter 18). Complex 
carbohydrates are degraded into simple sugars, which then can enter the glycolytic 
pathway. Carbohydrates are also essential components of nucleotides, nucleic 
acids, glycoproteins, and glycolipids. Human metabolism can adapt to a wide range 
of dietary carbohydrate levels, but the brain requires glucose for fuel. When di- 
etary carbohydrate consumption exceeds the energy needs of the individual, ex- 
cess carbohydrate is converted to triacylglycerols and glycogen for long-term en- 
ergy storage. On the other hand, when dietary carbohydrate intake is low, ketone 
bodies are formed from acetate units to provide metabolic fuel for the brain and 
other organs. 


Lipids Are Essential, But in Moderation 


Fatty acids and triacylglycerols can be used as fuel by many tissues in the human 
body, and phospholipids are essential components of all biological membranes. 
Even though the human body can tolerate a wide range of fat intake levels, there 
are disadvantages in either extreme. Excess dietary fat is stored as triacylglycerols in 
adipose tissue, but high levels of dietary fat can also increase the risk of athero- 
sclerosis and heart disease. Moreover, high dietary fat levels are also correlated with 
increased risk for colon, breast, and prostate cancers. When dietary fat consump- 
tion is low, there is a risk of essential fatty acid deficiencies. As will be seen in Chap- 
ter 24, the human body cannot synthesize linoleic and linolenic acids, so these must 
be acquired in the diet. In addition, arachidonic acid can by synthesized in humans 
only from linoleic acid, so it too is classified as essential. The essential fatty acids are 
key components of biological membranes, and arachidonic acid is the precursor to 
prostaglandins, which mediate a variety of processes in the body. 
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A DEEPER LOOK 


Possibly the most serious nutrition problem in the United States is 
excessive food consumption, and many people have experimented 
with fad diets in the hope of losing excess weight. One of the most 
popular of the fad diets has been the high-protein, high-fat (low- 
carbohydrate) diet. The presumed rationale is tantalizing: Be- 
cause the tricarboxylic acid (TCA) cycle (see Chapter 19) plays a 
key role in fat catabolism and because glucose is usually needed to 
replenish intermediates in the TCA cycle, if carbohydrates are re- 
stricted in the diet, dietary fat should merely be converted to ke- 
tone bodies and excreted. This so-called diet appears to work at 
first because a low-carbohydrate diet results in an initial water 
(and weight) loss. This occurs because glycogen reserves are de- 





A Popular Fad Diet—Low Carbohydrates, High Protein, High Fat 


pleted by the diet and because about 3 grams of water of hydration 
are lost for every gram of glycogen. 

However, the premise for such diets is erroneous for several 
reasons. First, ketone body excretion by the human body usually 
does not exceed 20 grams (400 kJ) per day. Second, amino acids 
can function effectively to replenish TCA cycle intermediates, 
making the reduced carbohydrate regimen irrelevant. Third, the 
typical fare in a high-protein, high-fat, low-carbohydrate diet is 
expensive but not very tasty, and it is thus difficult to maintain. 
Finally, a diet high in saturated and trans fatty acids is a high risk 
factor for atherosclerosis and coronary artery disease. 








Fiber May Be Soluble or Insoluble 


The components of food materials that cannot be broken down by human digestive 
enzymes are referred to as dietary fiber. There are several kinds of dietary fiber, each 
with its own chemical and biological properties. Cellulose and hemicellulose are in- 
soluble fiber materials that stimulate regular function of the colon. They may play a 
role in reducing the risk of colon cancer. Lignins, another class of insoluble fibers, ab- 
sorb organic molecules in the digestive system. Lignins bind cholesterol and clear it 
from the digestive system, reducing the risk of heart disease. Pectins and gums are wa- 
ter-soluble fiber materials that form viscous gel-like suspensions in the digestive system, 
slowing the rate of absorption of many nutrients, including carbohydrates, and lower- 
ing serum cholesterol in many cases. The insoluble fibers are prevalent in vegetable 
grains. Water-soluble fiber is a component of fruits, legumes, and oats. 


SUMMARY 


17.1 Is Metabolism Similar in Different Organisms? One of the great 
unifying principles of modern biology is that organisms show marked 
similarity in their major pathways of metabolism. Given the almost un- 
limited possibilities within organic chemistry, this generality would ap- 
pear most unlikely. Yet it’s true, and it provides strong evidence that all 
life has descended from a common ancestral form. All forms of nutri- 
tion and almost all metabolic pathways evolved in early prokaryotes 
prior to the appearance of eukaryotes 1 billion years ago. All organisms, 
even those that can synthesize their own glucose, are capable of glucose 
degradation and ATP synthesis via glycolysis. Other prominent pathways 
are also virtually ubiquitous among organisms. 


17.2 What Can Be Learned from Metabolic Maps? Metabolism repre- 
sents the sum of the chemical changes that convert nutrients, the “raw 
materials” necessary to sustain living organisms, into energy and the 
chemically complex finished products of cells. Metabolism consists of 
literally hundreds of enzymatic reactions organized into discrete path- 
ways. Metabolic maps portray the principal reactions of the intermedi- 
ary metabolism of carbohydrates, lipids, amino acids, and their deriva- 
tives. In such maps, arrows connect metabolites and represent the 
enzyme reactions that interconvert the metabolites. Alternative map- 
pings of biochemical pathways have been proposed in a response to the 
emergence of genomic, transcriptomic, and proteomic perspectives on 
the complexity of biological systems. 


17.3 How Do Anabolic and Catabolic Processes Form the Core of Meta- 
bolic Pathways? Catabolism involves the oxidative degradation of com- 
plex nutrient molecules (carbohydrates, lipids, and proteins) obtained 


either from the environment or from cellular reserves. The breakdown 
of these molecules by catabolism leads to the formation of simpler mol- 
ecules such as lactic acid, ethanol, carbon dioxide, urea, or ammonia. 
Catabolic reactions are usually exergonic, and often the chemical en- 
ergy released is captured in the form of ATP. Anabolism is a synthetic 
process in which the varied and complex biomolecules (proteins, nu- 
cleic acids, polysaccharides, and lipids) are assembled from simpler pre- 
cursors. Such biosynthesis involves the formation of new covalent 
bonds, and an input of chemical energy is necessary to drive such en- 
dergonic processes. The ATP generated by catabolism provides this en- 
ergy. Furthermore, NADPH is an excellent donor of high-energy elec- 
trons for the reductive reactions of anabolism. 


17.4 What Experiments Can Be Used to Elucidate Metabolic Pathways? 
An important tool for elucidating the steps in the pathway is the use of 
metabolic inhibitors. Adding an enzyme inhibitor to a cell-free extract 
causes an accumulation of intermediates in the pathway prior to the 
point of inhibition. Each inhibitor is specific for a particular site in 
the sequence of metabolic events. Genetics provides an approach to the 
identification of intermediate steps in metabolism that is somewhat 
analogous to inhibition. Mutation in a gene encoding an enzyme often 
results in an inability to synthesize the enzyme in an active form. Such a 
defect leads to a block in the metabolic pathway at the point where the 
enzyme acts, and the enzyme’s substrate accumulates. Such genetic dis- 
orders are lethal if the end product of the pathway is essential or if the 
accumulated intermediates have toxic effects. In microorganisms, how- 
ever, it is often possible to manipulate the growth medium so that essen- 


tial end products are provided. Then the biochemical consequences of 
the mutation can be investigated. 


17.5 What Can the Metabolome Tell Us about a Biological System? 
Rapid advances in chemical analysis have made it possible to carry out 
comprehensive studies of the many metabolites in a living organism. The 
metabolome is the complete set of low-molecular-weight molecules 
present in an organism or excreted by it under a given set of physiologi- 
cal conditions. Metabolomics is the systematic identification and quanti- 
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tation of all these metabolites in a given organism or sample. Mass spec- 
trometry offers unmatched sensitivity for detection of metabolites at very 
low concentrations, whereas NMR spectroscopy can provide remarkable 
resolution and discrimination of metabolites in complex mixtures. 


17.6 What Food Substances Form the Basis of Human Nutrition? In 
addition to essential fiber, the food that human beings require in- 
cludes the macronutrients—protein, carbohydrate, and lipid—and 
the micronutrients—including vitamins and minerals. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. If3 X 10! kg of CO, are cycled through the biosphere annually, how 
many human equivalents (70-kg people composed of 18% carbon by 
weight) could be produced each year from this amount of CO? 

2. Define the differences in carbon and energy metabolism between 
photoautotrophs and photoheterotrophs and between chemoautotrophs and 
chemoheterotrophs. 

3. Name three principal inorganic sources of oxygen atoms that are 
commonly available in the inanimate environment and readily 
accessible to the biosphere. 

4. What are the features that generally distinguish pathways of catabo- 
lism from pathways of anabolism? 

5. Name the three principal modes of enzyme organization in meta- 
bolic pathways. 

6. Why is the pathway for the biosynthesis of a biomolecule at least 
partially different from the pathway for its catabolism? Why is the 
pathway for the biosynthesis of a biomolecule inherently more com- 
plex than the pathway for its degradation? 

7. (Integrates with Chapters 1 and 3.) What are the metabolic roles of 
ATP, NAD*, and NADPH? 

8. (Integrates with Chapter 15.) Metabolic regulation is achieved via 
regulating enzyme activity in three prominent ways: allosteric regu- 
lation, covalent modification, and enzyme synthesis and degrada- 
tion. Which of these three modes of regulation is likely to be the 
quickest; which the slowest? For each of these general enzyme reg- 
ulatory mechanisms, cite conditions in which cells might employ 
that mode in preference to either of the other two. 

9. What are the advantages of compartmentalizing particular meta- 
bolic pathways within specific organelles? 

10. Name and discuss four challenges associated with metabolomic 
measurements in biological systems. 

11. Compare and contrast mass spectrometry and NMR in terms of their 
potential advantages and disadvantages for metabolomic analysis. 


12. What chemical functionality is provided to enzyme reactions by pyri- 
doxal phosphate (see Chapter 13)? By coenzyme A (see Chapter 
19)? By vitamin By (see Chapter 23)? By thiamine pyrophosphate 
(see Chapter 19)? 

13. Define the following terms: 

a. Genome 
b. Transcriptome 


c. Proteome 
d. Metabolome 


14. The alcohol dehydrogenase reaction, described in Figure 17.11, in- 
terconverts ethyl alcohol and acetaldehyde and involves hydride 
transfer to and from NAD* and NADH, respectively. Write a rea- 
sonable mechanism for the conversion of ethanol to acetaldehyde 
by alcohol dehydrogenase. 


15. For each of the following metabolic pathways, describe where in the 
cell it occurs and identify the starting material and end product(s): 
a. Citric acid cycle 
b. Glycolysis 
c. Oxidative phosphorylation 
d. Fatty acid synthesis 


16. Many solutions to the problem of global warming have been pro- 
posed. One of these involves strategies for carbon sequestration— 
the removal of CO» from the earth’s atmosphere by various means. 
From your reading of this chapter, suggest and evaluate a strategy 
for carbon sequestration in the ocean. 


17. Consult Table 17.4, and consider the information presented for *°P 
and *S. Write reactions for the decay events for these two isotopes, 
indicating clearly the products of the decays, and calculate what 
percentage of each would remain from a sample that contained 
both and decayed for 100 days. 


Preparing for the MCAT Exam 


18. Which statement is most likely to be true concerning obligate 
anaerobes? 

a. These organisms can use oxygen if it is present in their envi- 
ronment. 

b. These organisms cannot use oxygen as their final electron 
acceptor. 

c. These organisms carry out fermentation for at least 50% of their 
ATP production. 

d. Most of these organisms are vegetative fungi. 

19. Foods rich in fiber are basically plant materials high in cellulose, a 
cell wall polysaccharide that we cannot digest. The nutritional ben- 
efits provided by such foods result from 
a. other nutrients present that can be digested and absorbed. 

b. macromolecules (like cellulose) that are absorbed without diges- 
tion and then catabolized inside the cells. 

c. microbes that are the normal symbionts of plant tissues. 

d. All of the above. 
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Glycolysis 


ESSENTIAL QUESTION 


Nearly every living cell carries out a catabolic process known as glycolysis—the 
stepwise degradation of glucose (and other simple sugars). Glycolysis is a paradigm 
of metabolic pathways. Carried out in the cytosol of cells, it is basically an anaerobic 
process; its principal steps occur with no requirement for oxygen. Living things first 
appeared in an environment lacking O3, and glycolysis was an early and important 
pathway for extracting energy from nutrient molecules. It played a central role in 
anaerobic metabolic processes during the first 2 billion years of biological evolution 
on earth. Contemporary organisms still employ glycolysis to provide precursor mol- 
ecules for aerobic catabolic pathways (such as the tricarboxylic acid cycle) and as a 
short-term energy source when oxygen is limiting. 


What are the chemical basis and logic for this central pathway of metabolism; 
that is, how does glycolysis work? 


1801s What Are the Essential Features of Glycolysis? 


In the glycolysis pathway (Figure 18.1), a molecule of glucose is converted in 10 
enzyme-catalyzed steps to two molecules of 3-carbon pyruvate. Most of the details of 
this pathway (the first metabolic pathway to be elucidated) were worked out in the 
first half of the 20th century by the German biochemists Otto Warburg, G. Embden, 
and O. Meyerhof. In fact, the sequence of reactions in Figure 18.1 is often referred 
to as the Embden—Meyerhof pathway. 

Why is glycolysis so important to organisms? There are several reasons. For 
some tissues (such as brain, kidney medulla, and rapidly contracting skeletal 
muscles) and for some cells (such as erythrocytes and sperm cells), glucose is 
the only source of metabolic energy. In addition, the product of glycolysis— 
pyruvate—is a versatile metabolite that can be used in several ways. In most tis- 
sues, when oxygen is plentiful (aerobic conditions), pyruvate is oxidized (with 
loss of the carboxyl group as CO»), and the remaining two-carbon unit becomes 
the acetyl group of acetyl-coenzyme A (acetyl-CoA) (Figure 18.2). This acetyl 
group is metabolized in the tricarboxylic acid (TCA) cycle (and fully oxidized) 
to yield CO, (see Chapter 19). 

Alternatively, in the absence of oxygen (anaerobic conditions), pyruvate can be 
reduced to lactate through oxidation of NADH to NAD*—a process termed lactic 
acid fermentation. In microorganisms such as brewer’s yeast, and in certain plant 
tissues, pyruvate can be reduced to ethanol, again with oxidation of NADH to 
NAD*. Most students will recognize this process as alcoholic fermentation. 

Glycolysis consists of two phases. In the first phase, a series of five reactions, glu- 
cose is broken down to two molecules of glyceraldehyde-3-phosphate. In the sec- 
ond phase, five subsequent reactions convert these two molecules of glyceralde- 
hyde-3-phosphate into two molecules of pyruvate. Phase 1 consumes two molecules 
of ATP (Figure 18.1). The later stages of glycolysis result in the production of four 
molecules of ATP. The net is 4 — 2 = 2 molecules of ATP produced per molecule 
of glucose. Microorganisms, plants, and animals (including humans) carry out the 
10 reactions of glycolysis in more or less similar fashion, although the rates of the in- 
dividual reactions and the means by which they are regulated differ from species to 
species. 
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Louis Pasteur’s scientific investigations into fermen- 
tation of grape sugar were pioneering studies of 


glycolysis. 





Living organisms, like machines, conform to the 
law of conservation of energy, and must pay 
for all their activities in the currency of 
catabolism. 


Ernest Baldwin 
Dynamic Aspects of Biochemistry (1952) 





KEY QUESTIONS 








18.1 What Are the Essential Features of 
Glycolysis? 

18.2 Why Are Coupled Reactions Important 
in Glycolysis? 

18.3 What Are the Chemical Principles and 
Features of the First Phase of Glycolysis? 

18.4 What Are the Chemical Principles and 
Features of the Second Phase of Glycolysis? 

18.5 What Are the Metabolic Fates of NADH 
and Pyruvate Produced in Glycolysis? 

18.6 ow Do Cells Regulate Glycolysis? 

18.7 Are Substrates Other Than Glucose Used 
in Glycolysis? 

18.8 ow Do Cells Respond to Hypoxic Stress? 
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Phase 1 Phase 2 


Phosphorylation of glucose and conversion to 2 molecules of Conversion of glyceraldehyde-3-phosphate to pyruvate and 
glyceraldehyde-3-phosphate; 2 ATPs are used to prime these reactions. coupled formation of 4 ATP and 2 NADH. 
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CENGAGENOW” ACTIVE FIGURE 18.1 The glycolytic pathway. Test yourself on the concepts in this figure 
at www.cengage.com/login. 


18.3. What Are the Chemical Principles and Features of the First Phase of Glycolysis? 
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18:28 Why Are Coupled Reactions Important in Glycolysis? 


The process of glycolysis converts some, but not all, of the metabolic energy of the glu- 
cose molecule into ATP. The free energy change for the conversion of glucose to two 
molecules of lactate (the anaerobic route in contracting muscle) is — 183.6 kJ/mol: 


C6H1206 > 2 H3C—CHOH—COO™ + 2 H* 
AG*' = —183.6 kJ/mol 


(18.1) 


This process occurs with no net oxidation or reduction. Although several individual 
steps in the pathway involve oxidation or reduction, these steps compensate each 
other exactly. Thus, the conversion of a molecule of glucose to two molecules of lac- 
tate involves simply a rearrangement of bonds, with no net loss or gain of electrons. 
The energy made available through this rearrangement is a relatively small part of 
the total energy obtainable from glucose. 

The production of two molecules of ATP in glycolysis is an energy-requiring process: 


2 ADP + 2 P —>2 ATP + 2 HO (18.2) 
AG” = 2 X 30.5 kJ/mol = 61.0 kJ/mol 
Glycolysis couples these two reactions: 
Glucose + 2 ADP + 2 P; > 2 lactate + 2 ATP + 2H*+2H.O (18.3) 


AG” = —183.6 + 61 = —122.6 kJ/mol 


Thus, under standard-state conditions, (61/183.6) X 100%, or 33%, of the free en- 
ergy released is preserved in the form of ATP in these reactions. However, as we dis- 
cussed in Chapter 3, the various solution conditions, such as pH, concentration, 
ionic strength, and presence of metal ions, can substantially alter the free energy 
change for such reactions. Under actual cellular conditions, the free energy change 
for the synthesis of ATP (Equation 18.2) is much larger, and approximately 50% of 
the available free energy is converted into ATP. Clearly, then, more than enough 
free energy is available in the conversion of glucose into lactate to drive the synthe- 
sis of two molecules of ATP. 


18:38 What Are the Chemical Principles and Features 
of the First Phase of Glycolysis? 


In the first phase of glycolysis, glucose will be phosphorylated at C-1 and C-6, and 
the six-carbon skeleton of glucose will be cleaved to yield two three-carbon mole- 
cules of glyceraldehyde-3-phosphate. Phosphorylation and cleavage reorganize the 


FIGURE 18.2 Pyruvate produced in glycolysis can be 
utilized by cells in several ways. In animals, pyruvate is 
normally converted to acetyl-coenzyme A, which is then 
oxidized in the TCA cycle to produce CO. When oxygen 
is limited, pyruvate can be converted to lactate. 
Alcoholic fermentation in yeast converts pyruvate to 
ethanol and COs. 
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FIGURE 18.3 Just as a water pump must be “primed” 
with water to get more water out, the glycolytic pathway 
is primed with ATP in steps 1 and 3 in order to achieve 
net production of ATP in the second phase of the 
pathway. 


glucose molecule so that molecules of ATP can be produced in the second phase of 
glycolysis. 


Reaction 1: Glucose Is Phosphorylated by Hexokinase 
or Glucokinase—The First Priming Reaction 


The initial reaction of the glycolysis pathway involves phosphorylation of glucose at 
carbon atom 6 by either hexokinase or glucokinase. (Recall that “kinases” are en- 
zymes that transfer the y-phosphate of ATP to nucleophilic acceptors.) Phosphory- 
lation activates glucose for the following reactions in the pathway. However, the for- 
mation of such a phosphoester is thermodynamically unfavorable and requires 
energy input to operate in the forward direction (see Chapter 3). The energy comes 
from ATP, a requirement that at first seems counterproductive. Glycolysis is de- 
signed to make ATP, not consume it. However, the hexokinase or glucokinase reac- 
tion (Figure 18.1) is one of two priming reactions in the pathway. Just as old- 
fashioned, hand-operated water pumps (Figure 18.3) have to be primed with a 
small amount of water to deliver more water to the thirsty pumper, the glycolysis 
pathway requires two priming ATP molecules to start the sequence of reactions and 
delivers four molecules of ATP in the end. 
The complete reaction for the first step in glycolysis is 


a-D-Glucose + ATP —> a-p-glucose-6-phosphate?~ + ADP®~ + H* 
AG” = —16.7 kJ/mol 


(18.4) 


The hydrolysis of ATP makes 30.5 kJ/mol available in this reaction, and the phos- 
phorylation of glucose “costs” 13.8 kJ/mol (Table 8.1). Thus, the reaction liberates 
16.7 kJ/mol under standard-state conditions (1 M concentrations), and the equilib- 
rium of the reaction lies far to the right (Keq = 850 at 25°C; see Table 18.1). Under 
cellular conditions (Table 18.2), this first reaction of glycolysis is even more favor- 
able than at standard state, with a AG of —33.9 kJ/mol (see Table 18.1). 
































UVTERERIG Reactions and Thermodynamics i 7 al 
AG” Keg AG 
Reaction Enzyme (kJ/mol) at 25°C (kJ/mol) 
a-D-Glucose + ATP == Hexokinase —16.7 850 33.9" 
glucose-6-phosphate? + ADP®~ + H* Glucokinase 
Glucose-6-phosphate?” == fructose-6-phosphate*~ Phosphoglucoisomerase +1.67 0.51 —2.92 
Fructose-6-phosphate*” + ATP*~ == Phosphofructokinase =142 310 —18.8 
fructose-1,6-bisphosphate*” + ADP®~ + H* 
Fructose-1,6-bisphosphate*~ == Fructose bisphosphate +23.9 6.43 X 107% —0.23 
dihydroxyacetone-P?~ + glyceraldehyde-3-P?~ aldolase 
Dihydroxyacetone-P?> == glyceraldehyde-3-P?- Triose phosphate isomerase +7.56 0.0472 +2.41 
Glyceraldehyde-3-P°= + P;?~ + NAD} == Glyceraldehyde-3-P +6.30 0.0786 =1.29 
1,3-bisphosphoglycerate*” + NADH + H+ dehydrogenase 
1,3-Bisphosphoglycerate*~ + ADP? == Phosphoglycerate kinase =18.9 2060 +0.1 
3-P-glycerate*~ + ATP*~ 
3-Phosphoglycerate?~ == 2-phosphoglycerate*~ Phosphoglycerate mutase +4.4 0.169 +0.83 
2-Phosphoglycerate*~ == Enolase +1.8 0.483 +1.1 
phosphoenolpyruvate?~ + HO 
Phosphoenolpyruvate?~ + ADP?~ + H* == Pyruvate kinase —31.7 3.63 X 10° —23.0 
pyruvate” + ATP*~ 
Pyruvate’ + NADH + H* = = lactate™ + NAD+ Lactate dehydrogenase —25.2 2.63 X 104 —14.8 














*AG values calculated for 310K (37°C) using the data in Table 18.2 for metabolite concentrations in erythrocytes. AG” values are assumed to be the same at 25° and 37°C. 











18.3. What Are the Chemical Principles and Features of the First Phase of Glycolysis? 539 


Extracellular 
fluid 


ED 
Cc 
ADP 


Glucose- 
G 6-phosphate 


Glucose is kept in the cell 
by phosphorylation to G-6-P, 
which cannot easily cross 
the plasma membrane 





CENGAGENOW” ANIMATED FIGURE 18.4 Phosphorylation of glucose to glucose-6-phosphate by ATP 
creates a charged molecule that cannot easily cross the plasma membrane. See this figure animated at 
www.cengage.com/login. 


The Cellular Advantages of Phosphorylating Glucose The incorporation of a phos- 
phate into glucose in this energetically favorable reaction is important for several rea- 
sons. First, phosphorylation keeps the substrate in the cell. Glucose is a neutral mole- 
cule and could diffuse across the cell membrane, but phosphorylation confers a 
negative charge on glucose and the plasma membrane is essentially impermeable to 
glucose-6-phosphate (Figure 18.4). Moreover, rapid conversion of glucose to glucose- 
6-phosphate keeps the intracellular concentration of glucose low, favoring faciliated dif- 
fusion of glucose into the cell. In addition, because regulatory control can be imposed 
only on reactions not at equilibrium, the favorable thermodynamics of this first reac- 
tion makes it an important site for regulation. 


The Isozymes of Hexokinase In most animal, plant, and microbial cells, the enzyme 
that phosphorylates glucose is hexokinase. Magnesium ion (Mg?*) is required for this 
reaction, as for the other kinase enzymes in the glycolytic pathway. The true substrate 
for the hexokinase reaction is MgATP?-. There are four isozymes of hexokinase in 
most animal tissues. Hexokinase I is the principal form in the brain. Hexokinase in 
skeletal muscle is a mixture of types I (70% to 75%) and II (25% to 30%). The K„ 
for glucose is 0.03 mM for type I and 0.3 mM for type II; thus, hexokinase operates 
efficiently at normal blood glucose levels of 4 mM or so. The animal isozymes are 
allosterically inhibited by the product, glucose-6-phosphate. High levels of glucose-6- 
phosphate inhibit hexokinase activity until consumption by glycolysis lowers its con- 
centration. The hexokinase reaction is one of three points in the glycolysis pathway 
that are regulated. As the generic name implies, hexokinase can phosphorylate a vari- 
ety of hexose sugars, including glucose, mannose, and fructose. 

The type IV isozyme of hexokinase, called glucokinase, is found predominantly 
in the liver and pancreas. Type IV is highly specific for D-glucose, has a much higher 
K,, for glucose (approximately 10 mM), and is not product inhibited. With such a 
high K,, for glucose, glucokinase becomes important metabolically only when liver 
glucose levels are high (for example, when the individual has consumed large 
amounts of carbohydrates). When glucose levels are low, hexokinase is primarily 
responsible for phosphorylating glucose. However, when glucose levels are high, glu- 
cose is converted by glucokinase to glucose-6-phosphate and is eventually stored in 
the liver as glycogen. Glucokinase is an inducible enzyme—the amount present in the 
liver is controlled by insulin (secreted by the pancreas). (Patients with diabetes melli- 
tus produce insufficient insulin. They have low levels of glucokinase, cannot tolerate 
high levels of blood glucose, and produce little liver glycogen.) Because glucose-6- 
phosphate is common to several metabolic pathways (Figure 18.5), it occupies a 
branch point in glucose metabolism. 





TERT M Steady 


















Metabolite 

Glucose 5.0 
Glucose-6-phosphate 0.083 
Fructose-6-phosphate 0.014 
Fructose-1,6-bisphosphate 0.031 
Dihydroxyacetone phosphate 0.14 
Glyceraldehyde-3-phosphate 0.019 
1,3-Bisphosphoglycerate 0.001 
2,3-Bisphosphoglycerate 4.0 
3-Phosphoglycerate 0.12 
2-Phosphoglycerate 0.030 
Phosphoenolpyruvate 0.023 
Pyruvate 0.051 
Lactate 2.9 
ATP 1.85 
ADP 0.14 
P; 1.0 








Adapted from Minakami, S., and Yoshikawa, H., 1965. Thermo- 
dynamic considerations on erythrocyte glycolysis. Biochemi- 
cal and Biophysical Research Communications 18:345. 
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FIGURE 18.6 The (a) open and (b) closed states of yeast 
hexokinase. Binding of glucose (green) induces a con- 
formation change that closes the active site, as predict- 
ed by Koshland (a: pdb id = 11G8;b: pdb id = 1BDG). 
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FIGURE 18.5 Glucose-6-phosphate is the branch point for several metabolic pathways. 


Hexokinase Binds Glucose and ATP with an Induced Fit In most organisms, hex- 
okinase occurs in a single form: a two-lobed 50-kD monomer that resembles a 
clamp, with a large groove in one side (Figure 18.6; see also Figure 13.24). Daniel 
Koshland predicted, years before structures were available, that hexokinase would 
undergo an induced fit (see Chapter 13), closing around the substrates ATP and 
glucose when they were bound. Koshland’s prediction was confirmed when struc- 
tures of the yeast enzyme were determined in the absence and presence of glucose 
(Figure 18.6). 

The human hexokinase isozymes I, II, and III are twice as big as those of lower 
organisms. They are composed of two separate domains, each similar to the yeast 
enzyme, and connected head to tail by a long a-helix (Figure 18.7). The sequence 


(a) 





Glucose 





FIGURE 18.7 (a) Mammalian hexokinase | contains an 
N-terminal domain (top) and a C-terminal domain (bottom) 
joined by a long a-helix. Each of these domains is similar in 
sequence and structure to yeast hexokinase (pdb id = 1CZA). 
(b) Human glucokinase undergoes an induced fit upon 
binding glucose (green). (Top: pdb id = 1V4T; bottom: pdb 

id = 1V4S). 





18.3. What Are the Chemical Principles and Features of the First Phase of Glycolysis? 541 


and structure similarity apparently arose from the duplication and fusion of a pri- 
mordial hexokinase gene. Interestingly, both halves of hexokinase II support catal- 
ysis, but only the C-terminal half of isozymes I and III performs phosphorylation of 
glucose. The N-terminal half, on the other hand, has apparently evolved into a form 
that allosterically regulates the activity of the C-terminal half! Type IV hexokinase 
(glucokinase) is similar in structure to the yeast enzyme, with a single clamp do- 
main, a single active site, and a mass of 50 kD (Figure 18.7). 


Reaction 2: Phosphoglucoisomerase Catalyzes the Isomerization 
of Glucose-6-Phosphate 


The second step in glycolysis is a common type of metabolic reaction: the isomer- 
ization of a sugar. In this particular case, the carbonyl oxygen of glucose-6-phosphate 
is shifted from C-1 to C-2. This amounts to isomerization of an aldose (glucose-6- 
phosphate) to a ketose—fructose-6-phosphate (Figure 18.8). The reaction is neces- 
sary for two reasons. First, the next step in glycolysis is phosphorylation at C-1, and the 
hemiacetal —OH of glucose, would be more difficult to phosphorylate than a simple 
primary hydroxyl. Second, the isomerization to fructose (with a carbonyl group at 
position 2 in the linear form) activates C-3, facilitating C-C bond cleavage in the 
fourth step of glycolysis. The enzyme responsible for this isomerization is phospho- 
glucoisomerase, also known as phosphoglucose isomerase and glucose phosphate 
isomerase. In humans, the enzyme requires Mg* for activity and is highly specific 
for glucose-6-phosphate. The AG*’ is 1.67 kJ/mol, and the value of AG under cellu- 
lar conditions (Table 18.1) is —2.92 kJ/mol. This small value means that the reaction 
operates near equilibrium in the cell and is readily reversible. Phosphoglucoiso- 
merase proceeds through an enediol intermediate, as shown in Figure 18.8. Although 
the predominant forms of glucose-6-phosphate and fructose-6-phosphate in solution 
are the ring forms, the isomerase interconverts the open-chain form of G-6-P with 
the open-chain form of F-6-P. 
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CENGAGENOW” ACTIVE FIGURE 18.8 The phosphoglucoiso- 
5 o CH.OH merase mechanism involves opening of the pyranose ring (step 1), 
O3POH CG 2 proton abstraction leading to enediol formation (step 2), and pro- 
H HO ton addition to the double bond, followed by ring closure (step 3). 
H OH Test yourself on the concepts in this figure at www.cengage 
OH H .com/login. 





Phosphoglucoisomerase, with fructose-6-P (blue) bound 
(pdb id = 1HOX). 
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Phosphofructokinase with ADP (in orange) and 
fructose-6-phosphate (in red) (pdb id = 4PFK). 
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tion of phosphofructokinase. 
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FIGURE 18.9 At high [ATP], phosphofructokinase (PFK) 
behaves cooperatively and the plot of enzyme activity 
versus [fructose-6-phosphate] is sigmoid. High [ATP] thus 
inhibits PFK, decreasing the enzyme’s affinity for fructose- 
6-phosphate. 


AG” =-14.2 kJ/mol 
AG erythrocyte =-18.8 kJ/mol 


Reaction 3: ATP Drives a Second Phosphorylation 
by Phosphofructokinase—The Second Priming Reaction 


The action of phosphoglucoisomerase, “moving” the carbonyl group from C-1 to C-2, 
creates a new primary alcohol function at C-1 (see Figure 18.8). The next step in the 
glycolytic pathway is the phosphorylation of this group by phosphofructokinase. 
Once again, the substrate that provides the phosphoryl group is ATP. Like the hexo- 
kinase/glucokinase reaction, the phosphorylation of fructose-6-phosphate is a prim- 
ing reaction and is endergonic: 


Fructose-6-P + P;——> fructose-1,6-bisphosphate (18.5) 
AG” = 16.3 kJ/mol 


When coupled (by phosphofructokinase) with the hydrolysis of ATP, the overall re- 
action becomes exergonic: 


Fructose-6-P + ATP—> fructose-1,6-bisphosphate + ADP (18.6) 
AG” = —14.2 kJ/mol 
AG (in erythrocytes) = —18.8 kJ/mol 


At pH 7 and 37°C, the phosphofructokinase reaction equilibrium lies far to the 
right. Just as the hexokinase reaction commits the cell to taking up glucose, the phos- 
phofructokinase reaction commits the cell to metabolizing glucose rather than converting it 
to another sugar or storing it. Similarly, just as the large free energy change of the 
hexokinase reaction makes it a likely candidate for regulation, so the phosphofruc- 
tokinase reaction is an important site of regulation—indeed, the most important 
site in the glycolytic pathway. 


Regulation of Phosphofructokinase Phosphofructokinase is the “valve” control- 
ling the rate of glycolysis. In addition to its role as a substrate, ATP is also an allosteric 
inhibitor of this enzyme. Thus, phosphofructokinase has two distinct binding sites 
for ATP; a high-affinity substrate site and a low-affinity regulatory site. In the pres- 
ence of high ATP concentrations, phosphofructokinase behaves cooperatively, plots 
of enzyme activity versus fructose-6-phosphate are sigmoid, and the K,, for fructose- 
6-phosphate is increased (Figure 18.9). Thus, when ATP levels are sufficiently high 
in the cytosol, glycolysis “turns off.” Under most cellular conditions, however, the 
ATP concentration does not vary over a large range. The ATP concentration in 
muscle during vigorous exercise, for example, is only about 10% lower than that 
during the resting state. The rate of glycolysis, however, varies much more. A large 
range of glycolytic rates cannot be directly accounted for by only a 10% change in 
ATP levels. 

AMP reverses the inhibition due to ATP, and AMP levels in cells can rise dramat- 
ically when ATP levels decrease, due to the action of the enzyme adenylate kinase, 
which catalyzes the reaction 


ADP + ADP =——ATP + AMP 


with the equilibrium constant: 


[ATP] [AMP] 
Keq ~ [ADP]? 





= 0.44 (18.7) 
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Phosphoglucoisomerase—A Moonlighting Protein 


When someone has a day job but also works at night (that is, un- 
der the moon) at a second job, they are said to be “moonlight- 
ing.” Similarly, a number of proteins have been found to have 
two or more different functions, and Constance Jeffery at Bran- 
deis University has dubbed these “moonlighting proteins.” Phos- 
phoglucoisomerase catalyzes the second step of glycolysis but 
also moonlights as a nerve growth factor outside animal cells. In 
fact, outside the cell, this protein is known as neuroleukin (NL), 
autocrine motility factor (AMF), and differentiation and matu- 
ration mediator (DMM). Neuroleukin is secreted by (immune 
system) T cells and promotes the survival of certain spinal neu- 


stimulates cancer cell migration. DMM causes certain leukemia 
cells to differentiate. 

How phosphoglucoisomerase is secreted by the cell for its 
moonlighting functions is unknown, but there is evidence that the 
organism itself may be harmed by this secretion. Diane Mathis and 
Christophe Benoist at the University of Strasbourg have shown 
that, in mice with disorders similar to rheumatoid arthritis, the im- 
mune system recognizes extracellular phosphoglucoisomerase as 
an antigen—that is, a protein that is “nonself.” That a protein can 
be vital to metabolism inside the cell and also function as a growth 
factor and occasionally act as an antigen outside the cell is indeed 


rons and sensory nerves. AMF is secreted by tumor cells and remarkable. 








Adenylate kinase rapidly interconverts ADP, ATP, and AMP to maintain this equi- 
librium. ADP levels in cells are typically 10% of ATP levels, and AMP levels are 
often less than 1% of the ATP concentration. Under such conditions, a small net 
change in ATP concentration due to ATP hydrolysis results in a much larger rela- 
tive increase in the AMP levels because of adenylate kinase activity. 

Clearly, the activity of phosphofructokinase depends on both ATP and AMP levels 
and is a function of the cellular energy status. Phosphofructokinase activity is in- 
creased when the energy status falls and is decreased when the energy status is high. 
The rate of glycolysis activity thus decreases when ATP is plentiful and increases when 
more ATP is needed. 

Glycolysis and the citric acid cycle (to be discussed in Chapter 19) are coupled via 
phosphofructokinase, because citrate, an intermediate in the citric acid cycle, is an 
allosteric inhibitor of phosphofructokinase. When the citric acid cycle reaches satu- 
ration, glycolysis (which “feeds” the citric acid cycle under aerobic conditions) slows 
down. The citric acid cycle directs electrons into the electron-transport chain (for the 
purpose of ATP synthesis in oxidative phosphorylation) and also provides precursor 
molecules for biosynthetic pathways. Inhibition of glycolysis by citrate ensures 
that glucose will not be committed to these activities if the citric acid cycle is already frdcioae- phosphate aadresiorino 1e hyperbolicde 
saturated. pendence of enzyme activity on substrate concentra- 

Phosphofructokinase is also regulated by 6-p-fructose-2,6-bisphosphate, a po- tion. 
tent allosteric activator that increases the affinity of phosphofructokinase for the 
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FIGURE 18.10 Fructose-2,6-bisphosphate activates phos- 
phofructokinase, increasing the affinity of the enzyme for 


substrate fructose-6-phosphate (Figure 18.10). Stimulation of phosphofructo- -O,POCH, -O. OPO}- 
kinase is also achieved by decreasing the inhibitory effects of ATP (Figure 18.11). H HO 
Fructose-2,6-bisphosphate increases the net flow of glucose through glycolysis by H CHOH 


stimulating phosphofructokinase and, as we shall see in Chapter 22, by inhibiting 
fructose-1,6-bisphosphatase, the enzyme that catalyzes this reaction in the opposite 
direction. 


OH H 
Fructose-2,6-bisphosphate 


Reaction 4: Cleavage by Fructose Bisphosphate Aldolase Creates 
Two 3-Carbon Intermediates 


Fructose bisphosphate aldolase cleaves fructose-1,6-bisphosphate between the 
C-3 and C-4 carbons to yield two triose phosphates. The products are dihydroxy- 
acetone phosphate (DHAP) and glyceraldehyde-3-phosphate. The reaction has an 
equilibrium constant of approximately 1074 M, and a corresponding AG°’ of 
+23.9 kJ/mol. These values might imply that the reaction does not proceed ef- 
fectively from left to right as written. However, the reaction makes two molecules 
(glyceraldehyde-3-P and dihydroxyacetone-P) from one molecule (fructose-1,6- 
bisphosphate), and the equilibrium is thus greatly influenced by concentration. 
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FIGURE 18.11 Fructose-2,6-bisphosphate decreases the 
inhibition of phosphofructokinase due to ATP. 
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The value of AG in erythrocytes is actually —0.23 kJ/mol (see Table 18.1). At phys- 
iological concentrations, the reaction is essentially at equilibrium. 

Two classes of aldolase enzymes are found in nature. Animal tissues produce a 
Class I aldolase, characterized by the formation of a covalent Schiff base intermediate 
between an active-site lysine and the carbonyl group of the substrate. Class I aldolases 
do not require a divalent metal ion. Class II aldolases are produced mainly in bacte- 
ria and fungi and do not form a covalent E-S intermediate, but they contain an active- 
site metal (normally zinc, Zn?*). Cyanobacteria and some other simple organisms 
possess both classes of aldolase. 

The aldolase reaction is merely the reverse of the aldol condensation well known 
to organic chemists. The latter reaction involves an attack by a nucleophilic enolate 
anion of an aldehyde or ketone on the carbonyl carbon of an aldehyde. The oppo- 
site reaction, aldol cleavage, begins with removal of a proton from the -hydroxyl 
group, which is followed by the elimination of the enolate anion. A mechanism for 
the aldol cleavage reaction of fructose-1,6-bisphosphate in the Class I-type aldolases 
is shown in Figure 18.12a. In Class II aldolases, an active-site metal such as Zn?* be- 
haves as an electrophile, polarizing the carbonyl group of the substrate and stabiliz- 
ing the enolate intermediate (Figure 18.12b). 


Reaction 5: Triose Phosphate Isomerase Completes 
the First Phase of Glycolysis 


Of the two products of the aldolase reaction, only glyceraldehyde-3-phosphate 
goes directly into the second phase of glycolysis. The other triose phosphate, 
dihydroxyacetone phosphate, must be converted to glyceraldehyde-3-phosphate by 
the enzyme triose phosphate isomerase. This reaction thus permits both products of 
the aldolase reaction to continue in the glycolytic pathway and in essence makes the 
C-1, C-2, and C3 carbons of the starting glucose molecule equivalent to the C-6, 
C-5, and C-4 carbons, respectively. The reaction mechanism involves an enediol in- 
termediate that can donate either of its hydroxyl protons to a basic residue on the 
enzyme and thereby become either dihydroxyacetone phosphate or glyceraldehyde- 
3-phosphate (Figure 18.13). Triose phosphate isomerase is one of the enzymes that 
have evolved to a state of “catalytic perfection,” with a turnover number near the dif- 
fusion limit (see Table 13.5). 

The triose phosphate isomerase reaction completes the first phase of glycolysis, 
each glucose that passes through being converted to two molecules of glyceralde- 
hyde-3-phosphate. Although the last two steps of the pathway are energetically un- 
favorable, the overall five-step reaction sequence has a net AG?’ of +2.2 kJ/mol 
(Keq = 0.43). It is the free energy of hydrolysis from the two priming molecules of 
ATP that brings the overall equilibrium constant close to 1 under standard-state 
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conditions. The net AG under cellular conditions is quite negative (—53.4 kJ/mol 
in erythrocytes). 


18:45 What Are the Chemical Principles and Features 
of the Second Phase of Glycolysis? 


The second half of the glycolytic pathway involves the reactions that convert the 
metabolic energy in the glucose molecule into ATP. Altogether, four new ATP mol- 
ecules are produced. If two are considered to offset the two ATPs consumed in 
phase 1, a net yield of two ATPs per glucose is realized. Phase 2 starts with the oxi- 
dation of glyceraldehyde-3-phosphate, a reaction with a large enough energy “kick” 
to produce a high-energy phosphate, namely, 1,3-bisphosphoglycerate (see Figure 
18.1). Phosphoryl transfer from 1,3-BPG to ADP to make ATP is highly favorable. 
The product, 3-phosphoglycerate, is converted via several steps to phosphoenol- 
pyruvate (PEP), another high-energy phosphate. PEP readily transfers its phos- 
phoryl group to ADP in the pyruvate kinase reaction to make another ATP. 


Reaction 6: Glyceraldehyde-3-Phosphate Dehydrogenase Creates 
a High-Energy Intermediate 


In the first glycolytic reaction to involve oxidation—reduction, glyceraldehyde-3- 
phosphate is oxidized to 1,3-bisphosphoglycerate by glyceraldehyde-3-phosphate de- 
hydrogenase. Although the oxidation of an aldehyde to a carboxylic acid is a highly 
exergonic reaction, the overall reaction involves both formation of a carboxylic— 
phosphoric anhydride and the reduction of NAD* to NADH and is therefore slightly 
endergonic at standard state, with a AG?’ of +6.30 kJ/mol. The free energy that 
might otherwise be released as heat in this reaction is directed into the formation of 
a high-energy phosphate compound, 1,3-bisphosphoglycerate, and the reduction of 
NAD*. The reaction mechanism involves nucleophilic attack by a cysteine —SH 
group on the carbonyl carbon of glyceraldehyde-3-phosphate to form a hemithioac- 
etal (Figure 18.14). The hemithioacetal intermediate decomposes by hydride (H:~) 
transfer to NAD* to form a high-energy thioester. Nucleophilic attack by phosphate 
displaces the product, 1,3-bisphosphoglycerate, from the enzyme. The enzyme can be 
inactivated by reaction with iodoacetate, which reacts with and blocks the essential cys- 
teine sulfhydryl. 
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The glyceraldehyde-3-phosphate dehydrogenase reaction is the site of action of 
arsenate (AsO4°-), an anion analogous to phosphate. Arsenate is an effective sub- 
strate in this reaction, forming 1-arseno-3-phosphoglycerate, but acyl arsenates are 
quite unstable and are rapidly hydrolyzed. 1-Arseno-3-phosphoglycerate breaks 
down to yield 3-phosphoglycerate, the product of the seventh reaction of glycolysis. 
The result is that glycolysis continues in the presence of arsenate, but the molecule 
of ATP formed in reaction 7 (phosphoglycerate kinase) is not made because this 
step has been bypassed. The lability of l-arseno-3-phosphoglycerate effectively un- 
couples the oxidation and phosphorylation events, which are normally tightly cou- 
pled in the glyceraldehyde-3-phosphate dehydrogenase reaction. 
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R CENGAGENOW” ACTIVE FIGURE 18.14 A mechanism for the glyceraldehyde- 
| r 3-phosphate dehydrogenase reaction. Reaction of an enzyme sulfhydryl with 
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a | loses a hydride to NAD* to become a thioester. Phosphorolysis of this 
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Reaction 7: Phosphoglycerate Kinase Is the Break-Even Reaction 


The glycolytic pathway breaks even in terms of ATPs consumed and produced 
with this reaction. The enzyme phosphoglycerate kinase transfers a phosphoryl 
group from 1,3-bisphosphoglycerate to ADP to form an ATP. Because each glu- 
cose molecule sends two molecules of glyceraldehyde-3-phosphate into the sec- 
ond phase of glycolysis and because two ATPs were consumed per glucose in the 
first-phase reactions, the phosphoglycerate kinase reaction “pays off” the ATP 
debt created by the priming reactions. As might be expected for a phosphoryl 
transfer enzyme, Mg?* ion is required for activity and the true nucleotide sub- 
strate for the reaction is MgADP-. It is appropriate to view the sixth and seventh 
reactions of glycolysis as a coupled pair, with 1,3-bisphosphoglycerate as an in- 
termediate. The phosphoglycerate kinase reaction is sufficiently exergonic at 
standard state to pull the G-3-P dehydrogenase reaction along. (In fact, the 
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aldolase and triose phosphate isomerase are also pulled forward by phospho- 
glycerate kinase.) The net result of these coupled reactions is 


Glyceraldehyde-3-phosphate + ADP + P; + NAD+ —> 
3-phosphoglycerate + ATP + NADH + H+ 
AG” = -12.6 kJ/mol (18.8) 


Another reflection of the coupling between these reactions lies in their values 
of AG under cellular conditions (Table 18.1). Despite its strongly negative AG°’, 
the phosphoglycerate kinase reaction operates at equilibrium in the erythrocyte 
(AG = 0.1 kJ/mol). In essence, the free energy available in the phosphoglycerate 
kinase reaction is used to bring the three previous reactions closer to equilib- 
rium. Viewed in this context, it is clear that ADP has been phosphorylated to 
form ATP at the expense of a substrate, namely, glyceraldehyde-3-phosphate. 
This is an example of substrate-level phosphorylation, a concept that will be en- 
countered again. (The other kind of phosphorylation, oxidative phosphorylation, 
is driven energetically by the transport of electrons from appropriate coen- 
zymes and substrates to oxygen. Oxidative phosphorylation will be covered in 
detail in Chapter 20.) Even though the coupled reactions exhibit a very favorable 
AG”, there are conditions (that is, high ATP and 3-phosphoglycerate levels) 
under which the phosphoglycerate kinase reaction can be reversed so that 3- 
phosphoglycerate is phosphorylated from ATP. 

An important regulatory molecule, 2,3-bisphosphoglycerate, is synthesized and 
metabolized by a pair of reactions that make a detour around the phosphoglycerate 
kinase reaction. 2,3-BPG, which stabilizes the deoxy form of hemoglobin and is pri- 
marily responsible for the cooperative nature of oxygen binding by hemoglobin (see 
Chapter 15), is formed from 1,3-bisphosphoglycerate by bisphosphoglycerate mutase 
(Figure 18.15). Interestingly, 3-phosphoglycerate is required for this reaction, which 
involves phosphoryl transfer from the C-1 position of 1,3-bisphosphoglycerate to the 
C-2 position of 3-phosphoglycerate (Figure 18.16). Hydrolysis of 2,3-BPG is carried 
out by 2,3-bisphosphoglycerate phosphatase. Although other cells contain only a trace of 
2,3-BPG, erythrocytes typically contain 4 to 5 mM 2,3-BPG. 


Reaction 8: Phosphoglycerate Mutase Catalyzes a Phosphoryl Transfer 


The remaining steps in the glycolytic pathway prepare for synthesis of the second ATP 
equivalent. This begins with the phosphoglycerate mutase reaction, in which the 
phosphoryl group of 3-phosphoglycerate is moved from C-3 to C-2. (The term mutase 
is applied to enzymes that catalyze migration of a functional group within a substrate 
molecule.) The free energy change for this reaction is very small under cellular con- 
ditions (AG = 0.83 kJ/mol in erythrocytes). Phosphoglycerate mutase enzymes iso- 
lated from different sources exhibit different reaction mechanisms. As shown in Fig- 
ure 18.17, the enzymes isolated from yeast and from rabbit muscle form phosphoenzyme 
intermediates, use 2,3-bisphosphoglycerate as a cofactor, and undergo intermolecular 
phosphoryl group transfers (in which the phosphate of the product 2-phosphoglyc- 
erate is not that from the 3-phosphoglycerate substrate). The prevalent form of phos- 
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FIGURE 18.15 Formation and decomposition of 2,3-bisphosphoglycerate. 
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FIGURE 18.16 The mutase that forms 2,3-BPG from 1,3-BPG requires 3-phosphoglycerate. The reaction is actu- 
ally an intermolecular phosphoryl transfer from C-1 of 1,3-BPG to C-2 of 3-PG. 
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FIGURE 18.17 A mechanism for the phosphoglycerate mutase 
reaction in rabbit muscle and in yeast. Zelda Rose of the Institute 


for Cancer Research in Philadelphia showed that the enzyme 2 
requires a small amount of 2,3-BPG to phosphorylate the histidine COO™ F 

residue before the mechanism can proceed. Prior to her work, ubo —o- 

the role of the phosphohistidine in this mechanism was not | | 

understood. H C—OH O7 


phoglycerate mutase is a phosphoenzyme, with a phosphoryl group covalently bound to 


a histidine residue at the active site. This phosphoryl group is transferred to the C-2 cuss ko. 
position of the substrate to form a transient, enzyme-bound 2,3-bisphosphoglycerate, 

which then decomposes by a second phosphoryl transfer from the C-3 position of the Gly! 

intermediate to the histidine residue on the enzyme. About once in every 100 enzyme AA Argô! 
turnovers, the intermediate, 2,3-bisphosphoglycerate, dissociates from the active site, His!83 


leaving an inactive, unphosphorylated enzyme. The unphosphorylated enzyme can 
be reactivated by binding 2,3-BPG. For this reason, maximal activity of phosphoglyc- " 
erate mutase requires the presence of small amounts of 2,3-BPG. Ag 


Arg? 





Reaction 9: Dehydration by Enolase Creates PEP 


Recall that prior to synthesizing ATP in the phosphoglycerate kinase reaction, it was 

necessary to first make a substrate having a high-energy phosphate. Reaction 9 of The catalytic histidine (His'™) at the active site of 
glycolysis similarly makes a high-energy phosphate in preparation for ATP synthesis. mre a ear (pabid = 
Enolase catalyzes the formation of phosphoenolpyruvate from 2-phosphoglycerate. The l i 
reaction involves the removal of a water molecule to form the enol structure of PEP. 

The AG*’ for this reaction is relatively small at 1.8 kJ/mol (Keq = 0.5); and, under 

cellular conditions, AG is very close to zero. In light of this condition, it may be dif- 

ficult at first to understand how the enolase reaction transforms a substrate with a 

relatively low free energy of hydrolysis into a product (PEP) with a very high free 
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FIGURE 18.18 The yeast enolase dimer is asymmetric. The active site of one subunit (a) contains 
2-phosphoglycerate, the enolase substrate. Also shown are a Mg?* ion (blue), a Lit ion (purple), and His", 
which participates in catalysis. The other subunit (b) binds phosphoenolpyruvate, the product of the enolase 
reaction. An active site water molecule (yellow), Mg?* (blue), and His'*? are also shown (pdb id = 2ONE). 


energy of hydrolysis. This puzzle is clarified by realizing that 2-phosphoglycerate and 
PEP contain about the same amount of potential metabolic energy, with respect to 
decomposition to P;, COs, and H2O. What the enolase reaction does is rearrange the 
substrate into a form from which more of this potential energy can be released upon 
hydrolysis. The enzyme is strongly inhibited by fluoride ion in the presence of phos- 
phate. Thomas Nowak has shown that fluoride, phosphate, and a divalent cation 
form a transition-state-like complex in the enzyme active site, with fluoride appar- 
ently mimicking the hydroxide ion nucleophile in the enolase reaction. 

Yeast enolase is a dimer of identical subunits. However, if the enzyme is crystal- 
lized in the presence of a mixture of the substrate (2-phosphoglycerate) and the 
product (phosphoenolpyruvate), the crystallized dimer is asymmetric! One subunit 
active site contains 2-phosphoglycerate, and the other contains PEP (Figure 18.18), 
thus providing a “before-and-after” picture of this glycolytic enzyme. 


Reaction 10: Pyruvate Kinase Yields More ATP 


The second ATP-synthesizing reaction of glycolysis is catalyzed by pyruvate kinase, 
which brings the pathway at last to its pyruvate branch point. Pyruvate kinase me- 
diates the transfer of a phosphoryl group from phosphoenolpyruvate to ADP to 
make ATP and pyruvate. The reaction requires Mg?* ion and is stimulated by K* 
and certain other monovalent cations. 

The corresponding Keq at 25°C is 3.63 X 10°, and it is clear that the pyruvate ki- 
nase reaction equilibrium lies very far to the right. Concentration effects reduce the 
magnitude of the free energy change somewhat in the cellular environment, but 
the A Gin erythrocytes is still quite favorable at —23.0 kJ/mol. The high free energy 
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The structure of the pyruvate kinase tetramer is sensitive to bound ligands. The inactive E. coli enzyme in the 
absence of ligands (left, pdb id = 1EOU). The active form of the yeast dimer, with fructose-1,6-bisphosphate (an 
allosteric regulator, blue), substrate analog (red), and K* (gold) (pdb id = 1A3W). 





OOC O— POJ OOC OH OOC O 
So Ke a Ae 
[ za IK > i 
Cedegi H T H 
Z X YN 
H H ADP ATP H H H 
PEP Enol Keto 
tautomer tautomer 
L Y J 
Pyruvate 


change for the conversion of PEP to pyruvate is due largely to the highly favorable 
and spontaneous conversion of the enol tautomer of pyruvate to the more stable 
keto form (Figure 18.19) following the phosphoryl group transfer step. 

The large negative AG of this reaction makes pyruvate kinase a suitable target site 
for regulation of glycolysis. For each glucose molecule in the glycolysis pathway, two 
ATPs are made at the pyruvate kinase stage (because two triose molecules were pro- 
duced per glucose in the aldolase reaction). Because the pathway broke even in terms 
of ATP at the phosphoglycerate kinase reaction (two ATPs consumed and two ATPs 
produced), the two ATPs produced by pyruvate kinase represent the “payoff” of 
glycolysis—a net yield of two ATP molecules. 

Pyruvate kinase possesses allosteric sites for numerous effectors. It is activated by 
AMP and fructose-1,6-bisphosphate and inhibited by ATP, acetyl-CoA, and alanine. 
(Note that alanine is the a-amino acid counterpart of the a-keto acid, pyruvate.) 
Furthermore, liver pyruvate kinase is regulated by covalent modification. Hormones 
such as glucagon activate a cAMP-dependent protein kinase, which transfers a phos- 
phoryl group from ATP to the enzyme. The phosphorylated form of pyruvate kinase 
is more strongly inhibited by ATP and alanine and has a higher K,, for PEP, so in 
the presence of physiological levels of PEP, the enzyme is inactive. Then PEP is used 
as a substrate for glucose synthesis in the gluconeogenesis pathway (to be described in 
Chapter 22), instead of going on through glycolysis and the citric acid cycle (or fer- 
mentation routes). A suggested active-site geometry for pyruvate kinase, based on 
NMR and EPR studies by Albert Mildvan and colleagues, is presented in Figure 
18.20. The carbonyl oxygen of pyruvate and the y-phosphorus of ATP lie within 
0.3 nm of each other at the active site, consistent with direct transfer of the phos- 
phoryl group without formation of a phosphoenzyme intermediate. 


FIGURE 18.19 The conversion of phosphoenolpyruvate 
(PEP) to pyruvate may be viewed as involving two steps: 
phosphoryl transfer followed by an enol-keto tautomer- 
ization. The tautomerization is spontaneous (AG” = 
—35-40 kJ/mol) and accounts for much of the free en- 
ergy change for PEP hydrolysis. 
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FIGURE 18.20 A mechanism for the pyruvate kinase 


reaction, based on 
Mildvan and colleag 


MR and EPR studies by Albert 
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1825» What Are the Metabolic Fates of NADH and Pyruvate 
Produced in Glycolysis? 


In addition to ATP, the products of glycolysis are NADH and pyruvate. Their pro- 
cessing depends upon other cellular pathways. NADH must be recycled to NAD*, 
lest NAD* become limiting in glycolysis. NADH can be recycled by both aerobic and 
anaerobic paths, either of which results in further metabolism of pyruvate. What a 
given cell does with the pyruvate produced in glycolysis depends in part on the avail- 
ability of oxygen. Under aerobic conditions, pyruvate can be sent into the citric acid 
cycle (also known as the TCA cycle; see Chapter 19), where it is oxidized to CO» 
with the production of additional NADH (and FADHo). Under aerobic conditions, 
the NADH produced in glycolysis and the citric acid cycle is reoxidized to NAD* in 
the mitochondrial electron-transport chain (see Chapter 20). 


Anaerobic Metabolism of Pyruvate Leads to Lactate or Ethanol 


Under anaerobic conditions, the pyruvate produced in glycolysis is processed differ- 
ently. In yeast, it is reduced to ethanol; in other microorganisms and in animals, it is 
reduced to lactate. These processes are examples of fermentation—the production of 
ATP energy by reaction pathways in which organic molecules function as donors and 
acceptors of electrons. In either case, reduction of pyruvate provides a means of re- 
oxidizing the NADH produced in the glyceraldehyde-3-phosphate dehydrogenase 
reaction of glycolysis (Figure 18.21). In yeast, alcoholic fermentation is a two-step 
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FIGURE 18.21 (a) Pyruvate reduction to ethanol in yeast provides a means for regenerating NAD* consumed 
in the glyceraldehyde-3-P dehydrogenase reaction. (b) In oxygen-depleted muscle, NAD* is regenerated in the 
lactate dehydrogenase reaction. 


process. Pyruvate is decarboxylated to acetaldehyde by pyruvate decarboxylase in an 
essentially irreversible reaction. Thiamine pyrophosphate (see page 568) is a required 
cofactor for this enzyme. The second step, the reduction of acetaldehyde to ethanol 
by NADH, is catalyzed by alcohol dehydrogenase (Figure 18.21). At pH 7, the reaction 
equilibrium strongly favors ethanol. The end products of alcoholic fermentation are 
thus ethanol and carbon dioxide. Alcoholic fermentations are the basis for the brew- 
ing of beers and the fermentation of grape sugar in wine making. Lactate produced 
by anaerobic microorganisms during lactic acid fermentation is responsible for the 
taste of sour milk and for the characteristic taste and fragrance of sauerkraut, which 
in reality is fermented cabbage. 


Lactate Accumulates Under Anaerobic Conditions in Animal Tissues 


In animal tissues experiencing anaerobic conditions, pyruvate is reduced to lactate. 
Pyruvate reduction occurs in tissues that normally experience minimal access to 
blood flow (for example, the cornea of the eye) and also in rapidly contracting 
skeletal muscle. When skeletal muscles are exercised strenuously, the available tis- 
sue oxygen is consumed and the pyruvate generated by glycolysis can no longer be 
oxidized in the TCA cycle. Instead, excess pyruvate is reduced to lactate by lactate 
dehydrogenase (Figure 18.21). The rate of anaerobic glycolysis in skeletal muscle 
can increase up to 2000-fold almost instantaneously, for example, to support the in- 
tense demands of a sprinting animal. Large amounts of ATP are generated rapidly, 
at the expense of lactate accumulation. In anaerobic muscle tissue, lactate repre- 
sents the end of glycolysis. Anyone who exercises to the point of depleting available 
muscle oxygen stores knows the cramps and muscle fatigue associated with the 
buildup of lactic acid in the muscle. Most of this lactate must be carried out of the 
muscle by the blood and transported to the liver, where it can be resynthesized into 
glucose in gluconeogenesis. Moreover, because glycolysis generates only a fraction 
of the total energy available from the breakdown of glucose (the rest is generated 
by the TCA cycle and oxidative phosphorylation), the onset of anaerobic conditions 
in skeletal muscle also means a reduction in the energy available from the break- 
down of glucose. 
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FIGURE 18.22 A comparison of free energy changes for 
he reactions of glycolysis (step 1 = hexokinase) under 
(a) standard-state conditions and (b) actual intracellular 
conditions in erythrocytes. The values of AG” provide 
ittle insight into the actual free energy changes that 
occur in glycolysis. On the other hand, under intracellu- 
ar conditions, seven of the glycolytic reactions operate 
near equilibrium (with AG near zero). The driving force 
or glycolysis lies in the hexokinase (1), phosphofructo- 
inase (3),and pyruvate kinase (10) reactions. The lactate 
dehydrogenase (step 11) reaction also exhibits a large 
negative AG under cellular conditions. 





18:6: How Do Cells Regulate Glycolysis? 


The elegance of nature’s design for the glycolytic pathway may be appreciated through 
an examination of Figure 18.22. The standard-state free energy changes for the 10 re- 
actions of glycolysis and the lactate dehydrogenase reaction (Figure 18.22a) are vari- 
ously positive and negative and, taken together, offer little insight into the coupling 
that occurs in the cellular milieu. On the other hand, the values of AG under cellular 
conditions (Figure 18.22b) fall into two distinct classes. For reactions 2 and 4 through 
9, AGis very close to zero, meaning these reactions operate essentially at equilibrium. 
Small changes in the concentrations of reactants and products could “push” any of 
these reactions either forward or backward. By contrast, the hexokinase, phospho- 
fructokinase, and pyruvate kinase reactions all exhibit large negative AG values under 
cellular conditions. These reactions are thus the sites of glycolytic regulation. When 
these three enzymes are active, glycolysis proceeds and glucose is readily metabolized 
to pyruvate or lactate. Inhibition of the three key enzymes by allosteric effectors brings 
glycolysis to a halt. When we consider gluconeogenesis—the biosynthesis of glucose— 
in Chapter 22, we will see that different enzymes are used to carry out reactions 1, 3, 
and 10 in reverse, effecting the net synthesis of glucose. The maintenance of reactions 
2 and 4 through 9 at or near equilibrium permits these reactions (and their respective 
enzymes!) to operate effectively in either the forward or reverse direction. 


18:7 Are Substrates Other Than Glucose Used in Glycolysis? 


The glycolytic pathway described in this chapter begins with the breakdown of 
glucose, but other sugars, both simple and complex, can enter the cycle if they 
can be converted by appropriate enzymes to one of the intermediates of glycoly- 
sis. Figure 18.23 shows the routes by which several simple metabolites can enter 
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FIGURE 18.23 Mannose, galactose, fructose, and 
other simple metabolites can enter the glycolytic 
pathway. 
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Tumor Diagnosis Using Positron Emission Tomography (PET) 


More than 70 years ago, Otto Warburg at the Kaiser Wilhelm In- 
stitute of Biology in Germany demonstrated that most animal and 
human tumors displayed a very high rate of glycolysis compared 
to that of normal tissue. This observation from long ago is the ba- 
sis of a modern diagnostic method for tumor detection called 
positron emission tomography, or PET. PET uses molecular 
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probes that contain a neutron-deficient, radioactive element such 
as carbon-11 or fluorine-18. An example is 2-[!8F] fluoro-2-deoxy- 
glucose (FDG), a molecular mimic of glucose. The '8F nucleus is 
unstable and spontaneously decays by emission of a positron (an 
antimatter* particle) from a proton, thus converting a proton to 
a neutron and transforming the I8F to 18O. The emitted positron 
typically travels a short distance (less than a millimeter) and col- 
lides with an electron, annihilating both particles and creating a 
pair of high-energy photons—gamma rays. Detection of the 
gamma rays with special cameras can be used to construct three- 
dimensional models of the location of the radiolabeled molecu- 
lar probe in the tissue of interest. 

FDG is taken up by human cells and converted by hexokinase 
to 2-[!SF] fluoro-2-deoxy-glucose-6-phosphate in the first step of 
glycolysis. Cells of a human brain, for example, accumulate FDG 
in direct proportion to the amount of glycolysis occuring in those 
cells. Tumors can be identified in PET scans as sites of unusually 
high FDG accumulation. 


(c) PET image of human brain following administration of 
ISFDG. Red area indicates a large malignant tumor. 
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*The existence of antimatter in the form of positrons was first postulated by Robert Oppenheimer, the father of the atomic bomb. 
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the glycolytic pathway. Fructose, for example, which is produced by breakdown of 
sucrose, may participate in glycolysis by at least two different routes. In the liver, 
fructose is phosphorylated at C-1 by the enzyme fructokinase: 


D-Fructose + ATP’ —> p-fructose-1-phosphate?~ + ADP” + H+ (18.9) 


Subsequent action by fructose-1-phosphate aldolase cleaves fructose-1-P in a man- 
ner like the fructose bisphosphate aldolase reaction to produce dihydroxyacetone 
phosphate and D-glyceraldehyde: 


p-Fructose-1-P?” —> p-glyceraldehyde + dihydroxyacetone phosphate?~ (18.10) 


Dihydroxyacetone phosphate is of course an intermediate in glycolysis. D-Glycer- 
aldehyde can be phosphorylated by triose kinase in the presence of ATP to form 
b-glyceraldehyde-3-phosphate, another glycolytic intermediate. 

In the kidney and in muscle tissues, fructose is readily phosphorylated by hex- 
okinase, which, as pointed out previously, can utilize several different hexose sub- 
strates. The free energy of hydrolysis of ATP drives the reaction forward: 


p-Fructose + ATP —> p-fructose-6-phosphate?~ + ADP®~ + H+ (18.11) 
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FIGURE 18.24 Galactose metabolism via the Leloir 
pathway. 


Fructose-6-phosphate generated in this way enters the glycolytic pathway directly in 
step 3, the second priming reaction. This is the principal means for channeling 
fructose into glycolysis in adipose tissue, which contains high levels of fructose. 


Mannose Enters Glycolysis in Two Steps 


Another simple sugar that enters glycolysis at the same point as fructose is 
mannose, which occurs in many glycoproteins, glycolipids, and polysaccharides 
(see Chapter 7). Mannose is also phosphorylated from ATP by hexokinase, and 
the mannose-6-phosphate thus produced is converted to fructose-6-phosphate by 
phosphomannoisomerase. 


p-Mannose + ATP* — p-mannose-6-phosphate*~ + ADP?- + H+ (18.12) 


p-Mannose-6-phosphate?~- ——> D-fructose-6-phosphate?~ (18.13) 


Galactose Enters Glycolysis Via the Leloir Pathway 


A somewhat more complicated route into glycolysis is followed by galactose, an- 
other simple hexose sugar. The process, called the Leloir pathway after Luis 
Leloir, its discoverer, begins with phosphorylation from ATP at the C-1 position by 
galactokinase: 


p-Galactose + ATP} ——> p-galactose-1-phosphate?~ + ADP? + H* (18.14) 


Galactose-1-phosphate is then converted into UDP-galactose (a sugar nucleotide) 
by galactose-1-phosphate uridylyltransferase (Figure 18.24), with concurrent pro- 
duction of glucose-l-phosphate and consumption of a molecule of UDP-glucose. 
The uridylyltransferase reaction (Figure 18.25) proceeds via a “ping-pong” mech- 
anism (see Chapter 13, page 406) with a covalent enzyme-UMP intermediate. The 
glucose-1-phosphate produced by the transferase reaction is a substrate for the 
phosphoglucomutase reaction (Figure 18.24), which produces  glucose-6- 
phosphate, a glycolytic substrate. The other transferase product, UDP-galactose, is 
converted to UDP-glucose by UDP-glucose-4-epimerase. The combined action of 
the uridylyltransferase and epimerase thus produces glucose-1-P from galactose- 
1-P, with regeneration of UDP-glucose. 

A rare hereditary condition known as galactosemia involves defects in galactose- 
1-P uridylyltransferase that render the enzyme inactive. Toxic levels of galactose 
accumulate in afflicted individuals, causing cataracts and permanent neurological 
disorders. These problems can be prevented by removing galactose and lactose 
from the diet. In adults, the toxicity of galactose appears to be less severe, due in 
part to the metabolism of galactose-1-P by UDP-glucose pyrophosphorylase, which 
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apparently can accept galactose-1-P in place of glucose-1-P (Figure 18.26). The 
levels of this enzyme may increase in galactosemic individuals in order to accom- 
modate the metabolism of galactose. 


An Enzyme Deficiency Causes Lactose Intolerance 


A much more common metabolic disorder, lactose intolerance, occurs com- 
monly in most parts of the world (notable exceptions being some parts of Africa 
and northern Europe). Lactose intolerance is an inability to digest lactose be- 
cause of the absence of the enzyme lactase in the intestines of adults. The symp- 
toms of this disorder, which include diarrhea and general discomfort, can be re- 
lieved by eliminating milk from the diet. 


Glycerol Can Also Enter Glycolysis 


Glycerol is the last important simple substance whose ability to enter the glycolytic 
pathway must be considered. This metabolite, which is produced in substantial 
amounts by the decomposition of triacylglycerols (see Chapter 23), can be converted 
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FIGURE 18.25 The galactose-1-phosphate uridylyl- 
transferase reaction involves a “ping-pong” kinetic 
mechanism. 


FIGURE 18.26 The UDP-glucose pyrophosphorylase 
reaction also works with galactose-1-P. 
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HUMAN BIOCHEMISTRY 


Lactose is an interesting sugar in many ways. In placental mam- 
mals, it is synthesized only in the mammary gland, and then only 
during late pregnancy and lactation. The synthesis is carried out 
by lactose synthase, a dimeric complex of two proteins: galactosyl 
transferase and a-lactalbumin. Galactosyl transferase is present in 
all human cells, and it is normally involved in incorporation of 
galactose into glycoproteins. In late pregnancy, the pituitary 
gland in the brain releases a protein hormone, prolactin, which 
triggers production of a-lactalbumin by certain cells in the breast. 
a-Lactalbumin, a 123-residue protein, associates with galactosyl 
transferase to form lactose synthase, which catalyzes the reaction: 


UDP-galactose + glucose—-> lactose + UDP 


Lactose breakdown by lactase in the small intestine provides 
newborn mammals with essential galactose for many purposes, 
including the synthesis of gangliosides in the developing brain. 
Lactase is a B-galactosidase that cleaves lactose to yield galactose 
and glucose—in fact, the only human enzyme that can cleave a 
B-glycosidic linkage: 





Breakdown of lactose to galactose and glucose by lactase. 


Portions adapted from Hill, R., and Brew, K., 1975. Lactose synthetase. Ad- 
vances in Enzymology 43:411—485; and Bloch, K., 1994. Blondes in Venetian 
Paintings, the Nine-Banded Armadillo, and Other Essays in Biochemistry. New 
Haven, CT: Yale University Press. 





Lactose—From Mother’s Milk to Yogurt—and Lactose Intolerance 


Lactase is an inducible enzyme in mammals, and it appears in the 
fetus only during the late stages of gestation. Lactase activity peaks 
shortly after birth, but by the age of 3 to 5 years, it declines to a low 
level in nearly all human children. Low levels of lactase make many 
adults lactose intolerant. Lactose intolerance occurs commonly in 
most parts of the world (with the notable exception of some parts of 
Africa and northern Europe; see table). The symptoms of lactose in- 
tolerance, including diarrhea and general discomfort, can be 
relieved by eliminating milk from the diet. Alternatively, products 
containing -galactosidase are available commercially. 

Certain bacteria, including several species of Lactobacillus, thrive 
on the lactose in milk and carry out lactic acid fermentation, con- 
verting lactose to lactate via glycolysis. This is the basis of produc- 
tion of yogurt, which is now popular in the Western world but of 
Turkish origin. Other cultures also produce yogurtlike foods. No- 
madic Tatars in Siberia and Mongolia used camel milk to make 
koumiss, which was used for medicinal purposes. In the Caucasus, 
kefir is made much like yogurt, except that the starter culture con- 
tains (in addition to Lactobacillus) Streptococcus lactis and yeast, 
which convert some of the glucose to ethanol and COs, producing 
an effervescent and slightly intoxicating brew. 











Country Lactase Persistence (%) 
Sweden 99 
Denmark 97 
United Kingdom (Scotland) 95 
Germany 88 
Australia 82 
United States (Iowa) 81 
Spain 72 
France 58 
India 36 
Japan 10 
China (Singapore) 0 








Adapted from Bloch, K., 1994. Blondes in Venetian Paintings, the Nine-Banded Armadillo, 
and Other Essays in Biochemistry. New Haven, CT: Yale University Press. 





Glycerol kinase 


reaction 


CH,OH CH,OH 
| Mg?+ | 
HOCH + ATP ———> HOCH + ADP 
| | 
CHOH CH,OPO3- 
Glycerol sn-Glycerol-3-phosphate 





to glycerol-3-phosphate by the action of glycerol kinase and then oxidized to dihy- 
droxyacetone phosphate by the action of glycerol phosphate dehydrogenase, with 
NAD” as the required coenzyme. The dihydroxyacetone phosphate thereby produced 
enters the glycolytic pathway as a substrate for triose phosphate isomerase. 
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CH,OH CHOH 
H a H + NAD —— > l =0 + NADH + Ht Glycerol-P dehydrogenase 
| | reaction 
CH ,OPO3- CH,OPO3- 
sn-Glycerol-3-phosphate Dihydroxyacetone 
phosphate 


18:8: How Do Cells Respond to Hypoxic Stress? 


Glycolysis is an anaerobic pathway—it does not require oxygen. But as noted in Fig- 
ure 18.1, operation of the TCA cycle (the subject of Chapter 19) depends on oxy- 
gen, so it is aerobic. When oxygen is abundant, cells prefer aerobic metabolism, 
which yields more energy per glucose consumed. However, as Louis Pasteur first 
showed, when oxygen is limited, cells adapt to make the most of glycolysis, the less 
energetic, anaerobic alternative. In mammalian tissues, hypoxia (oxygen limitation) 
can cause changes in gene expression that result in increased angiogenesis (the 
growth of new blood vessels), increased synthesis of red blood cells, and increased 
levels of some glycolytic enzymes (and thus a higher rate of glycolysis). 

What is the molecular basis for the increased expression of glycolytic enzymes? 
One of the triggers for this expression is a DNA-binding protein called hypoxia 
inducible factor (HIF). HIF is a heterodimer of a constitutive nuclear subunit 
(HIF-18) and an inducible a-subunit. Both subunits are basic helix-loop-helix tran- 
scription factors that bind to hypoxia-inducible genes, and both subunits exist as a 
series of isoforms (for example, HIF-la, HIF-2a, and HIF-3a). HIF-a subunit regu- 
lation is a multistep process that includes gene splicing, phosphorylation, acetyla- 
tion, and hydroxylation. HIF-la is the best-studied HIF-a isoform. When oxygen is 
plentiful, HIF-la is hydroxylated by oxygen-dependent prolyl hydroxylases (PHDs) 
at Pro* and Pro®t. These hydroxylations ensure its binding to ubiquitin E3 ligase, 
which leads to rapid proteolysis by the 26S proteasome (see Chapter 31). HIF-la 
binding to the ligase is also promoted by acetylation of Lys? by the ARD1 acetyl- 
transferase. In addition, the presence of oxygen induces the hydroxylation of 
HIF-la Asn” by the hydroxylase factor—-inhibiting HIF (FIH-1). Hydroxylation in- 
hibits the transcription activity of HIF-la by preventing its interaction with the acti- 
vator p300. Figure 18.27 shows the structure of FIH bound to a fragment of HIF-la. 

Because PHDs and FIH-1 both are oxygen-dependent, lowering oxygen concen- 
tration means that HIF-la avoids degradation and is available to promote gene tran- 
scription (Figure 18.28). Phosphorylation of HIF-la by a protein kinase promotes 


= Oxygen levels 
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Transcription 





FIGURE 18.27 FIH (green) bound to HIF. 


FIGURE 18.28 The HIF transcription factor is composed 
of two subunits: a ubiquitous HIF-18 subunit and a 
hypoxia-responsive HIF-1œ subunit. In response to 
hypoxia, inactivation of the PHDs allows HIF-1q@ stabiliza- 
tion, dimerization with HIF-1B, binding of the dimer to 
the hypoxia responsive element (HRE) of HIF target 
genes, and activation of the transcription of these genes. 
VHL is the von Hippel Lindau subunit of the ubiquitin 

E3 ligase that targets proteins for proteasome degrada- 
tion. (Adapted from North, S., Moenner, M, and Bikfalvi, A., 2006. 
Recent developments in the regulation of the angiogenic 











switch by cellular switch factors in tumors. Cancer Letters 
218:1-14.) 
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binding of HIF-la@ to HIF-1£, which enhances transcription. HIF-la—HIF-18 dimers 
bind to hypoxia responsive elements (HREs), activating transcription of HRE- 
regulated genes, including genes for glycolytic enzymes. Pasteur observed more 
than 100 years ago that fermentation amounted to “life without air.” The “Pasteur 
effect” depends on HIFmediated activation of the genes encoding glycolytic 
enzymes in the absence of oxygen. The linking of glycolytic activity to oxygen level 
is the result of an exquisite dance of oxygen-sensitive enzymes with proteins, which 
undergo covalent modifications that control protein-protein and protein-DNA 
interactions, a dance that Pasteur could hardly have anticipated. 


SUMMARY 


Nearly every living cell carries out a catabolic process known as glycolysis— 
the stepwise degradation of glucose (and other simple sugars). Glycolysis 
is a paradigm of metabolic pathways. Localized in the cytosol of cells, it is 
basically an anaerobic process; its principal steps occur with no require- 
ment for oxygen. 


18.1 What Are the Essential Features of Glycolysis? Glycolysis con- 
sists of two phases. In the first phase, a series of five reactions, glucose is 
broken down to two molecules of glyceraldehyde-3-phosphate. In the 
second phase, five subsequent reactions convert these two molecules of 
glyceraldehyde-3-phosphate into two molecules of pyruvate. Phase 1 
consumes two molecules of ATP. The later stages of glycolysis result in 
the production of four molecules of ATP. The net is 
4 — 2 = 2 molecules of ATP produced per molecule of glucose. 


18.2 Why Are Coupled Reactions Important in Glycolysis? Coupled 
reactions permit the energy of glycolysis to be used for generation of 
ATP. Conversion of glucose to pyruvate in glycolysis drives the produc- 
tion of two molecules of ATP. 


18.3 What Are the Chemical Principles and Features of the First Phase 
of Glycolysis? In the first phase of glycolysis, glucose is converted into 
two molecules of glyceraldehyde-3-phosphate. First, glucose is phos- 
phorylated to glucose-6-P, which is isomerized to fructose-6-P. Another 
phosphorylation and then cleavage yields two 3-carbon intermediates. 
One of these is glyceraldehyde-3-P, and the other, dihydroxyacetone-P, 
is converted to glyceraldehyde-3-P. Energy released from this high- 
energy molecule in the second phase of glycolysis is then used to syn- 
thesize ATP. 


18.4 What Are the Chemical Principles and Features of the Second Phase 
of Glycolysis? The second half of the glycolytic pathway involves the re- 
actions that convert the metabolic energy in the glucose molecule into 
ATP. Phase 2 starts with the oxidation of glyceraldehyde-3-phosphate, a re- 
action with a large enough energy “kick” to produce a high-energy phos- 
phate, namely, 1,3-bisphosphoglycerate. Phosphoryl transfer from 1,3-BPG 
to ADP to make ATP is highly favorable. The product, 3-phosphoglycerate, 
is converted via several steps to phosphoenolpyruvate (PEP), another 
high-energy phosphate. PEP readily transfers its phosphoryl group to ADP 
in the pyruvate kinase reaction to make another ATP. 


18.5 What Are the Metabolic Fates of NADH and Pyruvate Produced in 
Glycolysis? In addition to ATP, the products of glycolysis are NADH 
and pyruvate. Their processing depends upon other cellular pathways. 
NADH must be recycled to NAD*, lest NAD* become limiting in gly- 
colysis. NADH can be recycled by both aerobic and anaerobic paths, ei- 


ther of which results in further metabolism of pyruvate. What a given 
cell does with the pyruvate produced in glycolysis depends in part on 
the availability of oxygen. Under aerobic conditions, pyruvate can be 
sent into the citric acid cycle, where it is oxidized to CO, with the pro- 
duction of additional NADH (and FADHg). Under aerobic conditions, 
the NADH produced in glycolysis and the citric acid cycle is reoxidized 
to NAD* in the mitochondrial electron-transport chain. 

Under anaerobic conditions, the pyruvate produced in glycolysis is 
not sent to the citric acid cycle. Instead, it is reduced to ethanol in yeast; 
in other microorganisms and in animals, it is reduced to lactate. These 
processes are examples of fermentation—the production of ATP energy 
by reaction pathways in which organic molecules function as donors 
and acceptors of electrons. In either case, reduction of pyruvate provides 
a means of reoxidizing the NADH produced in the glyceraldehyde-3- 
phosphate dehydrogenase reaction of glycolysis. 


18.6 How Do Cells Regulate Glycolysis?. The standard-state free en- 
ergy changes for the 10 reactions of glycolysis are variously positive and 
negative and, taken together, offer little insight into the coupling that 
occurs in the cellular milieu. On the other hand, the values of AG un- 
der cellular conditions fall into two distinct classes. For reactions 2 and 
4 through 9, AG is very close to zero, meaning these reactions operate 
essentially at equilibrium. Small changes in the concentrations of reac- 
tants and products could “push” any of these reactions either forward 
or backward. By contrast, the hexokinase, phosphofructokinase, and 
pyruvate kinase reactions all exhibit large negative A G values under cel- 
lular conditions. These reactions are thus the sites of glycolytic regula- 
tion. 


18.7 Are Substrates Other Than Glucose Used in Glycolysis? Fructose 
enters glycolysis by either of two routes. Mannose, galactose, and glyc- 
erol enter via reactions that are linked to the glycolytic pathway. 


18.8 How Do Cells Respond to Hypoxic Stress? Glycolysis is an anaer- 
obic pathway, but it normally feeds pyruvate into aerobic metabolic 
pathways. However, when oxygen is limited, cells adapt to make the 
most of glycolysis. In mammalian tissues, oxygen limitation (hypoxia) 
can cause changes in gene expression that result in increased angio- 
genesis, red blood cell synthesis, and elevated levels of some glycolytic 
enzymes. One of the triggers for this expression is a DNA-binding pro- 
tein, HIF, which binds to hypoxia-inducible genes. HIF-a regulation is a 
multistep process that includes gene splicing, phosphorylation, acetyla- 
tion, and hydroxylation. The Pasteur effect depends on HIF-mediated 
activation of the genes encoding glycolytic enzymes in the absence of 
oxygen. 
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PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 


1. 


10. 


List the reactions of glycolysis that 

a. are energy consuming (under standard-state conditions). 

b. are energy yielding (under standard-state conditions). 

c. consume ATP. 

d. yield ATP. 

e. are strongly influenced by changes in concentration of substrate 
and product because of their molecularity. 

f. are at or near equilibrium in the erythrocyte (see Table 18.2). 


- Determine the anticipated location in pyruvate of labeled carbons 


if glucose molecules labeled (in separate experiments) with C at 
each position of the carbon skeleton proceed through the glycolytic 
pathway. 


. In an erythrocyte undergoing glycolysis, what would be the effect of 


a sudden increase in the concentration of 


a. ATP? b. AMP? 
c. fructose-1,6-bisphosphate? d. fructose-2,6-bisphosphate? 
e. citrate? f. glucose-6-phosphate? 


. Discuss the cycling of NADH and NAD* in glycolysis and the related 


fermentation reactions. 


- For each of the following reactions, name the enzyme that carries 


out this reaction in glycolysis and write a suitable mechanism for the 
reaction. 





CH,OPO.2 
C=O 
CH,OPO3? CHO 
HOCH | | 
== C=O + HCOH 
HCOH | | 
CHOH CH,OPO,2- 
HCOH 
CH,OPO.2 
O OPO; 
da 
ies I 
i i —— HCOH 
CH,OPO;2— CH,OPO;2— 


. Write the reactions that permit galactose to be utilized in glycolysis. 


Write a suitable mechanism for one of these reactions. 


. (Integrates with Chapters 4 and 14.) How might iodoacetic acid 


affect the glyceraldehyde-3-phosphate dehydrogenase reaction in 
glycolysis? Justify your answer. 


. If P-labeled inorganic phosphate were introduced to erythrocytes 


undergoing glycolysis, would you expect to detect **P in glycolytic 
intermediates? If so, describe the relevant reactions and the °P 
incorporation you would observe. 


. Sucrose can enter glycolysis by either of two routes: 


Sucrose phosphorylase: 

Sucrose + P; == fructose + glucose-1-phosphate 
Invertase: 
Sucrose + HO == fructose + glucose 





Would either of these reactions offer an advantage over the other in 
the preparation of hexoses for entry into glycolysis? 

What would be the consequences of a Mg** ion deficiency for the 
reactions of glycolysis? 


11. 


12. 


13. 


14. 


15. 


(Integrates with Chapter 3.) Triose phosphate isomerase catalyzes 
the conversion of dihydroxyacetone-P to glyceraldehyde-3-P. The 
standard free energy change, A G°’, for this reaction is +7.6 kJ/mol. 
However, the observed free energy change (AG) for this reaction in 
erythrocytes is +2.4 kJ/mol. 

a. Calculate the ratio of [dihydroxyacetone-P] /[glyceraldehyde-3-P] 

in erythrocytes from AG. 
b. If [dihydroxyacetone-P] = 0.2 mM, what is [glyceraldehyde-3-P]? 


(Integrates with Chapter 3.) Enolase catalyzes the conversion of 
2-phosphoglycerate to phosphoenolpyruvate + HO. The standard 
free energy change, A G°’, for this reaction is +1.8 kJ/mol. If the 
concentration of 2-phosphoglycerate is 0.045 mM and the concen- 
tration of phosphoenolpyruvate is 0.034 mM, what is AG, the free 
energy change for the enolase reaction, under these conditions? 
(Integrates with Chapter 3.) The standard free energy change 
(AG°’) for hydrolysis of phosphoenolpyruvate (PEP) is —61.9 kJ/mol. 
The standard free energy change (AG°’) for ATP hydrolysis is 
—30.5 kJ/mol. 

a. What is the standard free energy change for the pyruvate kinase 

reaction: 


ADP + phosphoenolpyruvate—- ATP + pyruvate 


b. What is the equilibrium constant for this reaction? 

c. Assuming the intracellular concentrations of [ATP] and [ADP] 
remain fixed at 8 mM and | mM, respectively, what will be the 
ratio of [pyruvate] /[phosphoenolpyruvate] when the pyruvate ki- 
nase reaction reaches equilibrium? 

(Integrates with Chapter 3.) The standard free energy change 

(4G°’) for hydrolysis of fructose-1,6-bisphosphate (FBP) to fructose- 

6-phosphate (F-6-P) and P; is —16.7 kJ/mol: 


FBP + HO——> fructose-6-P + P; 


The standard free energy change (AG°’) for ATP hydrolysis is 
—30.5 kJ/mol: 


ATP + HoO— > ADP + P; 


a. What is the standard free energy change for the phosphofructo- 
kinase reaction: 


ATP + fructose-6-P——> ADP + FBP 


b. What is the equilibrium constant for this reaction? 

c. Assuming the intracellular concentrations of [ATP] and [ADP] 
are maintained constant at 4 mM and 1.6 mM, respectively, in a rat 
liver cell, what will be the ratio of [FBP] /[fructose-6-P] when the 
phosphofructokinase reaction reaches equilibrium? 

(Integrates with Chapter 3.) The standard free energy change 

(AG°’) for hydrolysis of 1,3-bisphosphoglycerate (1,3-BPG) to 3- 

phosphoglycerate (3-PG) and P; is —49.6 kJ/mol: 


1,3BPG + HO—>3-PG + P, 


The standard free energy change (AG°’) for ATP hydrolysis is 
—30.5 kJ/mol: 


ATP + HyO—> ADP + P; 


a. What is the standard free energy change for the phosphoglycerate 
kinase reaction: 


ADP + 1,3-BPG——> ATP + 3-PG 


b. What is the equilibrium constant for this reaction? 

c. If the steady-state concentrations of [1,3-BPG] and [3-PG] in an 
erythrocyte are 1 uM and 120 uM, respectively, what will be the 
ratio of [ATP]/ [ADP], assuming the phosphoglycerate kinase re- 
action is at equilibrium? 


562 Chapter18 = Glycolysis 


16. The standard-state free energy change, A G°’, for the hexokinase re- 
action is — 16.7 kJ/mol. Use the values in Table 18.2 to calculate the 
value of AG for this reaction in the erythrocyte at 37°C. 


17. Taking into consideration the equilibrium constant for the adeny- 
late kinase reaction (page 542), calculate the change in concentra- 
tion in AMP that would occur if 8% of the ATP in an erythrocyte 
(red blood cell) were suddenly hydrolyzed to ADP. In addition to 
the concentration values in Table 18.2, it may be useful to assume 
that the initial concentration of AMP in erythrocytes is 5 uM. 


18. Fructose bisphosphate aldolase in animal muscle is a class I aldolase, 
which forms a Schiff base intermediate between substrate (for ex- 
ample, fructose-1,6-bisphosphate or dihydroxyacetone phosphate) 
and a lysine at the active site (see Figure 18.12). The chemical evi- 
dence for this intermediate comes from studies with aldolase and 
the reducing agent sodium borohydride, NaBHy. Incubation of the 
enzyme with dihydroxyacetone phosphate and NaBH, inactivates 
the enzyme. Interestingly, no inactivation is observed if NaBH, is 
added to the enzyme in the absence of substrate. Write a mechanism 
that explains these observations and provides evidence for the for- 
mation of a Schiff base intermediate in the aldolase reaction. 

19. As noted on page 556, the galactose-1-phosphate uridylyltransferase 
reaction proceeds via a ping-pong mechanism. Consult Chapter 13, 
page 406, to refresh your knowledge of ping-pong mechanisms, and 


draw a diagram to show how a ping-pong mechanism would pro- 
ceed for the uridylyltransferase. 


20. Genetic defects in glycolytic enzymes can have serious conse- 
quences for humans. For example, defects in the gene for pyruvate 
kinase can result in a condition known as hemolytic anemia. Con- 
sult a reference to learn about hemolytic anemia, and discuss why 
such genetic defects lead to this condition. 


Preparing for the MCAT Exam 


21. Regarding phosphofructokinase, which of the following statements 
is true: 
a. Low ATP stimulates the enzyme, but fructose-2,6-bisphosphate 
inhibits. 
b. High ATP stimulates the enzyme, but fructose-2,6-bisphosphate 
inhibits. 
c. High ATP stimulates the enzyme, but fructose-2,6-bisphosphate 
inhibits. 
d. The enzyme is more active at low ATP than at high, and fructose- 
2,6-bisphosphate activates the enzyme. 
e. ATP and fructose-2,6-bisphosphate both inhibit the enzyme. 
22. Based on your reading of this chapter, what would you expect to be 
the most immediate effect on glycolysis if the steady-state concen- 
tration of glucose-6-P were 8.3 mM instead of 0.083 mM? 
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9 The Tricarboxylic Acid Cycle 


ESSENTIAL QUESTION 


The glycolytic pathway converts glucose to pyruvate and produces two molecules 
of ATP per glucose—only a small fraction of the potential energy available from 
glucose. Under anaerobic conditions, pyruvate is reduced to lactate in animals and 
to ethanol in yeast, and much of the potential energy of the glucose molecule 
remains untapped. In the presence of oxygen, however, a much more interesting 
and thermodynamically complete story unfolds. 


How is pyruvate oxidized under aerobic conditions, and what is the chemical 
logic that dictates how this process occurs? 


Under aerobic conditions, pyruvate from glycolysis is converted to acetyl-coenzyme 
A (acetyl-CoA) and oxidized to CO in the tricarboxylic acid (TCA) cycle (also 
called the citric acid cycle). The electrons liberated by this oxidative process are 
passed via NADH and FADH; through an elaborate, membrane-associated electron- 
transport pathway to Os, the final electron acceptor. Electron transfer is coupled to 
creation of a proton gradient across the membrane. Such a gradient represents an 
energized state, and the energy stored in this gradient is used to drive the synthesis 
of many equivalents of ATP. 

ATP synthesis as a consequence of electron transport is termed oxidative 
phosphorylation; the complete process is diagrammed in Figure 19.1. Aerobic 
pathways permit the production of 30 to 38 molecules of ATP per glucose oxidized. 
Although two molecules of ATP come from glycolysis and two more directly out of 
the TCA cycle, most of the ATP arises from oxidative phosphorylation. Specifically, 
reducing equivalents released in the oxidative reactions of glycolysis, pyruvate de- 
carboxylation, and the TCA cycle are captured in the form of NADH and enzyme- 
bound FADH,, and these reduced coenzymes fuel the electron-transport pathway 
and oxidative phosphorylation. 

Complete oxidation of glucose to CO» involves the removal of 24 electrons—that 
is, it is a 24-electron oxidation. In glycolysis, 4 electrons are removed as NADH, and 
4 more exit as two more NADH in the decarboxylation of two molecules of pyruvate 
to two acetyl-CoA (Figure 19.1). For each acetyl-CoA oxidized in the TCA cycle, 
8 more electrons are removed (as three NADH and one FADH;)): 


H,CCOO- + 2H,O + Ht —>2CO, + 8H 


In the electron-transport pathway these 8 electrons combine with oxygen to form 
water: 


8H + 20, —> 4H,O 
So, the net reaction for the TCA cycle and electron transport pathway is 
H;CCOO™ + 20, + Ht —>2CO, + 2H»O 


As German biochemist Hans Krebs showed in the 1930s, the eight-electron oxi- 
dation of acetate by the TCA cycle is accomplished with the help of oxaloacetate. 
(In his honor, the TCA cycle is often referred to as the Krebs cycle.) Beginning with 
acetate, a series of five reactions produces two molecules of COs, with four electrons 
extracted in the form of NADH and four electrons passed to oxaloacetate to pro- 
duce a molecule of succinate. The pathway becomes a cycle by three additional re- 
actions that accomplish a four-electron oxidation of succinate back to oxaloacetate. 
This special trio of reactions is used repeatedly in metabolism: first, oxidation of a single bond 
to a double bond, then addition of the elements of water across the double bond, and finally 
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A time-lapse photograph of a ferris wheel at 
night. Aerobic cells use a metabolic wheel—the 
tricarboxylic acid cycle—to generate energy by 
acetyl-CoA oxidation. 





Thus times do shift, each thing his turn does 
hold; 
New things succeed, as former things grow old. 


Robert Herrick 
Hesperides (1648), “Ceremonies for Christmas Eve” 





KEY QUESTIONS 


19.1 What Is the Chemical Logic of the TCA 
Cycle? 





19.2 ow ls Pyruvate Oxidatively Decarboxylated 
o Acetyl-CoA? 


19.3 ow Are Two CO; Molecules Produced 
from Acetyl-CoA? 














19.4 ow Is Oxaloacetate Regenerated to 
Complete the TCA Cycle? 


19.5 What Are the Energetic Consequences 
of the TCA Cycle? 

19.6 Can the TCA Cycle Provide Intermediates for 
Biosynthesis? 

19.7 What Are the Anaplerotic, or “Filling Up,” 
Reactions? 

19.8 How Is the TCA Cycle Regulated? 


19.9 Can Any Organisms Use Acetate as Their 
Sole Carbon Source? 
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(a) Glycolysis 











(b) 








N FIGURE 19.1 (a) Pyruvate produced in glycolysis is oxidized in (b) the tricar- 
boxylic acid (TCA) cycle. (c) Electrons liberated in this oxidation flow through the 
electron-transport chain and drive the synthesis of ATP in oxidative phosphory- 
lation. In eukaryotic cells, this overall process occurs in mitochondria. 
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oxidation of the resulting alcohol to a carbonyl. We will see it again in fatty acid oxidation 
(see Chapter 23), in reverse in fatty acid synthesis (see Chapter 24), and in amino 
acid synthesis and breakdown (see Chapter 25). 


AOs What Is the Chemical Logic of the TCA Cycle? 


The entry of new carbon units into the cycle is through acetyl-CoA. This entry 
metabolite can be formed either from pyruvate (from glycolysis) or from oxidation 
of fatty acids (discussed in Chapter 23). Transfer of the two-carbon acetyl group 
from acetyl-CoA to the four-carbon oxaloacetate to yield six-carbon citrate is cat- 
alyzed by citrate synthase. A dehydration—rehydration rearrangement of citrate 
yields isocitrate. Two successive decarboxylations produce a-ketoglutarate and then 
succinyl-CoA, a CoA conjugate of a four-carbon unit. Several steps later, oxaloac- 
etate is regenerated and can combine with another two-carbon unit of acetyl-CoA. 
Thus, carbon enters the cycle as acetyl-CoA and exits as COs. In the process, meta- 
bolic energy is captured in the form of ATP, NADH, and enzyme-bound FADH» 
(symbolized as [FADH»]). 


The TCA Cycle Provides a Chemically Feasible Way of Cleaving 
a Two-Carbon Compound 


The cycle shown in Figure 19.2 at first appears to be a complicated way to oxidize 
acetate units to COs, but there is a chemical basis for the apparent complexity. 
Oxidation of an acetyl group to a pair of CO» molecules requires C—C cleavage: 


CH3sCOO —> CO, + CO2 
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In many instances, C—C cleavage reactions in biological systems occur between car- 
bon atoms « and £ to a carbonyl group: 


1 





C—C, —Cs 
t 


Cleavage 


A good example of such a cleavage is the fructose bisphosphate aldolase reaction 
(see Figure 18.12). 
Another common type of C—C cleavage is a-cleavage of an a-hydroxyketone: 


Cleavage 


(We see this type of cleavage in the transketolase reaction described in Chapter 22.) 

Neither of these cleavage strategies is suitable for acetate. It has no B-carbon, and 
the second method would require hydroxylation—not a favorable reaction for ac- 
etate. Instead, living things have evolved the clever chemistry of condensing acetate 
with oxaloacetate and then carrying out a B-cleavage. The TCA cycle combines this 
-cleavage reaction with oxidation to form COs, regenerate oxaloacetate, and cap- 
ture the liberated metabolic energy in NADH and ATP. 


19:25 How Is Pyruvate Oxidatively Decarboxylated 
to Acetyl-CoA? 


Pyruvate produced by glycolysis is a significant source of acetyl-CoA for the TCA cy- 
cle. Because, in eukaryotic cells, glycolysis occurs in the cytoplasm, whereas the TCA 
cycle reactions and all subsequent steps of aerobic metabolism take place in the mi- 
tochondria, pyruvate must first enter the mitochondria to enter the TCA cycle. The 
oxidative decarboxylation of pyruvate to acetyl-CoA 


Pyruvate + CoA + NAD*——> acetyl-CoA + CO, + NADH 


is the connecting link between glycolysis and the TCA cycle. The reaction is cat- 
alyzed by pyruvate dehydrogenase, a multienzyme complex. 

The pyruvate dehydrogenase complex (PDC) is formed from multiple copies 
of three enzymes: pyruvate dehydrogenase (E1), dihydrolipoamide acetyltransferase (E2), 
and dihydrolipoamide dehydrogenase (E3). All are involved in the conversion of pyru- 
vate to acetyl-CoA. The active sites of all three enzymes are not far removed from 
one another, and the product of the first enzyme is passed directly to the second 
enzyme, and so on, without diffusion of substrates and products through the 
solution. 

Eukaryotic PDC, one of the largest-known multienzyme complexes (with a di- 
ameter of approximately 500 A) is a 9.5-megadalton assembly organized around an 
icosahedral 60-mer of E2 subunits, with 30 El heterotetramers and 12 homodimers 
of E3 (Figure 19.3). Eukaryotic PDC also contains an E3-binding protein (E3BP) 
that is required to bind E3 to the PDC. Trimeric units of E2 form the 20 vertices of 
the icosahedron, with E3BP bound in each of the 12 faces. The E2 subunits each 
carry a lipoic acid moiety covalently linked to a lysine residue. Flexible linker seg- 
ments in E2 and E3BP impart the flexibility that allows the lipoic acid groups to visit 
all three active sites during catalysis. 

The pyruvate dehydrogenase reaction (Figure 19.4) is a tour de force of mecha- 
nistic chemistry, involving as it does a total of three enzymes and five different coen- 
zymes. The first step of this reaction, decarboxylation of pyruvate and transfer of the 
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FIGURE 19.3 icosahedral model of PDC core structure 
(E3 not shown). E1 subunits (yellow) are joined to the E2 
core (green) by linkers (blue). (Adapted from Zhou, Z.H., 
McCarthy, D. B., O'Connor, C. M, Reed, L. J., and Stoops, J. K., 2001. 
The remarkable structural and functional organization of the 
eukaryotic pyruvate dehydrogenase complexes. Proc Natl Acad 





Sci U S A 98:14802-14807. Figure courtesy of Z. Hong Zhou.) 
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FIGURE 19.4 The reaction mechanism of the pyruvate dehydrogenase complex. Decarboxylation of pyruvate 
occurs with formation of hydroxyethyl-TPP (step 1).Transfer of the two-carbon unit to lipoic acid in step 2 is fol- 
lowed by formation of acetyl-CoA in step 3. Lipoic acid is reoxidized in step 4 of the reaction. 


acetyl group to lipoic acid, depends on accumulation of negative charge on the 
transferred two-carbon fragment, as facilitated by the quaternary nitrogen on the 
thiazolium group of thiamine pyrophosphate (TPP). As shown in Figure 19.5, this 
cationic imine nitrogen plays two distinct roles in TPP-catalyzed reactions: 


1. It provides electrostatic stabilization of the thiazole carbanion formed upon re- 
moval of the C-2 proton. (The sf? hybridization and the availability of vacant 
d orbitals on the adjacent sulfur probably also facilitate proton removal at C-2.) 

2. TPP attack on pyruvate leads to decarboxylation. The TPP cationic imine nitro- 
gen can act as an effective electron sink to stabilize the negative charge that must 
develop on the carbon that has been attacked. This stabilization takes place by 
resonance interaction through the double bond to the nitrogen atom. 


This resonance-stabilized intermediate can be protonated to give hydroxyethyl- 
TPP. The reaction of hydroxyethyl-TPP with the oxidized form of lipoic acid yields 
the energy-rich acetyl-thiol ester of reduced lipoic acid through oxidation of the 
hydroxyl-carbon of the two-carbon substrate unit. Nucleophilic attack by CoA on 
the carbonyl carbon (a characteristic feature of CoA chemistry) results in transfer of 
the acetyl group from lipoic acid to CoA. The subsequent oxidation of lipoic acid is 
catalyzed by the FAD-dependent dihydrolipoyl dehydrogenase, and NAD* is reduced. 
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A DEEPER LOOK 
The Coenzymes of the Pyruvate Dehydrogenase Complex 


Coenzymes are small molecules that bring unique chemistry to en- 
zyme reactions. Five coenzymes are used in the pyruvate dehydro- 
genase reaction. 


Thiamine Pyrophosphate 
TPP assists in the decarboxylation of a-keto acids (here) and in 


the formation and cleavage of a-hydroxy ketones (as in the trans- 
ketolase reaction, see Chapter 22). 
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(a) The Flavin Coenzymes—FAD/FADH, 

Flavin coenzymes can exist in any of three redox states, and 
this allows flavin coenzymes to participate in one-electron 
transfer and two-electron transfer reactions. Partly because 
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A DEEPER LOOK 


The Coenzymes of the Pyruvate Dehydrogenase Complex (cont’d) 
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19:38 How Are Two CO, Molecules Produced from Acetyl-CoA? 
The Citrate Synthase Reaction Initiates the TCA Cycle 


The first reaction within the TCA cycle, the one by which carbon atoms are intro- 
duced, is the citrate synthase reaction (Figure 19.6). Here acetyl-CoA reacts with ox- 
aloacetate in a Perkin condensation (a carbon-carbon condensation between a ketone 
or aldehyde and an ester). The acyl group is activated in two ways in an acyl-CoA mol- 
ecule: The carbonyl carbon is activated for attack by nucleophiles, and the C, carbon 
is more acidic and can be deprotonated to form a carbanion. The citrate synthase re- 
action depends upon the latter mode of activation. As shown in Figure 19.6, a general 
base on the enzyme accepts a proton from the methyl group of acetyl-CoA, producing 
a stabilized a-carbanion of acetyl-CoA. This strong nucleophile attacks the a-carbonyl 
of oxaloacetate, yielding citryl-CoA. This part of the reaction has an equilibrium 
constant near 1, but the overall reaction is driven to completion by the subsequent 





Citryl-CoA 


Citrate 


FIGURE 19.5 The mechanistic details of the first three 
steps of the pyruvate dehydrogenase complex reaction. 
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Oxaloacetate 


FIGURE 19.6 Citrate is formed in the citrate synthase reaction from oxaloacetate and 
acetyl-CoA. The mechanism involves nucleophilic attack by the carbanion of acetyl-CoA on 
the carbonyl carbon of oxaloacetate, followed by thioester hydrolysis. 
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TABLE 19.1 

Reaction Enzyme (kJ/mol) (kJ/mol) 
1. Acetyl-CoA + oxaloacetate + HO == CoASH + citrate Citrate synthase —31.4 —53.9 
2. Citrate === isocitrate Aconitase +6.7 +0.8 
3. Isocitrate + NAD+ == a-ketoglutarate + NADH + CO, Isocitrate dehydrogenase —8.4 -17.5 
4. a-Ketoglutarate + CoASH + NAD* == succinyl-CoA + NADH + COs a-Ketoglutarate —30 —43.9 

dehydrogenase complex 

5. Succinyl-CoA + GDP + P; = = succinate + GTP + CoASH Succinyl-CoA synthetase —3.3 =0 
6. Succinate + [FAD] == fumarate + [FADH»] Succinate dehydrogenase +0.4 #0 
7. Fumarate + HO == L-malate Fumarase —3.8 =0 
8. L-Malate + NAD* == oxaloacetate + NADH + H* Malate dehydrogenase +29.7 =0 








AG values from Newsholme, E. A., and Leech, A. R., 1983. Biochemistry for the Medical Sciences. New York: Wiley. 


hydrolysis of the high-energy thioester to citrate and free CoA. The overall A G°’ is 
—31.4 kJ/mol, and under standard conditions the reaction is essentially irreversible. 
Although the mitochondrial concentration of oxaloacetate is very low (much less than 
1 w~M—see example in Section 19.4), the strong, negative AG?’ drives the reaction 
forward. 


Citrate Synthase Is a Dimer Citrate synthase in mammals is a dimer of 49-kD sub- 
units (Table 19.1). On each subunit, oxaloacetate and acetyl-CoA bind to the active 
site, which lies in a cleft between two domains and is surrounded mainly by a-helical 
segments (Figure 19.7). Binding of oxaloacetate induces a conformational change 
that facilitates the binding of acetyl-CoA and closes the active site so that the reactive 
carbanion of acetyl-CoA is protected from protonation by water. 





NADH Is an Allosteric Inhibitor of Citrate Synthase Citrate synthase is the first 
step in this metabolic pathway, and as stated the reaction has a large negative AG”. 
As might be expected, it is a highly regulated enzyme. NADH, a product of the TCA 
cycle, is an allosteric inhibitor of citrate synthase, as is succinyl-CoA, the product of 
the fifth step in the cycle (and an acetyl-CoA analog). 


CoASH 


Citrate Is Isomerized by Aconitase to Form Isocitrate 


Citrate itself poses a problem: It is a poor candidate for further oxidation because 
it contains a tertiary alcohol, which could be oxidized only by breaking a 
carbon-carbon bond. An obvious solution to this problem is to isomerize the ter- 
tiary alcohol to a secondary alcohol, which the cycle proceeds to do in the next step. 

Citrate is isomerized to isocitrate by aconitase in a two-step process involving 
aconitate as an intermediate (Figure 19.8). In this reaction, the elements of water 
are first abstracted from citrate to yield aconitate, which is then rehydrated with 
H— and HO— adding back in opposite positions to produce isocitrate. The net ef- 
fect is the conversion of a tertiary alcohol (citrate) to a secondary alcohol (iso- 
citrate). Oxidation of the secondary alcohol of isocitrate involves breakage of a 


FIGURE 19.7 Citrate synthase. In the monomer shown C—H bond, a simpler matter than the C—C cleavage required for the direct oxi- 


here, citrate is shown in blue, and CoA is red. (Top: pdb id dation of citrate. 
= 1CTS; bottom: pdb id = 2CTS.) 





Inspection of the citrate structure shows a total of four chemically equivalent hy- 
drogens, but only one of these—the pro-R H atom of the pro-R arm of citrate—is ab- 
stracted by aconitase, which is quite stereospecific. Formation of the double bond of 
aconitate following proton abstraction requires departure of hydroxide ion from the 
C-3 position. Hydroxide is a relatively poor leaving group, and its departure is facili- 
tated in the aconitase reaction by coordination with an iron atom in an iron-sulfur 
cluster. 
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(a) 
pro-Sarm HO HC —COO- 
vine &—coo- 

ae —COO- 


pro-Rarm HO 














Citrate cisAconitate Isocitrate 


Aconitase removes the pro-R H 
of the pro-R arm of citrate 


FIGURE 19.8 (a) The aconitase reaction converts citrate to cis-aconitate and then to isocitrate. Aconitase is 
stereospecific and removes the pro-R hydrogen from the pro-R arm of citrate. (b) The active site of aconitase. 
The iron-sulfur cluster (pink) is coordinated by cysteines (orange) and isocitrate (purple) (pdb id = 1B0)J). 





Aconitase Utilizes an Iron-Sulfur Cluster Aconitase contains an iron-sulfur 
cluster consisting of three iron atoms and four sulfur atoms in a near-cubic 
arrangement (Figure 19.9). Cysteine residues from the enzyme coordinate the 
three iron atoms. In the inactive state of the enzyme, one corner of the cube is va- 
cant. Binding of iron (as Fe?*) to this position activates aconitase. The iron atom 
in this position can coordinate the C-3 carboxyl and hydroxyl groups of citrate. 
This iron atom thus acts as a Lewis acid, accepting an unshared pair of electrons 
from the hydroxyl, making it a better leaving group. The equilibrium for the 
aconitase reaction favors citrate, and an equilibrium mixture typically contains 
about 90% citrate, 4% cis-aconitate, and 6% isocitrate. The A G°’ is +6.7 kJ/mol. 


Fluoroacetate Blocks the TCA Cycle Fluoroacetate is an extremely poisonous 
agent that blocks the TCA cycle in vivo, although it has no apparent effect on any of 
the isolated enzymes. Its LDso, the lethal dose for 50% of animals consuming it, is 
0.2 mg per kilogram of body weight; it has been used as a rodent poison. The action 
of fluoroacetate has been traced to aconitase, which is inhibited in vivo by fluoro- 
citrate, which is formed from fluoroacetate in two steps. 
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Fluoroacetate readily crosses both the cellular and the mitochondrial membranes, 
and in mitochondria it is converted to fluoroacetyl-CoA by acetyl-CoA synthetase. 
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CENGAGENOW” ACTIVE FIGURE 19.9 The iron-sulfur cluster of aconitase. Binding of Fe?* to the vacant position 
of the cluster activates aconitase. The added iron atom coordinates the C-3 carboxyl and hydroxyl groups of citrate 
and acts as a Lewis acid, accepting an electron pair from the hydroxyl group and making it a better leaving group. 
Test yourself on the concepts in this figure at www.cengage.com/login. 
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(a) 


NAD* 





Fluoroacetyl-CoA is a substrate for citrate synthase, which condenses it with oxalo- 
acetate to form fluorocitrate. Fluoroacetate may thus be viewed as a trojan horse 
inhibitor. Analogous to the giant Trojan horse of legend—which the soldiers of Troy 
took into their city, not knowing that Greek soldiers were hidden inside it and wait- 
ing to attack—fluoroacetate enters the TCA cycle innocently enough, in the citrate 
synthase reaction. Citrate synthase converts fluoroacetate to inhibitory fluorocitrate 
for its TCA cycle partner, aconitase, blocking the cycle. 


Isocitrate Dehydrogenase Catalyzes the First Oxidative 
Decarboxylation in the Cycle 


In the next step of the TCA cycle, isocitrate is oxidatively decarboxylated to yield 
a-ketoglutarate, with concomitant reduction of NAD* to NADH in the isocitrate de- 
hydrogenase reaction (Figure 19.10). The reaction has a net AG°’ of —8.4 kJ/mol, 
and it is sufficiently exergonic to pull the aconitase reaction forward. This two-step 
reaction involves (1) oxidation of the C-2 alcohol of isocitrate to form oxalosucci- 
nate, followed by (2) a -decarboxylation reaction that expels the central carboxyl 
group as COs, leaving the product a-ketoglutarate. Oxalosuccinate, the B-keto acid 
produced by the initial dehydrogenation reaction, is unstable and thus is readily 
decarboxylated. 


Isocitrate Dehydrogenase Links the TCA Cycle and Electron Transport Isocit- 
rate dehydrogenase provides the first connection between the TCA cycle and the 
electron-transport pathway and oxidative phosphorylation, via its production of 
NADH. As a connecting point between two metabolic pathways, isocitrate dehy- 
drogenase is a regulated reaction. NADH and ATP are allosteric inhibitors, 
whereas ADP acts as an allosteric activator, lowering the K,,, for isocitrate by a fac- 
tor of 10. The enzyme is virtually inactive in the absence of ADP. Also, the product, 
a-ketoglutarate, is a crucial a-keto acid for aminotransferase reactions (see Chap- 
ters 13 and 25), connecting the TCA cycle (that is, carbon metabolism) with ni- 
trogen metabolism. 


CENGAGENOW” ANIMATED FIGURE 19.10 (a) The isocitrate dehydrogenase reaction. 


recited (b) The active site of isocitrate dehydrogenase. |socitrate is shown in blue, NADP* (as an 
H—=C— COO NAD* analog) is shown in gold, with Ca?* in red (pdb id = 1Al2). See this figure ani- 
S | mated at www.cengage.com/login. 
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a-Ketoglutarate Dehydrogenase Catalyzes the Second Oxidative 
Decarboxylation of the TCA Cycle 


A second oxidative decarboxylation occurs in the a-ketoglutarate dehydrogenase 
reaction. 





NAD*+ CoASH NADH (G63 
a — COO~ Ba — COO 
HC H.C 
| > | 
y m OE] a-Ketoglutarate C — SCoA 
Y dehydrogenase a 
a-Ketoglutarate Succinyl-CoA 


Like the pyruvate dehydrogenase complex, a-ketoglutarate dehydrogenase is a mul- 
tienzyme complex—consisting of a-ketoglutarate dehydrogenase, dihydrolipoyl transsucciny- 
lase, and dihydrolipoyl dehydrogenase—that employs five different coenzymes (Table 
19.2). The dihydrolipoyl dehydrogenase in this reaction is identical to that in the pyru- 
vate dehydrogenase reaction. The mechanism is analogous to that of pyruvate dehy- 
drogenase. As with the pyruvate dehydrogenase reaction, this reaction produces 
NADH and a thioester product—in this case, succinyl-CoA. Succinyl-CoA and NADH 
products are energy-rich species that are important sources of metabolic energy in sub- 
sequent cellular processes. 


19:4» How Is Oxaloacetate Regenerated to Complete 
the TCA Cycle? 


Succinyl-CoA Synthetase Catalyzes Substrate-Level Phosphorylation 


The NADH produced in the foregoing steps can be routed through the electron- 
transport pathway to make high-energy phosphates via oxidative phosphorylation. 
However, succinyl-CoA is itself a high-energy intermediate and is utilized in the next 
step of the TCA cycle to drive the phosphorylation of GDP to GTP (in mammals) or 
ADP to ATP (in plants and bacteria). 


GDP +P, GTP + CoASH 
H,C — COO- i So 


| 
HəC HC — COOT 
l > | 





/ C— SCoA Succinyl-CoA HC — COO™ 
o synthetase 
Succinyl-CoA Succinate 


The reaction is catalyzed by succinyl-CoA synthetase, sometimes called succinate 
thiokinase. The free energies of hydrolysis of succinyl-CoA and GTP or ATP are 


Number of 
Enzyme Number of Subunit Subunits 
Enzyme Coenzyme M, Subunits M, per Complex 
a-Ketoglutarate dehydrogenase Thiamine pyrophosphate 192,000 2 96,000 24 
Dihydrolipoyl transsuccinylase Lipoic acid, CoASH 1,700,000 24 70,000 24 
Dihydrolipoyl dehydrogenase FAD, NAD* 112,000 2 56,000 12 







Condensation: A reaction between two or 
more molecules that results in formation of a 
larger molecule, with elimination of some simpler 
molecule, such as water (as in dehydration 
synthesis). 


Synthase: A condensation reaction that does 
not require a nucleoside triphosphate as an en- 
ergy source. 


Synthetase: A condensation reaction that re- 
quires a nucleoside triphosphate (often ATP) as 
an energy source. 
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CENGAGENOW" ACTIVE FIGURE 19.11 The mechanism 
of the succinyl-CoA synthetase reaction. Test yourself 
on the concepts in this figure at www.cengage.com/ 
login. 


similar, and the net reaction has a AG°’ of —3.3 kJ/mol. Succinyl-CoA synthetase 
provides another example of a substrate-level phosphorylation (see Chapter 18), in 
which a substrate, rather than an electron-transport chain or proton gradient, pro- 
vides the energy for phosphorylation. It is the only such reaction in the TCA cycle. 
The GTP produced by mammals in this reaction can exchange its terminal phos- 
phoryl group with ADP via the nucleoside diphosphate kinase reaction: 


Nucleoside diphosphate 
kinase 

GTP + ADP ATP + GDP 
The Mechanism of Succinyl-CoA Synthetase Involves a Phosphohistidine The 
mechanism of succinyl-CoA synthetase is postulated to involve displacement of CoA 
by phosphate, forming succinyl phosphate at the active site, followed by transfer of 
the phosphoryl group to an active-site histidine (making a phosphohistidine inter- 
mediate) and release of succinate. The phosphoryl moiety is then transferred to 
GDP to form GTP (Figure 19.11). This sequence of steps “preserves” the energy of 
the thioester bond of succinyl-CoA in a series of high-energy intermediates that lead 
to a molecule of ATP: 


Thioester—— [succinyl-P] —> [phosphohistidine] —> GTP—— ATP 


The First Five Steps of the TCA Cycle Produce NADH, CO,, GTP (ATP), and Succi- 
nate This is a good point to pause in our trip through the TCA cycle and see what 
has happened. A two-carbon acetyl group has been introduced as acetyl-CoA and 
linked to oxaloacetate, and two CO» molecules have been liberated. The cycle has 
produced two molecules of NADH and one of GTP or ATP and has left a molecule 
of succinate. 

The TCA cycle can now be completed by converting succinate to oxaloacetate. 
This latter process represents a net oxidation. The TCA cycle breaks it down into 
(consecutively) an oxidation step, a hydration reaction, and a second oxidation 
step. The oxidation steps are accompanied by the reduction of an [FAD] and an 
NAD*. The reduced coenzymes, [FADH2] and NADH, subsequently provide re- 
ducing power in the electron-transport chain. (It will be seen in Chapter 23 that vir- 
tually the same chemical strategy is used in B-oxidation of fatty acids.) 


Succinate Dehydrogenase Is FAD-Dependent 


The oxidation of succinate to fumarate is carried out by succinate dehydrogenase, 
a membrane-bound enzyme that is actually part of the electron-transport chain. 


coo- FAD FADH, 
| Ho 7 COO 
CHo C 
Succinate | | 
CH» dehydrogenase ` 
| ere N 
COO OOC H 
Succinate FAD FADH, Fumarate 


As will be seen in Chapter 20, succinate dehydrogenase is identical with the suc- 
cinate-coenzyme Q reductase of the electron-transport chain. In contrast with all 
of the other enzymes of the TCA cycle, which are soluble proteins found in the 
mitochondrial matrix, succinate dehydrogenase is an integral membrane protein 
tightly associated with the inner mitochondrial membrane. Succinate oxidation 
involves removal of H atoms across a C—C bond, rather than a C—O or C—N 
bond, and produces the trans-unsaturated fumarate. This reaction (the oxidation 
of an alkane to an alkene) is not sufficiently exergonic to reduce NAD*, but it does 
yield enough energy to reduce [FAD]. (By contrast, oxidations of alcohols to ke- 
tones or aldehydes are more energetically favorable and provide sufficient energy 
to reduce NAD+.) 


19.4 How Is Oxaloacetate Regenerated to Complete the TCA Cycle? 


Succinate dehydrogenase is a dimeric protein, with subunits of molecular masses 
70 and 27 kD. FAD is covalently bound to the larger subunit; the bond involves a 
methylene group of C-8a of FAD and N-3 of a histidine on the protein (Figure 
19.12). Succinate dehydrogenase also contains three different iron-sulfur clusters: 
a 4Fe-4S cluster, a 3Fe4S cluster, and a 2Fe-2S cluster, shown below. 


Cys _LYys 


ee a “oe 
cys e aT 
Viewed from either end of the succinate molecule, the reaction involves dehydro- 
genation q,ß to a carbonyl (actually, a carboxyl) group. The dehydrogenation is 
stereospecific, with the pro-S hydrogen removed from one carbon atom and the 
pro-R hydrogen removed from the other. The electrons captured by [FAD] in this 
reaction are passed directly into the iron-sulfur clusters of the enzyme and on to 
coenzyme Q (UQ). The covalently bound FAD is first reduced to [FADHg] and then 
reoxidized to form [FAD] and the reduced form of coenzyme Q, UQHs. Electrons 
captured by UQHs then flow through the rest of the electron-transport chain in a 
series of events that will be discussed in detail in Chapter 20. 
Note that flavin coenzymes can carry out either one-electron or two-electron 


transfers. The succinate dehydrogenase reaction represents a net two-electron re- 
duction of FAD. 


Fumarase Catalyzes the Trans-Hydration of Fumarate to Form L-Malate 


Fumarate is hydrated in a stereospecific reaction by fumarase to give L-malate. 





COO OH 

As 7 Hee | 
C H—C— COoo—- 
| | 
C Fumarase HC — COO™ 

= SN 
OOC H 
Fumarate L-Malate 


The reaction involves trans-addition of the elements of water across the double 
bond. Recall that aconitase carries out a similar reaction and that trans-addition of 
—H and —OH occurs across the double bond of cis-aconitate. Although the exact 
mechanism is uncertain, it may involve protonation of the double bond to form an 
intermediate carbonium ion (Figure 19.13) or possibly attack by water or OH™ an- 
ion to produce a carbanion, followed by protonation. 


Carbonium ion mechanism 
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Histidine FAD 

FIGURE 19.12 The covalent bond between FAD and 
succinate dehydrogenase involves the C-8a methylene 
group of FAD and the N-3 of a histidine residue on the 
enzyme. 


FIGURE 19.13 Two possible mechanisms for the 
fumarase reaction. 
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FIGURE 19.14 The active site of malate dehydrogenase. 
Malate is shown in yellow; NAD* is pink (pdb id = 
1EMD). 


CENGAGENOW” Go to CengageNOW and click 
Cengagelnteractive to understand the structure and 
function of malate dehydrogenase. 


Malate Dehydrogenase Completes the Cycle by Oxidizing 
Malate to Oxaloacetate 


In the last step of the TCA cycle, L-malate is oxidized to oxaloacetate by malate 
dehydrogenase. This reaction is very endergonic, with a A G°’ of +30 kJ/mol. 


NADH 
NAD+ + Ht 
i. K 
N 
H—C—coo- Malate C—COO- 
| dehydrogenase | 
HC — COO- H C—COO7 
NAD* NADH 
L-Malate + Ht Oxaloacetate 


Consequently, the concentration of oxaloacetate in the mitochondrial matrix is 
usually quite low. The reaction, however, is pulled forward by the favorable citrate 
synthase reaction. Oxidation of malate is coupled to reduction of yet another mol- 
ecule of NAD*, the third one of the cycle. Counting the [FAD] reduced by succi- 
nate dehydrogenase, this makes the fourth coenzyme reduced through oxidation 
of a single acetate unit. 

Malate dehydrogenase is structurally and functionally similar to other dehydro- 
genases, notably lactate dehydrogenase (Figure 19.14). Both consist of alternating 
B-sheet and a-helical segments. Binding of NAD* causes a conformational change in 
the 20-residue segment that connects the D and E strands of the 6-sheet. The change 
is triggered by an interaction between the adenosine phosphate moiety of NAD* and 
an arginine residue in this loop region. Such a conformational change is consistent 
with an ordered single-displacement mechanism for NAD*-dependent dehydroge- 
nases (see Chapter 13). 


19:5» What Are the Energetic Consequences of the TCA Cycle? 


The net reaction accomplished by the TCA cycle, as follows, shows two molecules of 
COs, one ATP, and four reduced coenzymes produced per acetate group oxidized. 
The cycle is exergonic, with a net AG°’ for one pass around the cycle of approxi- 
mately —40 kJ/mol. Table 19.1 compares the AG” values for the individual reac- 
tions with the overall A G°’ for the net reaction. 


Acetyl-CoA + 3 NAD+ + [FAD] + ADP + P; + 2 H¥O—> 
2 CO, + 3 NADH + 3 H* + [FADH.] + ATP + CoASH 
AG” = —40 kJ/mol 





Glucose metabolized via glycolysis produces two molecules of pyruvate and thus two 
molecules of acetyl-CoA, which can enter the TCA cycle. Combining glycolysis and 
the TCA cycle gives the net reaction shown: 


Glucose + 2 HO + 10 NAD* + 2 [FAD] + 4 ADP + 4 P;—> 
6 CO; + 10 NADH + 10 H* + 2 [FADH] + 4 ATP 


All six carbons of glucose are liberated as COs, and a total of four molecules of ATP 
are formed thus far in substrate-level phosphorylations. The 12 reduced coen- 
zymes produced up to this point can eventually produce as many as 34 molecules 
of ATP in the electron-transport and oxidative phosphorylation pathways. A stoi- 
chiometric relationship for these subsequent processes would be 


NADH + H+ + 40, + 3 ADP + 3 P;>NAD* + 3 ATP + 4 HO 
[FADH.] + +O. + 2 ADP + 2 P;—> [FAD] + 2 ATP + 3H.O 


Thus, a total of 3 ATP per NADH and 2 ATP per FADH» may be produced through 
the processes of electron-transport and oxidative phosphorylation. 


19.5 What Are the Energetic Consequences of the TCA Cycle? 
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A DEEPER LOOK 
Steric Preferences in NAD*-Dependent Dehydrogenases 
The enzymes that require nicotinamide coenzymes are stereospe- genase, the citric acid cycle enzyme, transfers hydride to the Hp 
cific and transfer hydride to either the pro-R or the pro-S positions position of NADH, but glyceraldehyde-3-P dehydrogenase in the 
selectively. glycolytic pathway transfers hydride to the Hs position, as shown in 
What accounts for this stereospecificity? It arises from the fact the accompanying figure. Dehydrogenases are stereospecific with 
that the enzymes (and especially the active sites of enzymes) arein- respect to the substrates as well. Note that alcohol dehydrogenase 
herently asymmetric structures. The nicotinamide coenzyme (and removes hydrogen from the pro-R position of ethanol and trans- 
the substrate) fit the active site in only one way. Malate dehydro- fers it to the pro-R position of NADH. 
i 
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The Carbon Atoms of Acetyl-CoA Have Different Fates in the TCA Cycle 


It is instructive to consider how the carbon atoms of a given acetate group are 
routed through several turns of the TCA cycle. As shown in Figure 19.15, neither 
of the carbon atoms of a labeled acetate unit is lost as CO» in the first turn of the 
cycle. The CO% evolved in any turn of the cycle derives from the carboxyl groups 
of the oxaloacetate acceptor (from the previous turn), not from incoming acetyl- 
CoA. On the other hand, succinate labeled on one end from the original labeled 
acetate forms two different labeled oxaloacetates. The carbonyl carbon of acetyl- 
CoA is evenly distributed between the two carboxyl carbons of oxaloacetate, and 
the labeled methyl carbon of incoming acetyl-CoA ends up evenly distributed be- 
tween the methylene and carbonyl carbons of oxaloacetate. 

When these labeled oxaloacetates enter a second turn of the cycle, both of the 
carboxyl carbons are lost as COs, but the methylene and carbonyl carbons survive 
through the second turn. Thus, the methyl carbon of a labeled acetyl-CoA survives 
two full turns of the cycle. In the third turn of the cycle, one-half of the carbon 
from the original methyl group of acetyl-CoA has become one of the carboxyl car- 
bons of oxaloacetate and is thus lost as COs. In the fourth turn of the cycle, further 
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CENGAGENOW” ACTIVE FIGURE 19.15 The fate of the carbon atoms of acetate in successive TCA cycles. 
Assume at the start, labeled acetate is added to cells containing unlabeled metabolites. (a) The carbonyl car- 
bon of acetyl-CoA is fully retained through one turn of the cycle but is lost completely in a second turn of 
the cycle. (b) The methyl carbon of a labeled acetyl-CoA survives two full turns of the cycle but becomes 
equally distributed among the four carbons of oxaloacetate by the end of the second turn. In each subse- 
quent turn of the cycle, one-half of this carbon (the original labeled methyl group) is lost. Test yourself on 
the concepts in this figure at www.cengage.com/login. 


19.6 Can the TCA Cycle Provide Intermediates for Biosynthesis? 581 


“scrambling” results in loss of half of the remaining labeled carbon (one-fourth of 
the original methyl carbon label of acetyl-CoA), and so on. 

It can be seen that the carbonyl and methyl carbons of labeled acetyl-CoA have 
very different fates in the TCA cycle. The carbonyl carbon survives the first turn in- 
tact but is completely lost in the second turn. The methyl carbon survives two full 
turns, then undergoes a 50% loss through each succeeding turn of the cycle. 

It is worth noting that the carbon-carbon bond cleaved in the TCA pathway en- 
tered as an acetate unit in the previous turn of the cycle. Thus, the oxidative de- 
carboxylations that cleave this bond are just a cleverly disguised acetate C—C cleav- 
age and oxidation. 


19:6: Can the TCA Cycle Provide Intermediates 
for Biosynthesis? 


Until now we have viewed the TCA cycle as a catabolic process because it oxidizes ac- 
etate units to CO, and converts the liberated energy to ATP and reduced coenzymes. 
The TCA cycle is, after all, the end point for breakdown of food materials, at least in 
terms of carbon turnover. However, as shown in Figure 19.16, four-, five-, and six- 
carbon species produced in the TCA cycle also fuel a variety of biosynthetic 
processes. a-Ketoglutarate, succinyl-CoA, fumarate, and oxaloacetate are all precur- 
sors of important cellular species. (In order to participate in eukaryotic biosynthetic 
processes, however, they must first be transported out of the mitochondria.) A 
transamination reaction converts a-ketoglutarate directly to glutamate, which can 
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FIGURE 19.16 The TCA cycle provides intermediates for 
numerous biosynthetic processes in the cell. Amino 
acids are highlighted in orange. 
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HUMAN BIOCHEMISTRY 


Mitochondrial Diseases Are Rare 


Diseases arising from defects in mitochondrial enzymes are quite 
rare, because major defects in the TCA cycle (and the respiratory 
chain) are incompatible with life and affected embryos rarely sur- 
vive to birth. Even so, about 150 different hereditary mitochon- 
drial diseases have been reported. Even though mitochondria 
carry their own DNA, many of the reported diseases map to the 
nuclear genome, because most of the mitochondrial proteins are 


An interesting disease linked to mitochondrial DNA mutations is 
that of Leber’s hereditary optic neuropathy (LHON), in which the 
genetic defects are located primarily in the mitochondrial DNA cod- 
ing for the subunits of NADH-—CoQ reductase, also known as Com- 
plex I of the electron-transport chain (see Chapter 20). Leber’s dis- 
ease is the most common form of blindness in otherwise healthy 
young men and occurs less often in women. 


imported from the cytosol. 











then serve as a versatile precursor for proline, arginine, and glutamine (as described 
in Chapter 25). Succinyl-CoA provides most of the carbon atoms of the porphyrins. 
Oxaloacetate can be transaminated to produce aspartate. Aspartic acid itself is a 
precursor of the pyrimidine nucleotides and, in addition, is a key precursor for the 
synthesis of asparagine, methionine, lysine, threonine, and isoleucine. Oxaloacetate 
can also be decarboxylated to yield PEP, which is a key element of several pathways, 
namely (1) synthesis (in plants and microorganisms) of the aromatic amino acids 
phenylalanine, tyrosine, and tryptophan; (2) formation of 3-phosphoglycerate and 
conversion to the amino acids serine, glycine, and cysteine; and (3) gluconeogene- 
sis, which, as we will see in Chapter 22, is the pathway that synthesizes new glucose 
and many other carbohydrates. 

Finally, citrate can be exported from the mitochondria and then broken down by 
ATP-citrate lyase to yield oxaloacetate and acetyl-CoA, a precursor of fatty acids. Oxalo- 
acetate produced in this reaction is rapidly reduced to malate, which can then be 
processed in either of two ways: It may be transported into mitochondria, where it is re- 
oxidized to oxaloacetate, or it may be oxidatively decarboxylated to pyruvate by malic 
enzyme, with subsequent mitochondrial uptake of pyruvate. This cycle permits citrate 
to provide acetyl-CoA for biosynthetic processes, with return of the malate and pyruvate 
by-products to the mitochondria. 


197s What Are the Anaplerotic, or “Filling Up,” Reactions? 


In a sort of reciprocal arrangement, the cell also feeds many intermediates back into 
the TCA cycle from other reactions. Because such reactions replenish the TCA cycle 
intermediates, Hans Kornberg proposed that they be called anaplerotic reactions (lit- 
erally, the “filling up” reactions). Thus, PEP carboxylase and pyruvate carboxylase 
synthesize oxaloacetate from pyruvate (Figure 19.17). 

Pyruvate carboxylase is the most important of the anaplerotic reactions. It ex- 
ists in the mitochondria of animal cells but not in plants, and it provides a direct 
link between glycolysis and the TCA cycle. The enzyme is tetrameric and contains 
covalently bound biotin and an Mg?* site on each subunit. (It is examined in 
greater detail in our discussion of gluconeogenesis in Chapter 22.) Pyruvate car- 
boxylase has an absolute allosteric requirement for acetyl-CoA. Thus, when acetyl- 
CoA levels exceed the oxaloacetate supply, allosteric activation of pyruvate car- 
boxylase by acetyl-CoA raises oxaloacetate levels, so the excess acetyl-CoA can 
enter the TCA cycle. 

PEP carboxylase occurs in yeast, bacteria, and higher plants, but not in animals. 
The enzyme is specifically inhibited by aspartate, which is produced by transami- 
nation of oxaloacetate. Thus, organisms utilizing this enzyme control aspartate 
production by regulation of PEP carboxylase. Malic enzyme is found in the cytosol 
or mitochondria of many animal and plant cells and is an NADPH-dependent 
enzyme. 
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FIGURE 19.17 Pyruvate carboxylase, phosphoenol- 
pyruvate (PEP) carboxylase, and malic enzyme catalyze 
anaplerotic reactions, replenishing TCA cycle inter- 
mediates. 





Fool’s Gold and the Reductive Citric Acid Cycle—The First Metabolic Pathway? 


How did life arise on the planet Earth? It was once supposed that a 
reducing atmosphere, together with random synthesis of organic 
compounds, gave rise to a prebiotic “soup,” in which the first living 
things appeared. However, certain key compounds, such as arginine, 
lysine, and histidine; the straight-chain fatty acids; porphyrins; and 
essential coenzymes, have not been convincingly synthesized under 
simulated prebiotic conditions. This and other problems have led re- 
searchers to consider other models for the evolution of life. 

One of these alternative models, postulated by Günter 
Wachtershauser, involves an archaic version of the TCA cycle 
running in the reverse (reductive) direction. Reversal of the TCA 
cycle results in assimilation of CO, and fixation of carbon as 
shown. For each turn of the reversed cycle, two carbons are fixed 
in the formation of isocitrate and two more are fixed in the re- 
ductive transformation of acetyl-CoA to oxaloacetate. Thus, for 
every succinate that enters the reversed cycle, two succinates are 
returned, making the cycle highly autocatalytic. Because TCA 
cycle intermediates are involved in many biosynthetic pathways 
(see Section 19.6), a reversed TCA cycle would be a bountiful 
and broad source of metabolic substrates. 

A reversed, reductive TCA cycle would require energy input to 
drive it. What might have been the thermodynamic driving force 
for such a cycle? Wachtershauser hypothesizes that the anaerobic 
reaction of FeS and HoS to form insoluble FeS (pyrite, also known 
as fool’s gold) in the prebiotic milieu could have been the driving 
reaction: 


FeS + HoS—> FeS, (pyrite)) + He 


This reaction is highly exergonic, with a standard-state free 
energy change (AG°’) of —38 kJ/mol. Under the conditions 
that might have existed in a prebiotic world, this reaction would 
have been sufficiently exergonic to drive the reductive steps of a 
reversed TCA cycle. 


Wachtershauser has also suggested that early metabolic processes 
first occurred on the surface of pyrite and other related mineral ma- 
terials. The iron-sulfur chemistry that prevailed on these mineral 
surfaces may have influenced the evolution of the iron-sulfur pro- 
teins that control and catalyze many reactions in modern pathways 
(including the succinate dehydrogenase and aconitase reactions of 
the TCA cycle). This reductive citric acid cycle has been shown to oc- 
cur in certain extant archaea and bacteria, where it serves all their 
carbon needs. 
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It is worth noting that the reaction catalyzed by PEP carboxykinase could also 
function as an anaplerotic reaction, were it not for the particular properties of 
the enzyme. 


2-0;PO GDP GTP o 
[eon $ e P E C—COO- 
~ 7 | 
CHy Hc — G68 
PEP Oxaloacetate 


CO» binds weakly to PEP carboxykinase, whereas oxaloacetate binds very tightly 
(Kp = 2 X 107° M), and, as a result, the enzyme favors formation of PEP from 
oxaloacetate. 

The catabolism of amino acids provides pyruvate, acetyl-CoA, oxaloacetate, 
fumarate, a-ketoglutarate, and succinate, all of which may be oxidized by the TCA 
cycle. In this way, proteins may serve as excellent sources of nutrient energy, as 
seen in Chapter 25. 


19:8» How Is the TCA Cycle Regulated? 


Situated as it is between glycolysis and the electron-transport chain, the TCA cycle must 
be carefully controlled. If the cycle were permitted to run unchecked, large amounts 
of metabolic energy could be wasted in overproduction of reduced coenzymes and 
ATP; conversely, if it ran too slowly, ATP would not be produced rapidly enough to sat- 
isfy the needs of the cell. Also, as just seen, the TCA cycle is an important source of pre- 
cursors for biosynthetic processes and must be able to provide them as needed. 

What are the sites of regulation in the TCA cycle? Based on our experience 
with glycolysis (see Figure 18.22), we might anticipate that some of the reactions 
of the TCA cycle would operate near equilibrium under cellular conditions (with 
AG<0), whereas others—the sites of regulation—would be characterized by 
large negative AG values. Estimates for the values of AG in mitochondria, based 
on mitochondrial concentrations of metabolites, are summarized in Table 19.1. 
Three reactions of the cycle—citrate synthase, isocitrate dehydrogenase, and 
a-ketoglutarate dehydrogenase—operate with large negative AG values under 
mitochondrial conditions and are thus the primary sites of regulation in the cycle. 

The regulatory actions that control the TCA cycle are shown in Figure 19.18. As 
one might expect, the principal regulatory “signals” are the concentrations of acetyl- 
CoA, ATP, NAD*, and NADH, with additional effects provided by several other 
metabolites. The main sites of regulation are pyruvate dehydrogenase, citrate syn- 
thase, isocitrate dehydrogenase, and a-ketoglutarate dehydrogenase. All of these en- 
zymes are inhibited by NADH, so when the cell has produced all the NADH that can 
conveniently be turned into ATP, the cycle shuts down. For similar reasons, ATP is 
an inhibitor of pyruvate dehydrogenase and isocitrate dehydrogenase. The TCA 
cycle is turned on, however, when either the ADP/ATP or NAD*/NADH ratio is 
high, an indication that the cell has run low on ATP or NADH. Regulation of the 
TCA cycle by NADH, NAD*, ATP, and ADP thus reflects the energy status of the cell. 
On the other hand, succinyl-CoA is an intracycle regulator, inhibiting citrate syn- 
thase and a-ketoglutarate dehydrogenase. Acetyl-CoA acts as a signal to the TCA 
cycle that glycolysis or fatty acid breakdown is producing two-carbon units. Acetyl- 
CoA activates pyruvate carboxylase, the anaplerotic reaction that provides oxaloac- 
etate, the acceptor for increased flux of acetyl-CoA into the TCA cycle. 


Pyruvate Dehydrogenase Is Regulated by Phosphorylation/ 
Dephosphorylation 


As we shall see in Chapter 22, most organisms can synthesize sugars such as glucose 
from pyruvate. However, animals cannot synthesize glucose from acetyl-CoA. For 
this reason, the pyruvate dehydrogenase complex, which converts pyruvate to 
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FIGURE 19.18 Regulation of the TCA cycle. 


acetyl-CoA, plays a pivotal role in metabolism. Conversion to acetyl-CoA commits 
nutrient carbon atoms either to oxidation in the TCA cycle or to fatty acid synthe- 
sis (see Chapter 24). Because this choice is so crucial to the organism, pyruvate de- 
hydrogenase is a carefully regulated enzyme. It is subject to product inhibition and 
is further regulated by nucleotides. Finally, activity of pyruvate dehydrogenase is 
regulated by phosphorylation and dephosphorylation of the enzyme complex itself. 

High levels of either product, acetyl-CoA or NADH, allosterically inhibit the 
pyruvate dehydrogenase complex. Acetyl-CoA specifically blocks dihydrolipoyl 
transacetylase, and NADH acts on dihydrolipoyl dehydrogenase. The mammalian 
pyruvate dehydrogenase is also regulated by covalent modifications. As shown in 
Figure 19.19, a Mg**-dependent pyruvate dehydrogenase kinase is associated with 
the enzyme in mammals. This kinase is allosterically activated by NADH and acetyl- 
CoA, and when levels of these metabolites rise in the mitochondrion, they stimulate 
phosphorylation of a serine residue on the pyruvate dehydrogenase subunit, block- 
ing the first step of the pyruvate dehydrogenase reaction, the decarboxylation of 
pyruvate. Inhibition of the dehydrogenase in this manner eventually lowers the lev- 
els of NADH and acetyl-CoA in the matrix of the mitochondrion. Reactivation of 
the enzyme is carried out by pyruvate dehydrogenase phosphatase, a Ca**-activated 
enzyme that binds to the dehydrogenase complex and hydrolyzes the phosphoser- 
ine moiety on the dehydrogenase subunit. At low ratios of NADH to NAD* and low 
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FIGURE 19.19 Regulation of the pyruvate dehydroge- 
nase reaction. 
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acetyl-CoA levels, the phosphatase maintains the dehydrogenase in an activated 
state, but a high level of acetyl-CoA or NADH once again activates the kinase and 
leads to the inhibition of the dehydrogenase. Insulin and Ca** ions activate de- 
phosphorylation, and pyruvate inhibits the phosphorylation reaction. 

Pyruvate dehydrogenase is also sensitive to the energy status of the cell. AMP ac- 
tivates pyruvate dehydrogenase, whereas GTP inhibits it. High levels of AMP are a 
sign that the cell may become energy-poor. Activation of pyruvate dehydrogenase 
under such conditions commits pyruvate to energy production. 


Isocitrate Dehydrogenase Is Strongly Regulated 


The mechanism of regulation of isocitrate dehydrogenase is in some respects the 
reverse of pyruvate dehydrogenase. The mammalian isocitrate dehydrogenase is 
subject only to allosteric activation by ADP and NAD* and to inhibition by ATP and 
NADH. Thus, high NAD*/NADH and ADP/ATP ratios stimulate isocitrate dehy- 
drogenase and TCA cycle activity. 

It may seem surprising that isocitrate dehydrogenase is strongly regulated, be- 
cause it is not an apparent branch point within the TCA cycle. However, the citrate/ 
isocitrate ratio controls the rate of production of cytosolic acetyl-CoA, because acetyl- 
CoA in the cytosol is derived from citrate exported from the mitochondrion. (Break- 
down of cytosolic citrate produces oxaloacetate and acetyl-CoA, which can be used 
in a variety of biosynthetic processes.) Thus, isocitrate dehydrogenase activity in the 
mitochondrion favors catabolic TCA cycle activity over anabolic utilization of acetyl- 
CoA in the cytosol. 

Interestingly, the Escherichia coli isocitrate dehydrogenase is regulated by cova- 
lent modification. Serine residues on each subunit of the dimeric enzyme are 
phosphorylated by a protein kinase, causing inhibition of the isocitrate dehydro- 
genase activity. Activity is restored by the action of a specific phosphatase. When 
TCA cycle and glycolytic intermediates—such as isocitrate, 3-phosphoglycerate, 
pyruvate, PEP, and oxaloacetate—are high, the kinase is inhibited, the phos- 
phatase is activated, and the TCA cycle operates normally. When levels of these 
intermediates fall, the kinase is activated, isocitrate dehydrogenase is inhibited, 
and isocitrate is diverted to the glyoxylate pathway, as explained in the next 
section. 


19.9 Can Any Organisms Use Acetate as Their Sole Carbon Source? 


19:9» Can Any Organisms Use Acetate as Their Sole 
Carbon Source? 


Plants (particularly seedlings, which cannot yet accomplish efficient photosynthe- 
sis), as well as some bacteria and algae, can use acetate as the only source of carbon 
for all the carbon compounds they produce. Although we saw that the TCA cycle 
can supply intermediates for some biosynthetic processes, the cycle gives off 2 CO% 
for every two-carbon acetate group that enters and cannot effect the net synthesis 
of TCA cycle intermediates. Thus, it would not be possible for the cycle to produce 
the massive amounts of biosynthetic intermediates needed for acetate-based growth 
unless alternative reactions were available. In essence, the TCA cycle is geared pri- 
marily to energy production, and it “wastes” carbon units by giving off CO». Modi- 
fication of the cycle to support acetate-based growth would require eliminating the 
CO>,-producing reactions and enhancing the net production of four-carbon units 
(that is, oxaloacetate). Plants and bacteria employ a modification of the TCA cycle 
called the glyoxylate cycle to produce four-carbon dicarboxylic acids (and eventu- 
ally even sugars) from two-carbon acetate units. The glyoxylate cycle bypasses the 
two oxidative decarboxylations of the TCA cycle and instead routes isocitrate 
through the isocitrate lyase and malate synthase reactions (Figure 19.20). Glyoxy- 
late produced by isocitrate lyase reacts with a second molecule of acetyl-CoA to form 
L-malate. The net effect is to conserve carbon units, using two acetyl-CoA molecules 
per cycle to generate oxaloacetate. Some of this is converted to PEP and then to glu- 
cose by pathways discussed in Chapter 22. 
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FIGURE 19.20 The glyoxylate cycle. The first two steps are 
identical to TCA cycle reactions. The third step bypasses 
the CO;-evolving steps of the TCA cycle to produce succi- 
nate and glyoxylate. The malate synthase reaction forms 
malate from glyoxylate and another acetyl-CoA. The re- 
sult is that one turn of the cycle consumes one oxalo- 
acetate and two acetyl-CoA molecules but produces two 
molecules of oxaloacetate. (Succinate produced in the 
isocitrate lyase reaction is converted to oxaloacetate by 
TCA cycle reactions.) The net for this cycle is one oxalo- 
acetate from two acetyl-CoA molecules. 
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FIGURE 19.21 The isocitrate lyase reaction. 
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The Glyoxylate Cycle Operates in Specialized Organelles 


The enzymes of the glyoxylate cycle in plants are contained in glyoxysomes, or- 
ganelles devoted to this cycle. Yeast and algae carry out the glyoxylate cycle in the 
cytoplasm. The enzymes common to both the TCA and glyoxylate pathways exist as 
isozymes, with spatially and functionally distinct enzymes operating independently 
in the two cycles. 


Isocitrate Lyase Short-Circuits the TCA Cycle by Producing 
Glyoxylate and Succinate 


The isocitrate lyase reaction (Figure 19.21) produces succinate, a four-carbon 
product of the cycle, as well as glyoxylate, which can then combine with a second 
molecule of acetyl-CoA. Isocitrate lyase catalyzes an aldol cleavage and is similar to 
the reaction mediated by aldolase in glycolysis. The malate synthase reaction (see 
Figure 19.20), a Claisen condensation of acetyl-CoA with the aldehyde of glyoxylate 
to yield malate, is quite similar to the citrate synthase reaction. Compared with the 
TCA cycle, the glyoxylate cycle (1) contains only five steps (as opposed to eight), 
(2) lacks the CO--liberating reactions, (3) consumes two molecules of acetyl-CoA 
per cycle, and (4) produces four-carbon units (oxaloacetate) as opposed to one- 
carbon units. 


The Glyoxylate Cycle Helps Plants Grow in the Dark 


The existence of the glyoxylate cycle explains how certain seeds grow underground 
(or in the dark), where photosynthesis is impossible. Many seeds (peanuts, soy- 
beans, and castor beans, for example) are rich in lipids, and as we will see in Chap- 
ter 23, most organisms degrade the fatty acids of lipids to acetyl-CoA. Glyoxysomes 
form in seeds as germination begins, and the glyoxylate cycle uses the acetyl-CoA 
produced in fatty acid oxidation to provide large amounts of oxaloacetate and 
other intermediates for carbohydrate synthesis. Once the growing plant begins 
photosynthesis and can fix CO, to produce carbohydrates (see Chapter 21), the 
glyoxysomes disappear. 


Glyoxysomes Must Borrow Three Reactions from Mitochondria 


Glyoxysomes do not contain all the enzymes needed to run the glyoxylate cycle: 
Succinate dehydrogenase, fumarase, and malate dehydrogenase are absent. Conse- 
quently, glyoxysomes must cooperate with mitochondria to run their cycle (Figure 
19.22). Succinate travels from the glyoxysomes to the mitochondria, where it is con- 
verted to oxaloacetate. Transamination to aspartate follows because oxaloacetate can- 
not be transported out of the mitochondria. Aspartate formed in this way then moves 
from the mitochondria back to the glyoxysomes, where a reverse transamination with 
a-ketoglutarate forms oxaloacetate, completing the shuttle. Finally, to balance the 
transaminations, glutamate shuttles from glyoxysomes to mitochondria. 
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FIGURE 19.22 Glyoxysomes lack three of the enzymes 
needed to run the glyoxylate cycle. Succinate dehy- 
drogenase, fumarase, and malate dehydrogenase are all 
“borrowed” from the mitochondria in a shuttle in which 
succinate and glutamate are passed to the mitochondria 
and a-ketoglutarate and aspartate are passed to the 
glyoxysome. 





The glycolytic pathway converts glucose to pyruvate and produces two 
molecules of ATP per glucose—only a small fraction of the potential 
energy available from glucose. In the presence of oxygen, pyruvate 
is oxidized to COs, releasing the rest of the energy available from 
glucose via the TCA cycle. 


19.1 What Is the Chemical Logic of the TCA Cycle? The entry of new 
carbon units into the cycle is through acetyl-CoA. Transfer of the two- 
carbon acetyl group from acetyl-CoA to the four-carbon oxaloacetate 
to yield six-carbon citrate is catalyzed by citrate synthase. A dehydra- 
tion-rehydration rearrangement of citrate yields isocitrate. Two suc- 
cessive decarboxylations produce a-ketoglutarate and then succinyl- 
CoA, a CoA conjugate of a four-carbon unit. Several steps later, 
oxaloacetate is regenerated and can combine with another two-carbon 
unit of acetyl-CoA. 


19.2 How Is Pyruvate Oxidatively Decarboxylated to Acetyl-CoA? 
The pyruvate dehydrogenase complex (PDC) is a noncovalent assembly 
of three different enzymes operating in concert to catalyze successive 
steps in the conversion of pyruvate to acetyl-CoA. 


19.3 How Are Two CO, Molecules Produced from Acetyl-CoA? Citrate 
synthase combines acetyl-CoA with oxaloacetate in a Perkin condensa- 
tion (a carbon-carbon condensation between a ketone or aldehyde and 
an ester). A general base on the enzyme accepts a proton from the 
methyl group of acetyl-CoA, producing a stabilized a-carbanion of acetyl- 
CoA. This strong nucleophile attacks the a-carbonyl of oxaloacetate, 
yielding citryl-CoA, which is hydrolyzed to citrate and CoASH. 

Citrate is isomerized to isocitrate by aconitase in a two-step process 
involving aconitate as an intermediate. The elements of water are first 


abstracted from citrate to yield aconitate, which is then rehydrated with 
H— and HO— adding back in opposite positions to produce isocitrate. 
The net effect is the conversion of a tertiary alcohol (citrate) to a sec- 
ondary alcohol (isocitrate). 

The two-step isocitrate dehydrogenase reaction involves (1) oxida- 
tion of the C-2 alcohol of isocitrate to form oxalosuccinate, followed by 
(2) a B-decarboxylation reaction that expels the central carboxyl group 
as COs, leaving the product a-ketoglutarate. Oxalosuccinate, the B-keto 
acid produced by the initial dehydrogenation reaction, is unstable and 
thus is readily decarboxylated. 

a-Ketoglutarate dehydrogenase is a multienzyme complex—consisting 
of a-ketoglutarate dehydrogenase, dihydrolipoyl transsuccinylase, and 
dihydrolipoyl dehydrogenase—that employs five different coenzymes. The 
dihydrolipoyl dehydrogenase in this reaction is identical to that in the 
pyruvate dehydrogenase reaction. The mechanism is an oxidative decar- 
boxylation analogous to that of pyruvate dehydrogenase. Succinyl-CoA is 
the product. 


19.4 How Is Oxaloacetate Regenerated to Complete the Cycle? 
Succinyl-CoA synthetase catalyzes a substrate-level phosphorylation: 
Succinyl-CoA is a high-energy intermediate and is used to drive the 
phosphorylation of GDP to GTP (in mammals) or ADP to ATP (in 
plants and bacteria). 

Succinate dehydrogenase (succinate-coenzyme Q reductase of the 
electron-transport chain) catalyzes removal of H atoms across a C—C 
bond and produces the trans-unsaturated fumarate. 

Fumarate is hydrated in a stereospecific reaction by fumarase to give 
L-malate. The reaction involves trans-addition of the elements of water 
across the double bond. 
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Malate dehydrogenase completes the TCA cycle. This reaction is very 
endergonic, with a AG” of +30 kJ/mol. Consequently, the concentra- 
tion of oxaloacetate in the mitochondrial matrix is usually quite low. Ox- 
idation of malate to oxaloacetate is coupled to reduction of yet another 
molecule of NAD*, the third one of the cycle. 


19.5 What Are the Energetic Consequences of the TCA Cycle? The 
cycle is exergonic, with a net AG” for one pass around the cycle of 
approximately —40 kJ/mol. Three NADH, one [FADH»], and one ATP 
equivalent are produced in each turn of the cycle. 


19.6 Can the TCA Cycle Provide Intermediates for Biosynthesis? 
a-Ketoglutarate, succinyl-CoA, fumarate, and oxaloacetate are all pre- 
cursors of important cellular species. A transamination reaction con- 
verts a-ketoglutarate directly to glutamate, which can then serve as a 
precursor for proline, arginine, and glutamine. Succinyl-CoA provides 
most of the carbon atoms of the porphyrins. Oxaloacetate can be 
transaminated to produce aspartate. Aspartic acid itself is a precursor of 
the pyrimidine nucleotides and, in addition, is a key precursor for the 
synthesis of asparagine, methionine, lysine, threonine, and isoleucine. 
Oxaloacetate can also be decarboxylated to yield PEP, which is a key 
element of several pathways. 


19.7 What Are the Anaplerotic, or “Filling Up,” Reactions? Ana- 
plerotic reactions replenish the TCA cycle intermediates. Examples in- 


clude PEP carboxylase and pyruvate carboxylase, both of which synthe- 
size oxaloacetate from pyruvate. 


19.8 How Is the TCA Cycle Regulated? The main sites of regulation 
are pyruvate dehydrogenase, citrate synthase, isocitrate dehydrogenase, 
and a-ketoglutarate dehydrogenase. All of these enzymes are inhibited 
by NADH. ATP is an inhibitor of pyruvate dehydrogenase and isocitrate 
dehydrogenase. The TCA cycle is turned on, however, when either the 
ADP/ATP or NAD*/NADH ratio is high. Regulation of the TCA cycle 
by NADH, NAD*, ATP, and ADP thus reflects the energy status of the 
cell. Succinyl-CoA is an intracycle regulator, inhibiting citrate synthase 
and a-ketoglutarate dehydrogenase. Acetyl-CoA activates pyruvate car- 
boxylase, the anaplerotic reaction that provides oxaloacetate, the ac- 
ceptor for acetyl-CoA entry into the TCA cycle. 


19.9 Can Any Organisms Use Acetate as Their Sole Carbon Source? 
Plants and bacteria employ a modification of the TCA cycle called the 
glyoxylate cycle to produce four-carbon dicarboxylic acids (and eventu- 
ally even sugars) from two-carbon acetate units. The glyoxylate cycle by- 
passes the two oxidative decarboxylations of the TCA cycle and instead 
routes isocitrate through the isocitrate lyase and malate synthase reac- 
tions. Glyoxylate produced by isocitrate lyase reacts with a second mol- 
ecule of acetyl-CoA to form L-malate. The net effect is to conserve 
carbon units, using two acetyl-CoA molecules per cycle to generate 
oxaloacetate. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. Describe the labeling pattern that would result from the introduc- 
tion into the TCA cycle of glutamate labeled at C, with MC. 

2. Describe the effect on the TCA cycle of (a) increasing the con- 
centration of NAD*, (b) reducing the concentration of ATP, and 
(c) increasing the concentration of isocitrate. 

3. (Integrates with Chapter 15.) The serine residue of isocitrate dehy- 
drogenase that is phosphorylated by protein kinase lies within the 
active site of the enzyme. This situation contrasts with most other 
examples of covalent modification by protein phosphorylation, 
where the phosphorylation occurs at a site remote from the active 
site. What direct effect do you think such active-site phosphoryla- 
tion might have on the catalytic activity of isocitrate dehydrogenase? 
(See Barford, D., 1991. Molecular mechanisms for the control of 
enzymic activity by protein phosphorylation. Biochimica et Biophysica 
Acta 1133:55-62.) 

4. The first step of the a-ketoglutarate dehydrogenase reaction involves 
decarboxylation of the substrate and leaves a covalent TPP interme- 
diate. Write a reasonable mechanism for this reaction. 

5. In a tissue where the TCA cycle has been inhibited by fluoroacetate, 
what difference in the concentration of each TCA cycle metabolite 
would you expect, compared with a normal, uninhibited tissue? 

6. On the basis of the descriptions of the physical properties of FAD 
and FADH,, suggest a method for the measurement of the enzyme 
activity of succinate dehydrogenase. 

7. Starting with citrate, isocitrate, a-ketoglutarate, and succinate, state 
which of the individual carbons of the molecule undergo oxidation 
in the next step of the TCA cycle. Which molecules undergo a net 
oxidation? 

8. In addition to fluoroacetate, consider whether other analogs of 
TCA cycle metabolites or intermediates might be introduced to in- 
hibit other, specific reactions of the cycle. Explain your reasoning. 

9. Based on the action of thiamine pyrophosphate in catalysis of the 
pyruvate dehydrogenase reaction, suggest a suitable chemical 
mechanism for the pyruvate decarboxylase reaction in yeast: 


Pyruvate —> acetaldehyde + COg 


10. (Integrates with Chapter 3.) Aconitase catalyzes the citric acid cycle 
reaction: 


Citrate —— isocitrate 





The standard free energy change, AG°’, for this reaction is +6.7 
kJ/mol. However, the observed free energy change (AG) for this re- 
action in pig heart mitochondria is +0.8 kJ/mol. What is the ratio 
of [isocitrate]/[citrate] in these mitochondria? If [isocitrate] = 
0.03 mM, what is [citrate]? 


11. Describe the labeling pattern that would result if ‘CO, were incor- 
porated into the TCA cycle via the pyruvate carboxylase reaction. 


12. Describe the labeling pattern that would result if the reductive, 
reversed TCA cycle (see A Deeper Look on page 583) operated 
with !COs. 

13. Describe the labeling pattern that would result in the glyoxylate 
cycle if a plant were fed acetyl-CoA labeled at the —CHs carbon. 


14, The malate synthase reaction, which produces malate from acetyl- 
CoA and glyoxylate in the glyoxylate pathway, involves chemistry sim- 
ilar to the citrate synthase reaction. Write a mechanism for the malate 
synthase reaction and explain the role of CoA in this reaction. 


15. In most cells, fatty acids are synthesized from acetate units in the cy- 
tosol. However, the primary source of acetate units is the TCA cycle 
in mitochondria, and acetate cannot be transported directly from the 
mitochondria to the cytosol. Cells solve this problem by exporting cit- 
rate from the mitochondria and then converting citrate to acetate 
and oxaloacetate. Then, because cells cannot transport oxaloacetate 
into mitochondria directly, they must convert it to malate or pyruvate, 
both of which can be taken up by mitochondria. Draw a complete 
pathway for citrate export, conversion of citrate to malate and pyru- 
vate, and import of malate and pyruvate by mitochondria. 

a. Which of the reactions in this cycle might require energy input? 
b. What would be the most likely source of this energy? 

c. Do you recognize any of the enzyme reactions in this cycle? 

d. What coenzymes might be required to run this cycle? 

16. A typical intramitochondrial concentration of malate is 0.22 mM. If 
the ratio of NAD* to NADH in mitochondria is 20, and if the malate 
dehydrogenase reaction is at equilibrium, calculate the concentra- 
tion of oxaloacetate in the mitochondrion at 20°C. A typical mito- 


chondrion can be thought of as a cylinder 1 wm in diameter and 
2 um in length. Calculate the number of molecules of oxaloacetate 
in a mitochondrion. In analogy with pH (the negative logarithm of 
[H*]), what is pOAA? 

17. Glycolysis, the pyruvate dehydrogenase reaction, and the TCA cycle 
result in complete oxidation of a molecule of glucose to COs. Re- 
view the calculation of oxidation numbers for individual atoms in 
any molecule, and then calculate the oxidation numbers of the car- 
bons of glucose, pyruvate, the acetyl carbons of acetyl-CoA, and the 
metabolites of the TCA cycle to convince yourself that complete ox- 
idation of glucose involves removal of 24 electrons and that each 
acetyl-CoA through the TCA cycle gives up 8 electrons. 


18. Recalling all reactions of the TCA cycle can be a challenging propo- 
sition. One way to remember these is to begin with the simplest mol- 
ecule—succinate, which is a symmetric four-carbon molecule. Be- 
gin with succinate, and draw the eight reactions of the TCA cycle. 
Remember that succinate —> oxaloacetate is accomplished by a 
special trio of reactions: oxidation of a single bond to a double 
bond, hydration across the double bond, and oxidation of an alco- 
hol to a ketone. From there, a molecule of acetyl-CoA is added. If 
you remember the special function of acetyl-CoA (see A Deeper 
Look, page 570), this is an easy reaction to draw. From there, you 
need only isomerize, carry out the two oxidative decarboxylations, 
and remove the CoA molecule to return to succinate. 


19. Aconitase is rapidly inactivated by 2R, 3R-fluorocitrate, which is pro- 
duced from fluoroacetate in the citrate synthase reaction. Interest- 
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ingly, inactivation by fluorocitrate is accompanied by stoichiometric 
release of fluoride ion (i.e., one F-ion is lost per aconitase active 
site). This observation is consistent with “mechanism-based inacti- 
vation” of aconitase by fluorocitrate. Suggest a mechanism for this 
inactivation, based on formation of 4-hydroxy-trans-aconitate, which 
remains tightly bound at the active site. To assess your answer, con- 
sult this reference: Lauble, H., Kennedy, M., et al., 1996. The reac- 
tion of fluorocitrate with aconitase and the crystal structure of the 
enzyme-inhibitor complex. Proceedings of the National Academy of Sci- 
ences 93:13699-13703. 


Preparing for the MCAT Exam 

20. Complete oxidation of a 16-carbon fatty acid can yield 129 mol- 
ecules of ATP. Study Figure 19.2 and determine how many ATP 
molecules would be generated if a 16-carbon fatty acid were 
metabolized solely by the TCA cycle, in the form of 8 acetyl-CoA 
molecules. 

21. Study Figure 19.18 and decide which of the following statements is 
false? 

. Pyruvate dehydrogenase is inhibited by NADH. 

. Pyruvate dehydrogenase is inhibited by ATP. 

. Citrate synthase is inhibited by NADH. 

. Succinyl-CoA activates citrate synthase. 

. Acetyl-CoA activates pyruvate carboxylase. 


can Tp 
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Wall Piece #IV (1985), a kinetic sculpture by George 
Rhoads. This complex mechanical art form can be 
viewed as a metaphor for the molecular apparatus 
underlying electron transport and ATP synthesis by 
oxidative phosphorylation. 


George Rhoads/Rock Stream Studios 





In all things of nature there is something of the 
marvelous. 


Aristotle (384-322 B.C.) 
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Electron Transport and 
Oxidative Phosphorylation 


ESSENTIAL QUESTION 


Living cells save up metabolic energy predominantly in the form of fats and carbohy- 
drates, and they “spend” this energy for biosynthesis, membrane transport, and move- 
ment. In both directions, energy is exchanged and transferred in the form of ATP. In 
Chapters 18 and 19 we saw that glycolysis and the TCA cycle convert some of the en- 
ergy available from stored and dietary sugars directly to ATP. However, most of the 
metabolic energy that is obtainable from substrates entering glycolysis and the TCA 
cycle is funneled via oxidation-reduction reactions into NADH and reduced flavopro- 
teins, the latter symbolized by [FADH,]. 


How do cells oxidize NADH and [FADH,] and convert their reducing potential 
into the chemical energy of ATP? 


Whereas ATP made in glycolysis and the TCA cycle is the result of substrate- 
level phosphorylation, NADH-dependent ATP synthesis is the result of oxidative 
phosphorylation. Electrons stored in the form of the reduced coenzymes, NADH or 
[FADHg], are passed through an elaborate and highly organized chain of proteins 
and coenzymes, the so-called electron-transport chain, finally reaching O (molec- 
ular oxygen), the terminal electron acceptor. Each component of the chain can ex- 
ist in (at least) two oxidation states, and each component is successively reduced 
and reoxidized as electrons move through the chain from NADH (or [FADHg9]) to 
Og. In the course of electron transport, a proton gradient is established across the 
inner mitochondrial membrane. It is the energy of this proton gradient that drives 
ATP synthesis. 


20x1 Where in the Cell Do Electron Transport and Oxidative 
Phosphorylation Occur? 


The processes of electron transport and oxidative phosphorylation are membrane 
associated. Prokaryotes are the simplest life form, and prokaryotic cells typically 
consist of a single cellular compartment surrounded by a plasma membrane and a 
more rigid cell wall. In such a system, the conversion of energy from NADH and 
[FADHg] to the energy of ATP via electron transport and oxidative phosphorylation 
is carried out at (and across) the plasma membrane. 

In eukaryotic cells, electron transport and oxidative phosphorylation are localized 
in mitochondria, which are also the sites of TCA cycle activity and (as we shall see in 
Chapter 23) fatty acid oxidation. Mammalian cells contain 800 to 2500 mitochondria; 
other types of cells may have as few as one or two or as many as half a million mito- 
chondria. Human erythrocytes, whose purpose is simply to transport oxygen to tis- 
sues, contain no mitochondria at all. The typical mitochondrion is about 0.5 + 0.3 mi- 
cron in diameter and from 0.5 micron to several microns long; its overall shape is 
sensitive to metabolic conditions in the cell. 


Mitochondrial Functions Are Localized in Specific Compartments 


Mitochondria are surrounded by a simple outer membrane and a more complex 
inner membrane (Figure 20.1). The space between the inner and outer membranes 
is referred to as the intermembrane space. Several enzymes that utilize ATP (such 
as creatine kinase and adenylate kinase) are found in the intermembrane space. 


20.2 What Are Reduction Potentials, and How Are They Used to Account for Free Energy Changes in Redox Reactions? 


Outer membrane 






Inner membrane 


Intermembrane 


Cristae 


(a) (b) 


FIGURE 20.1 (a) A drawing of a mitochondrion with components labeled. (b) Tomography of a rat liver mito- 
chondrion. The tubular structures in red, yellow, green, purple, and aqua represent individual cristae formed from 
the inner mitochondrial membrane. (b, Frey, T.G., and Mannella, C. A., 2000.The internal structure of mitochondria. Trends in 
Biochemical Sciences 25:319-324.) 


The smooth outer membrane is about 30% to 40% lipid and 60% to 70% protein 
and has a relatively high concentration of phosphatidylinositol. The outer mem- 
brane contains significant amounts of porin—a transmembrane protein, rich in 
B-sheets, that forms large channels across the membrane, permitting free diffusion 
of molecules with molecular weights of about 10,000 or less. The outer membrane 
plays a prominent role in maintaining the shape of the mitochondrion. The inner 
membrane is richly packed with proteins, which account for nearly 80% of its 
weight; thus, its density is higher than that of the outer membrane. The fatty acids 
of inner membrane lipids are highly unsaturated. Cardiolipin and diphosphatidyl- 
glycerol (see Chapter 8) are abundant. The inner membrane lacks cholesterol and 
is quite impermeable to molecules and ions. Species that must cross the mitochon- 
drial inner membrane—ions, substrates, fatty acids for oxidation, and so on—are 
carried by specific transport proteins in the membrane. Notably, the inner mem- 
brane is extensively folded (Figure 20.1). The folds, known as cristae, provide the 
inner membrane with a large surface area in a small volume. During periods of ac- 
tive respiration, the inner membrane appears to shrink significantly, leaving a com- 
paratively large intermembrane space. 


The Mitochondrial Matrix Contains the Enzymes of the TCA Cycle 


The space inside the inner mitochondrial membrane is called the matrix, and it 
contains most of the enzymes of the TCA cycle and fatty acid oxidation. (An im- 
portant exception, succinate dehydrogenase of the TCA cycle, is located in the in- 
ner membrane itself.) In addition, mitochondria contain circular DNA molecules, 
along with ribosomes and the enzymes required to synthesize proteins coded within 
the mitochondrial genome. Although some of the mitochondrial proteins are made 
this way, most are encoded by nuclear DNA and synthesized by cytosolic ribosomes. 


20:25 What Are Reduction Potentials, and How Are They Used 
to Account for Free Energy Changes in Redox Reactions? 


On numerous occasions in earlier chapters, we have stressed that NADH and reduced 
flavoproteins ([FADHp»]) are forms of metabolic energy. These reduced coenzymes 
have a strong tendency to be oxidized—that is, to transfer electrons to other species. 
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CENGAGENOW” ACTIVE FIGURE 20.2 Experimental 
apparatus used to measure the standard reduction 
potential of the indicated redox couples: (a) the 
acetaldehyde/ethanol couple, (b) the fumarate/ 
succinate couple. Test yourself on the concepts in 
this figure at www.cengage.com/login. 


Oxidative phosphorylation converts the energy of electron transfer into the energy of 
phosphoryl transfer stored in the phosphoric anhydride bonds of ATP. Just as the 
group transfer potential was used in Chapter 3 to quantitate the energy of phospho- 
ryl transfer, the standard reduction potential, denoted by @,’, quantitates the ten- 
dency of chemical species to be reduced or oxidized. The standard reduction poten- 
tial difference describing electron transfer between two species, 


Reduced donor > C Oxidized acceptor 
ne 


Oxidized donor Reduced acceptor 


is related to the free energy change for the process by 
AG” = -nFAE' (20.2) 


where n represents the number of electrons transferred; F is Faraday’s constant, 
96,485 J/V- mol; and AZ,’ is the difference in reduction potentials between the 
donor and acceptor. This relationship is straightforward, but it depends on a stan- 
dard of reference by which reduction potentials are defined. 


Standard Reduction Potentials Are Measured in Reaction Half-Cells 


Standard reduction potentials are determined by measuring the voltages generated 
in reaction half-cells (Figure 20.2). A half-cell consists of a solution containing 1 M 
concentrations of both the oxidized and reduced forms of the substance whose re- 
duction potential is being measured and a simple electrode. (Together, the oxidized 
and reduced forms of the substance are referred to as a redox couple.) Such a 
sample half-cell is connected to a reference half-cell and electrode via a conductive 
bridge (usually a salt-containing agar gel). A sensitive potentiometer (voltmeter) con- 
nects the two electrodes so that the electrical potential (voltage) between them can 
be measured. The reference half-cell normally contains 1 M H* in equilibrium with 
Hy gas at a pressure of 1 atm. The H*/Hp reference half-cell is arbitrarily assigned a 
standard reduction potential of 0.0 V. The standard reduction potentials of all other 
redox couples are defined relative to the H*/Hs reference half-cell on the basis of the 
sign and magnitude of the voltage (electromotive force, emf) registered on the po- 
tentiometer (Figure 20.2). 

If electron flow between the electrodes is toward the sample half-cell, reduction 
occurs spontaneously in the sample half-cell and the reduction potential is said to 
be positive. If electron flow between the electrodes is away from the sample half-cell 
and toward the reference cell, the reduction potential is said to be negative because 
electron loss (oxidation) is occurring in the sample half-cell. Strictly speaking, the 
standard reduction potential, €,’, is the electromotive force generated at 25°C and 
pH 7.0 by a sample half-cell (containing 1 M concentrations of the oxidized and re- 
duced species) with respect to a reference half-cell. (Note that the reduction po- 
tential of the hydrogen half-cell is pH-dependent. The standard reduction poten- 
tial, 0.0 V, assumes 1 M H*. The hydrogen half-cell measured at pH 7.0 has an €,’ 
of —0.421 V.) 


Two Examples Figure 20.2a shows a sample/reference half-cell pair for measure- 
ment of the standard reduction potential of the acetaldehyde/ethanol couple. Be- 
cause electrons flow toward the reference half-cell and away from the sample half-cell, 
the standard reduction potential is negative, specifically —0.197 V. In contrast, the 
fumarate/succinate couple (Figure 20.2b) causes electrons to flow from the reference 
half-cell to the sample half-cell; that is, reduction occurs, and the reduction potential 
is thus positive. For each halfcell, a half-cell reaction describes the reaction taking 
place. For the fumarate/succinate half-cell coupled to a H* /Hg reference half-cell, the 
reaction occurring is indeed a reduction of fumarate: 


Fumarate + 2 H* + 2 e ——> succinate o = +0.031 V (20.3) 
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However, the reaction occurring in the acetaldehyde/ethanol half-cell is the oxida- 
tion of ethanol: 


Ethanol—> acetaldehyde + 2 H? + 2 e Eo = —0.197 V (20.4) 


E. Values Can Be Used to Predict the Direction of Redox Reactions 


Some typical half-cell reactions and their respective standard reduction potentials are 
listed in Table 20.1. Whenever reactions of this type are tabulated, they are uniformly 
written as reduction reactions, regardless of what occurs in the given half-cell. The sign 
of the standard reduction potential indicates which reaction really occurs when the 
given half-cell is combined with the reference hydrogen half-cell. Redox couples that 
























TABLE 20.1 f E 
Reduction Half-Reaction Eo (V) 
50, + 2 H* + 2 e —> HO 0.816 
Fe?* + o€ —> Fe?t 0.771 
Photosystem P700 0.430 
NO; + 2 H* +2 € —> NO7 + HO 0.421 
Cytochrome f (Fe**) + © —> cytochrome f(Fe?*) 0.365 
Cytochrome as(Fe?t) + © —> cytochrome a3(Fe?*) 0.350 
Cytochrome a(Fe**) + © —> cytochrome a(Fe?*) 0.290 
Rieske Fe-S(Fe**) + & —> Rieske Fe-S(Fe?*) 0.280 
Cytochrome c (Fe?t) + © —> cytochrome c (Fe?*) 0.254 
Cytochrome q (Fet) + © —> cytochrome q(Fe°+) 0.220 
UQH -: + H* + & —> UQR;: (UQ = coenzyme Q) 0.190 
UQ + 2 H+ +2¢—>UQH 0.060 
Cytochrome bj,(Fe**) + © —> cytochrome by(Fe°*) 0.050 
Fumarate + 2 Ht + 2 € —> succinate 0.031 
UQ + Ht + e — UQH - 0.030 
Cytochrome b;(Fe**) + © —> cytochrome b; (Fe?*) 0.020 
[FAD] + 2 H* + 2 & —> [FADH]}:] 0.003-0.091* 
Cytochrome b,(Fe**) + © —> cytochrome b,(Fe?*) —0.100 
Oxaloacetate + 2 Ht + 2 © —> malate —0.166 
Pyruvate + 2 Ht + 2 & —> lactate —0.185 
Acetaldehyde + 2 H* + 2 © —> ethanol —0.197 
FMN + 2 Ht + 2 € —> FMNH: —0.219 
FAD + 2 H* + 2 € —> FADH: —0.219 
Glutathione (oxidized) + 2 Ht + 2 © —> 2 glutathione (reduced) —0.230 
Lipoic acid + 2 Ht + 2 © —> dihydrolipoic acid —0.290 
1,3-Bisphosphoglycerate + 2 H* + 2 € —> —0.290 
glyceraldehyde-3-phosphate + P; 
NAD* + 2 H? + 2 € —> NADH + Ht —0.320 
NADP* + 2 H* + 2 € —~ NADPH + H* —0.320 
Lipoyl dehydrogenase [FAD] + 2 H* + 2 € —> —0.340 
lipoyl dehydrogenase [FADH»] 

a-Ketoglutarate + CO», + 2 H* + 2 © —> isocitrate —0.380 
2H*+2¢ —> Ho —0.421 
Ferredoxin (spinach) (Fet) + © —— ferredoxin (spinach) (Fe?*) —0.430 
Succinate + CO, + 2 H+ + 2 & —> a-ketoglutarate + HO —0.670 











*Typical values for reduction of bound FAD in flavoproteins such as succinate dehydrogenase (see Bonomi, F., Pagani, S., 
Cerletti, P., and Giori, C., 1983. Modification of the thermodynamic properties of the electron-transferring groups in mito- 
chondrial succinate dehydrogenase upon binding of succinate. European Journal of Biochemistry 134:439-445). 
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have large positive reduction potentials have a strong tendency to accept electrons, and 
the oxidized form of such a couple (Os, for example) is a strong oxidizing agent. Re- 
dox couples with large negative reduction potentials have a strong tendency to un- 
dergo oxidation (that is, donate electrons), and the reduced form of such a couple 
(NADPH, for example) is a strong reducing agent. 


Eo Values Can Be Used to Analyze Energy Changes in Redox Reactions 


The half-reactions and reduction potentials in Table 20.1 can be used to analyze en- 
ergy changes in redox reactions. The oxidation of NADH to NAD* can be coupled 
with the reduction of a-ketoglutarate to isocitrate: 


NAD* + isocitrate—-> NADH + H* + a-ketoglutarate + CO, (20.5) 


This is the isocitrate dehydrogenase reaction of the TCA cycle. Writing the two half- 
cell reactions, we have 


NAD+ + 2 Ht + 2 @—>NADH + Ht 
o = —0.32 V (20.6) 


a-Ketoglutarate + CO, + 2 Ht + 2 ¢ ——isocitrate 
Go = —0.38 V (20.7) 


In a spontaneous reaction, electrons are donated by (flow away from) the half- 
reaction with the more negative reduction potential and are accepted by (flow to- 
ward) the half-reaction with the more positive reduction potential. Thus, in the pre- 
sent case, isocitrate donates electrons and NAD+ accepts electrons. The convention 
defines AG,’ as 


AE = €,' (acceptor) — €,’ (donor) (20.8) 
In the present case, isocitrate is the donor and NAD* the acceptor, so we write 

A,’ = —0.32 V — (—0.38 V) = +0.06 V (20.9) 
From Equation 20.2, we can now calculate A G°’ as 

AG” = — (2) (96.485 kJ/V - mol) (0.06 V) (20.10) 

AG’ = —11.58 kJ/mol 


ll 


Note that a reaction with a net positive A,’ yields a negative AG°’, indicating a 
spontaneous reaction. 


The Reduction Potential Depends on Concentration 


We have already noted that the standard free energy change for a reaction, AG°’, does 
not reflect the actual conditions in a cell, where reactants and products are not at 
standard-state concentrations (1 M). Equation 3.13 was introduced to permit calcu- 
lations of actual free energy changes under non-standard-state conditions. Similarly, 
standard reduction potentials for redox couples must be modified to account for the 
actual concentrations of the oxidized and reduced species. For any redox couple, 


ox + ne => red (20.11) 


the actual reduction potential is given by 


ox] 
[red] 





E = o + (RT/ nF) In (20.12) 

Reduction potentials can also be quite sensitive to molecular environment. The in- 
fluence of environment is especially important for flavins, such as FAD/FADHe and 
FMN/FMNH;3. These species are normally bound to their respective flavoproteins; 
the reduction potential of bound FAD, for example, can be very different from the 
value shown in Table 20.1 for the free FAD/FADH» couple of —0.219 V. Problem 7 at 
the end of the chapter addresses this case. 
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20:38 How Is the Electron-Transport Chain Organized? 


As we have seen, the metabolic energy from oxidation of food materials—sugars, fats, 
and amino acids—is funneled into formation of reduced coenzymes (NADH) and re- 
duced flavoproteins ([FADHp»]). The electron-transport chain reoxidizes the coen- 
zymes and channels the free energy obtained from these reactions into the creation 
of a proton gradient. This reoxidation process involves the removal of both protons 
and electrons from the coenzymes. Electrons move from NADH and [FADHg] to mo- 
lecular oxygen, Oo, which is the terminal acceptor of electrons in the chain. The re- 
oxidation of NADH, 


NADH (reductant) + H* + Og (oxidant) —> NAD* + HO (20.13) 

involves the following half-reactions: 
NAD+ + 2H* + 2¢@—>NADH+H* @,' = —0.32 V (20.14) 
+O, + 2H* + 2¢—>H,O E = +0.816V (20.15) 


Here, half-reaction 20.15 is the electron acceptor and half-reaction 20.14 is the elec- 
tron donor. Then 


AZ,’ = 0.816 — (—0.32) = 1.136 V (20.16) 


and, according to Equation 20.2, the standard-state free energy change, A G°’, is 
—219 kJ/mol. Molecules along the electron-transport chain have reduction poten- 
tials between the values for the NAD*/NADH couple and the oxygen/H,O couple, 
so electrons move down the energy scale toward progressively more positive reduc- 
tion potentials (Figure 20.3). 

Although electrons move from more negative to more positive reduction po- 
tentials in the electron-transport chain, it should be emphasized that the electron 
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FIGURE 20.3 €,’ and € values for the components of 
the mitochondrial electron-transport chain. Values indi- 
cated are consensus values for animal mitochondria. 
Black bars represent @,'; red bars, G. 
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carriers do not operate in a simple linear sequence. This will become evident 
when the individual components of the electron-transport chain are discussed in 
the following paragraphs. 


The Electron-Transport Chain Can Be Isolated in Four Complexes 


The electron-transport chain involves several different molecular species, 
including: 


1. Flavoproteins, which contain tightly bound FMN or FAD as prosthetic groups 
and which may participate in one- or two-electron transfer events. 

2. Coenzyme Q, also called ubiquinone (and abbreviated CoQ or UQ) (see Figure 
20.5), which can function in either one- or two-electron transfer reactions. 

3. Several cytochromes (proteins containing heme prosthetic groups [see Chapter 
5], which function by carrying or transferring electrons), including cytochromes 
b, c, cı, a, and as. Cytochromes are one-electron transfer agents in which the 
heme iron is converted from Fe?* to Fe?* and back. 

4. A number of iron-sulfur proteins, which participate in one-electron transfers in- 
volving the Fe** and Fe** states. 

5. Protein-bound copper, a one-electron transfer site that converts between Cu* 
and Cu?". 


All these intermediates except for cytochrome care membrane associated (either 
in the mitochondrial inner membrane of eukaryotes or in the plasma membrane of 
prokaryotes). Three types of proteins involved in this chain—flavoproteins, 
cytochromes, and iron-sulfur proteins—possess electron-transferring prosthetic 
groups. 

The components of the electron-transport chain can be purified from the mito- 
chondrial inner membrane. Solubilization of the membranes containing the electron- 
transport chain results in the isolation of four distinct protein complexes, and the com- 
plete chain can thus be considered to be composed of four parts: (I) NADH-—coenzyme 
Q reductase, (II) succinate-coenzyme Q reductase, (III) coenzyme Q-cytochrome c 
reductase, and (IV) cytochrome c oxidase (Figure 20.4). Complex I accepts electrons 
from NADH, serving as a link between glycolysis, the TCA cycle, fatty acid oxidation, 
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Succinate-coenzyme Q electron-transport chain. (Adapted from Nicholls, D. G, and Ferguson, S. J., 2002. Bioener- 
oxidoreductase getics 3. London: Academic Press.) 
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and the electron-transport chain. Complex II includes succinate dehydrogenase and 
thus forms a direct link between the TCA cycle and electron transport. Complexes I 
and II produce a common product, reduced coenzyme Q (UQHo), which is the sub- 
strate for coenzyme Q-cytochrome c reductase (Complex II). As shown in Figure 
20.4, there are two other ways to feed electrons to UQ: the electron-transferring 
flavoprotein, which transfers electrons from the flavoprotein-linked step of fatty acyl- 
CoA dehydrogenase, and sn-glycerophosphate dehydrogenase. Complex III oxidizes 
UQH)» while reducing cytochrome c, which in turn is the substrate for Complex IV, 
cytochrome coxidase. Complex IV is responsible for reducing molecular oxygen. Each 
of the complexes shown in Figure 20.4 is a large multisubunit complex embedded 
within the inner mitochondrial membrane. 


Complex | Oxidizes NADH and Reduces Coenzyme Q 


As its name implies, this complex transfers a pair of electrons from NADH to coen- 
zyme Q, a small, hydrophobic, yellow compound. Another common name for this en- 
zyme complex is NADH dehydrogenase. The complex (with an estimated mass of 
980 kD) involves at least 45 polypeptide chains, one molecule of flavin mononu- 
cleotide (FMN), and eight or nine Fe-S clusters, together containing a total of 20 to 
26 iron atoms (Table 20.2). By virtue of its dependence on FMN, NADH-UQ reduc- 
tase is a flavoprotein. 

Although the precise mechanism of the NADH—-UQ reductase is unknown, the 
first step involves binding of NADH to the enzyme on the matrix side of the inner 
mitochondrial membrane and transfer of electrons from NADH to tightly bound 
FMN: 


NADH + [FMN] + H*——> [FMNH.:] + NAD+ (20.17) 


The second step involves the transfer of electrons from the reduced [FMNHg] 
to a series of Fe-S proteins, including both 2Fe-2S and 4Fe-4S clusters (see page 
577). The versatile redox properties of the flavin group of FMN are important 
here. NADH is a two-electron donor, whereas the Fe-S proteins are one-electron 
transfer agents. The flavin of FMN has three redox states—the oxidized, semi- 
quinone, and reduced states. It can act as either a one-electron or a two-electron 
transfer agent and may serve as a critical link between NADH and the Fe-S 
proteins. 

The final step of the reaction involves the transfer of two electrons from iron- 
sulfur clusters to coenzyme Q. Coenzyme Q is a mobile electron carrier. Its isoprenoid 














i 0 
Mass Prosthetic 
Complex (kD) Subunits Group Binding Site for: 
NADH-UQ reductase 980 245 FMN NADH (matrix side) 
Fe-S UQ (lipid core) 
Succinate—UQ reductase 140 4 FAD Succinate (matrix side) 
Fe-S UQ (lipid core) 
UQ-Cyt c reductase 250 9-10 Heme b Cyt c (intermembrane 
Heme by space side) 
Heme cı 
Fe-S 
Cytochrome c 13 1 Heme c Cyt cı 
Cyt a 
Cytochrome c oxidase 162 13 Heme a Cyt c (intermembrane 
Heme as space side) 
Cu, 
Cup 
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FIGURE 20.5 (a) The three oxidation states of coenzyme Q. (b) A space-filling 
model of coenzyme Q. 





tail makes it highly hydrophobic, and it diffuses freely in the hydrophobic core of the 
inner mitochondrial membrane. As a result, it shuttles electrons from Complexes I 
and II to Complex II. The redox cycle of UQ is shown in Figure 20.5. The structural 
and functional organization of Complex I is shown in Figure 20.6. 


Complex | Transports Protons from the Matrix to the Cytosol The oxidation of 
one NADH and the reduction of one UQ by NADH-UQ reductase results in the net 





HUMAN BIOCHEMISTRY 


Solving a Medical Mystery Revolutionized Our Treatment of Parkinson’s Disease 





A tragedy among illegal drug users was the impetus for a revolu- 
tionary treatment of Parkinson’s disease. In 1982, several mysteri- 
ous cases of paralysis came to light in southern California. The vic- 
tims, some of them teenagers, were frozen like living statues, 
unable to talk or move. The case was baffling at first, but it was 
soon traced to a batch of synthetic heroin that contained MPTP 
(1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine) as a contaminant. 
MPTP is rapidly converted in the brain to MPP* (1-methyl-4- 
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phenylpyridine) by the enzyme monoamine oxidase B. MPP* is a 
potent inhibitor of mitochondrial Complex I (NADH-UQ reduc- 
tase), and it acts preferentially in the substantia nigra, an area of 
the brain that is essential to movement and also the region of the 
brain that deteriorates slowly in Parkinson’s disease. 

Parkinson’s disease results from the inability of the brain to 
produce sufficient quantities of dopamine, a neurotransmitter. 
Neurologist J. William Langston, asked to consult on the treat- 
ment of some of these patients, recognized that the symptoms of 
this drug-induced disorder were in fact similar to those of parkin- 
sonism. He began treatment of the patients with L-dopa, which is 
decarboxylated in the brain to produce dopamine. The treated pa- 
tients immediately regained movement. Langston then took a 
bold step. He implanted fetal brain tissue into the brains of several 
of the affected patients, prompting substantial recovery from the 
Parkinson-like symptoms. Langston’s innovation sparked a revolu- 
tion in the use of tissue implantation for the treatment of neu- 
rodegenerative diseases. 

Other toxins may cause similar effects in neural tissue. Timothy 
Greenmyre at Emory University has shown that rats exposed to the 
pesticide rotenone (see Figure 20.27) over a period of weeks expe- 
rience a gradual loss of function in dopaminergic neurons and then 
develop symptoms of parkinsonism, including limb tremors and 
rigidity. This finding supports earlier research that links long-term 
pesticide exposure to Parkinson’s disease. 
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CENGAGENOW” ACTIVE FIGURE 20.6 (a) Structural organization of mammalian Complex |, based on electron 
microscopy, showing functional relationships within the L-shaped complex. Electron flow from NADH to 
UQH, in the membrane pool is indicated. (b) Structure of the hydrophilic domain of Complex | from Thermus 
thermophilus is shown on a model of the membrane-associated complex (pdb id = 2FUG).The locations of 
individual subunits are indicated. (c) Arrangement of the redox centers in Complex |. The various iron-sulfur 
centers of Complex | are designated by capital N. (Part a adapted from Janssen, R. J., Nijtmans, L. G, van den Heuvel, L. P, 
and Smeitink, J. A., 2006. Mitochondrial complex |: Structure, function, and pathology. Journal of Inherited Metabolic Diseases 
29:499-515; and parts b and c adapted from Figure 1 of Sazanov, L., and Hinchliffe, P, 2006. Structure of the hydrophilic 
domain of respiratory Complex | from Thermus thermophilus. Science 311:1430-1436.) Test yourself on the concepts in 
this figure at www.cengage.com/login. 





transport of protons from the matrix side to the cytosolic side of the inner mem- 
brane. The cytosolic side, where H* accumulates, is referred to as the P (for posi- 
tive) face; similarly, the matrix side is the N (for negative) face. Some of the energy 
liberated by the flow of electrons through this complex is used in a coupled process 
to drive the transport of protons across the membrane. (This is an example of ac- 
tive transport, a phenomenon examined in detail in Chapter 9.) Available experi- 
mental evidence suggests a stoichiometry of four H* transported per two electrons 
passed from NADH to UQ. 


Complex II Oxidizes Succinate and Reduces Coenzyme Q 


Complex II is perhaps better known by its other name—succinate dehydrogenase, 
the only TCA cycle enzyme that is an integral membrane protein in the inner 
mitochondrial membrane. This complex (Figure 20.7) has a mass of 124 kD and 
is composed of two hydrophilic subunits, a flavoprotein (Fp, 68 kD) and an 
iron-sulfur protein (Ip, 29 kD), and two hydrophobic, membrane-anchored sub- 
units (15 kD and 11 kD), which contain one heme band provide the binding site 
for UQ. Fp contains an FAD covalently bound to a His residue (see Figure 19.12), 
and Ip contains three Fe-S centers: a 4Fe-4S cluster, a 3Fe-4S cluster, and a 2Fe-2S 
cluster. When succinate is converted to fumarate in the TCA cycle, concomitant re- 
duction of bound FAD to FADH; occurs in succinate dehydrogenase. This FADH» 
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CENGAGENOW” ACTIVE FIGURE 20.7 (a) A scheme for 
electron flow in Complex Il. Oxidation of succinate 
occurs with reduction of [FAD]. Electrons are then 
passed to Fe-S centers and then to coenzyme Q (UQ). 
Proton transport does not occur in this complex. (b) The 
structure of Complex II from pig heart (pdb id = 1ZOY). 
(c) The arrangement of redox centers. Electron flow is 
from bottom to top. Test yourself on the concepts in 
this figure at www.cengage.com/login. 
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transfers its electrons immediately to Fe-S centers, which pass them on to UQ. Elec- 
tron flow from succinate to UQ, 


Succinate —> fumarate + 2 Ht + 2 & (20.18) 
UQ + 2 H+ + 2 ¢—>»UQH: (20.19) 
Net rxn: Succinate + UQ —> fumarate + UQH; A,’ = 0.029 V (20.20) 


yields a net reduction potential of 0.029 V. (Note that the first half-reaction is writ- 
ten in the direction of the e flow. As always, AS,’ is calculated according to Equa- 
tion 20.8.) The small free energy change of this reaction does not contribute to the 
transport of protons across the inner mitochondrial membrane. 

This is a crucial point because (as we will see) proton transport is coupled with ATP 
synthesis. Oxidation of one FADH; in the electron-transport chain results in synthesis 
of approximately two molecules of ATP, compared with the approximately three 
ATPs produced by the oxidation of one NADH. Other flavoproteins can also supply 
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FIGURE 20.8 The fatty acyl-CoA dehydrogenase reaction, emphasizing that the reaction involves reduction of 
enzyme-bound FAD (indicated by brackets). 


electrons to UQ, including mitochondrial sn-glycerophosphate dehydrogenase, an 
inner membrane-bound shuttle enzyme, and the fatty acyl-CoA dehydrogenases, 
three soluble matrix enzymes involved in fatty acid oxidation (Figure 20.8; also see 
Chapter 23). The path of electrons from succinate to UQ is shown in Figure 20.7. 


Complex III Mediates Electron Transport from Coenzyme Q 
to Cytochrome c 


In the third complex of the electron-transport chain, reduced coenzyme Q (UQHo) 
passes its electrons to cytochrome c via a unique redox pathway known as the Q cycle. 
UQ-cytochrome c reductase (UQ-cyt c reductase), as this complex is known, involves 
three different cytochromes and an Fe-S protein. In the cytochromes of these and sim- 
ilar complexes, the iron atom at the center of the porphyrin ring cycles between the 
reduced Fe?* (ferrous) and oxidized Fe** (ferric) states. 

Cytochromes were first named and classified on the basis of their absorption spec- 
tra (Figure 20.9), which depend upon the structure and environment of their heme 
groups. The b cytochromes contain iron protoporphyrin IX (Figure 20.10), the same 
heme found in hemoglobin and myoglobin. The c cytochromes contain heme c, derived 
from iron protoporphyrin IX by the covalent attachment to cysteine residues from the 
associated protein. (One other heme variation, heme a, contains a 15-carbon iso- 
prenoid chain on a modified vinyl group and a formyl group in place of one of the 
methyls [see Figure 20.10]. Cytochrome a is found in two forms in Complex IV of the 
electron-transport chain, as we shall see.) UQ-cyt ¢ reductase (Figure 20.11) contains 
a b-type cytochrome, of 30 to 40 kD, with two different heme sites and one c-type cyto- 
chrome. The two hemes on the b cytochrome polypeptide in UQ-cyt ¢ reductase are 
distinguished by their reduction potentials and the wavelength (Ajax) of the so-called 


(a) Cytochrome c: reduced spectrum 
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(b) Cytochrome c: oxidized spectrum 
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FIGURE 20.9 Visible absorption spectra of cytochrome c. 
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FIGURE 20.10 The structures of iron protoporphyrin IX, 
heme c and heme a. 
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FIGURE 20.11 The structure of UQ-cyt c reductase, also 
known as the cytochrome bc, complex. The a-helical 
bundle near the top of the structure defines the trans- 
membrane domain of the protein (pdb id = 1BE3). 





a-band. One of these hemes, known as by or 0566, has a standard reduction potential, 
Éo’, of —0.100 V and a wavelength of maximal absorbance (A wax) of 566 nm. The other, 
known as by or b562, has a standard reduction potential of +0.050 V and a Amax of 562 
nm. (H and L here refer to high and low reduction potential.) 

The structure of the UQ-cyt c reductase, also known as the cytochrome bc; com- 
plex, has been determined by Johann Deisenhofer and his colleagues. (Deisenhofer 
was a co-recipient of the Nobel Prize in Chemistry for his work on the structure of 
a photosynthetic reaction center; see Chapter 21). The complex is a dimer, with 
each monomer consisting of 11 protein subunits and 2165 amino acid residues 
(monomer mass, 248 kD). The dimeric structure is pear-shaped and consists of a 
large domain that extends 75 A into the mitochondrial matrix, a transmembrane 
domain consisting of 13 transmembrane a-helices in each monomer and a small do- 
main that extends 38 A into the intermembrane space (Figure 20.11). Most of the 
Rieske protein (an Fe-S protein named for its discoverer) is mobile in the crystal 
(only 62 of its 196 residues are shown in the structure in Figure 20.11), and Deisen- 
hofer has postulated that mobility of this subunit could be required for electron 
transfer in the function of this complex. 


Complex Ill Drives Proton Transport As with Complex I, passage of electrons 
through the Q cycle of Complex III is accompanied by proton transport across the in- 
ner mitochondrial membrane. The postulated pathway for electrons in this system is 
shown in Figure 20.12. A large pool of UQ and UQHb exists in the inner mitochon- 
drial membrane. The Q cycle is initiated when a molecule of UQHe» from this pool 
diffuses to a site (called Q,) on Complex III near the cytosolic face of the membrane. 

Oxidation of this UQHs occurs in two steps. First, an electron from UQHs is trans- 
ferred to the Rieske protein and then to cytochrome q. This releases two H* to the 
cytosol and leaves UQ : ~, a semiquinone anion form of UQ, at the Q, site. The sec- 
ond electron is then transferred to the b, heme, converting UQ:~ to UQ. The 
Rieske protein and cytochrome c; are similar in structure; each has a globular do- 
main and is anchored to the inner mitochondrial membrane by a hydrophobic seg- 
ment. However, the hydrophobic segment is N-terminal in the Rieske protein and 
C-terminal in cytochrome ¢). 
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The electron on the b; heme facing the cytosolic side of the membrane is now 
passed to the by heme on the matrix side of the membrane. This electron trans- 
fer occurs against a membrane potential of 0.15 V and is driven by the loss of 
redox potential as the electron moves from b; (€,' = —0.100 V) to by (o = 
+0.050 V). The electron is then passed from by to a molecule of UQ at a second 
quinone-binding site, Q,, converting this UQ to UQ: ~. The resulting UQ ~ 
remains firmly bound to the Q, site. This completes the first half of the Q cycle 
(Figure 20.12a). 

The second half of the cycle (Figure 20.12b) is similar to the first half, with a sec- 
ond molecule of UQHs: oxidized at the Q, site, one electron being passed to cyto- 
chrome cı and the other transferred to heme b; and then to heme by. In this latter 
half of the Q cycle, however, the by electron is transferred to the semiquinone anion, 
UQ.:~, at the Q, site. With the addition of two H* from the mitochondrial matrix, 
this produces a molecule of UQHg, which is released from the Q, site and returns to 
the coenzyme Q pool, completing the Q cycle. 


The Q Cycle Is an Unbalanced Proton Pump Why has nature chosen this rather 
convoluted path for electrons in Complex III? First of all, Complex III takes up two 
protons on the matrix side of the inner membrane and releases four protons on 
the cytoplasmic side for each pair of electrons that passes through the Q cycle. The 
other significant feature of this mechanism is that it offers a convenient way for a 
two-electron carrier, UQH,, to interact with the b; and by hemes, the Rieske pro- 
tein Fe-S cluster, and cytochrome cj, all of which are one-electron carriers. 


CENGAGENOW” ACTIVE FIGURE 20.12 The Q cycle in 
mitochondria. (a) The electron-transfer pathway follow- 
ing oxidation of the first UQH; at the Q, site near the 
cytosolic face of the membrane. (b) The pathway fol- 
lowing oxidation of a second UQH);. Test yourself on 
the concepts in this figure at www.cengage.com/ 
login. 
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FIGURE 20.13 The structure of mitochondrial cyto- 
chrome c. The heme is shown at the center of the struc- 
ture. It is covalently linked to the protein via two sulfur 
atoms. A third sulfur from a methionine residue coordi- 
nates the iron (pdb id = 2B4Z). 





FIGURE 20.14 Bovine cytochrome c oxidase consists of 
13 subunits. The 3 largest subunits—I (purple), II (yellow), 
and III (blue)—contain the proton channels and the 
redox centers (pdb id = 2EWJ). 


Cytochrome c Is a Mobile Electron Carrier Electrons traversing Complex II are 
passed through cytochrome c; to cytochrome c. Cytochrome c is the only one of the 
mitochondrial cytochromes that is water soluble. Its structure (Figure 20.13) is glob- 
ular; the planar heme group lies near the center of the protein, surrounded pre- 
dominantly by hydrophobic amino acid residues. The iron in the porphyrin ring is 
coordinated both to a histidine nitrogen and to the sulfur atom of a methionine 
residue. Coordination with ligands in this manner on both sides of the porphyrin 
plane precludes the binding of oxygen and other ligands, a feature that distinguishes 
cytochrome c from hemoglobin (see Chapter 15). 

Cytochrome į like UQ, is a mobile electron carrier. It associates loosely with the 
inner mitochondrial membrane (in the intermembrane space on the cytosolic side 
of the inner membrane) to acquire electrons from the Fe-S—cyt cı aggregate of 
Complex HI, and then it migrates along the membrane surface in the reduced state, 
carrying electrons to cytochrome c oxidase, the fourth complex of the electron- 
transport chain. 


Complex IV Transfers Electrons from Cytochrome c to Reduce Oxygen 
on the Matrix Side 


Complex IV is called cytochrome c oxidase because it accepts electrons from cyto- 
chrome c and directs them to the four-electron reduction of Os to form H,O: 


4 cyt c (Fe®*) + 4H* + Og —>4 cyt c (Fe®*) + 2 HO (20.21) 


Thus, cytochrome c oxidase and O, are the final destination for the electrons de- 
rived from the oxidation of food materials. In concert with this process, cytochrome 
c oxidase also drives transport of protons across the inner mitochondrial mem- 
brane. The combined processes of oxygen reduction and proton transport involve 
a total of 8H* in each catalytic cycle—four H* for Og reduction and four H* trans- 
ported from the matrix to the intermembrane space. 

The total number of subunits in cytochrome c oxidase varies from 2—4 (in bacte- 
ria) to 13 (in mammals). Three subunits (I, II, and III) are common to most or- 
ganisms (Figure 20.14). This minimal complex, which contains two hemes (termed 
aand az) and three copper ions (two in the Cu, center and one in the Cug site), is 
sufficient to carry out both oxygen reduction and proton transport. 

The total mass of the protein in mammalian Complex IV (Figure 20.15) is 204 kD. 
In mammals, subunits I through IMI, the largest ones, are encoded by mitochondrial 
DNA, synthesized in the mitochondrion, and inserted into the inner membrane from 
the matrix side. The 10 smaller subunits are coded by nuclear DNA, are synthesized in 
the cytosol, and are presumed to play regulatory roles in the complex. 


FIGURE 20.15 The complete structure of bovine cyto- 
chrome c oxidase (pdb id = 2EU). 
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In the bovine structure, subunit I is cylindrical in shape and consists of 12 trans- 
membrane helices, without any significant extramembrane parts. Hemes a and as, 
which lie perpendicular to the membrane plane, and Cug are cradled by the helices 
of subunit I (Figure 20.16). Subunits II and III lie on opposite sides of subunit I and 
do not contact each other (see Figure 20.14). Subunit II has an extramembrane do- 
main on the outer face of the inner mitochondrial membrane. This domain con- 
sists of a 10-strand B-barrel that holds the two copper ions of the Cug site 7 A from 
the nearest surface atom of the subunit. Subunit III consists of seven transmem- 
brane helices with no significant extramembrane domains. 


Electron Transfer in Complex IV Involves Two Hemes and Two Copper Sites 
Electron transfer through Complex IV begins with binding of cytochrome c to the 
-barrel of subunit II. Four electrons are transferred sequentially (one each from 
four molecules of cytochrome ¢) first to the Cu, center, next to heme a, and finally 
to the Cug/heme az active site, where Où is reduced to HO (Figure 20.16): 


Cyt c— Cu, —> heme a—-> Cug/heme as —> Oo (20.22) 


A tryptophan residue, which lies 5A above the Cu, site (Figure 20.17a), is the entry 
point for electrons from cytochrome c<. It lies in a hydrophobic patch on subunit II, 
surrounded by a ring of negatively charged Asp and Glu residues. Electrons flow 
rapidly from Cu, to heme a, which is coordinated by a pair of His residues (Figure 
20.17b), and then to the Cug/heme a, complex. The Fe atom in heme 4; is five co- 
ordinate (Figure 20.17c), with four ligands from the heme plane and one from 
His*”®. This leaves a sixth position free, and this is the catalytic site where O% binds 
and is reduced. Cug is about 5A from the Fe atom of heme a; and is coordinated by 
three histidine ligands, including His*"°, His*°°, and His"! (Figure 20.17c). An un- 
usual crosslink between His" and Tyr?" lowers the pK, of the Tyr hydroxyl so that 
it can participate in proton transport across the membrane. 


2Ht 





2Ht 


CENGAGENOW” ACTIVE FIGURE 20.16 The electron- 
transfer pathway for cytochrome oxidase. Cytochrome c 
binds on the cytosolic side, transferring electrons through 
the copper and heme centers to reduce O; on the matrix 
side of the membrane (pdb id = 2ElJ). Test yourself on 
the concepts in this figure at www.cengage.com/ 
login. 
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FIGURE 20.18 (a) The proton channels of cytochrome c 
oxidase from R. sphaeroides. Functional residues in the 
D- and K-pathways are indicated. The D- and 
K-pathways converge at the Cug/heme a; center. The 
proton exit channel is lined by residues 320 to 340 of 
subunit | (pdb id = 1M56). (b) Protons are presumed to 
“hop’ along arrays of water molecules in the proton 
transport channels of cytochrome c oxidase. Such a 
chain of protonation and deprotonation events means 
that the proton eventually released from the exit chan- 
nel is far removed from the proton that entered the 
D-pathway and initiated the cascade. 
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FIGURE 20.17 Structures of the redox centers of bovine cytochrome c oxidase. (a) The Cus site, (b) the heme a 
site, and (c) the binuclear Cug/heme a; site (pdb id = 2ẸlJ). 


Proton Transport Across Cytochrome c Oxidase Is Coupled 
to Oxygen Reduction 


Proton transport in R. sphaeroides cytochrome c oxidase takes place via two channels 
denoted the D- and K-pathways (Figure 20.18a). Both these channels contain water 
molecules, and they are lined with polar residues that can either protonate and de- 
protonate or form hydrogen bonds. The D-pathway is named for Asp!’ at the chan- 
nel opening, and the K-pathway is named for Lys*®”, a critical residue located mid- 
way in the channel. These two channels converge at the binuclear Cug/heme az site 
midway across the complex and the membrane. Here, Glu**® serves as a branch 
point, shuttling protons either to the catalytic site for Og reduction (to form H0O) 
or to the exit channel (residues 320 to 340) that leads protons to the intermembrane 
space (Figure 20.18a). In each catalytic cycle, two H* pass through the K-pathway 
and six H” traverse the D-pathway. The K-pathway protons and two of the D-pathway 
protons participate in the reduction of one Oz to two H2O, and the remaining four 
D-pathway protons are passed across the membrane and released to the intermem- 
brane space. 
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20.3. How ls the Electron-Transport Chain Organized? 


How are protons driven across cytochrome c oxidase? The mechanism involves 
three key features: 


e The pK, values of protein side chains in the proton channels are shifted (by the 
local environment) to make them effective proton donors or acceptors during 
transport. For example, the pK, of Glu?*® is unusually high at 9.4. (This is simi- 
lar to the behavior of Asp% and Asp” in bacteriorhodopsin; see pages 285-286, 
Chapter 9.) 

e Electron transfer events induce conformation changes that control proton trans- 
port. For example, redox events at the Cug/heme az site are sensed by Glu?*® and 
an adjacent proton-gating loop (residues 169 to 175), controlling H* binding 
and release by Glu’? and proton movement through the exit channel. 

e Protons are “transported” via chains of hydrogen-bonded water molecules in the 
proton channels (Figure 20.18b). Sequential hopping of protons along these 
“proton wires” essentially transfers a “positive charge” between distant residues in 
the channel. (Note that the H* that arrives at an accepting residue is not the 
same proton that left the donating residue.) 


The Four Electron-Transport Complexes Are Independent 


It should be emphasized here that the four major complexes of the electron- 
transport chain operate quite independently in the inner mitochondrial mem- 
brane. Each is a multiprotein aggregate maintained by numerous strong associa- 
tions between peptides of the complex, but there is no evidence that the complexes 
associate with one another in the membrane. Measurements of the lateral diffusion 
rates of the four complexes, of coenzyme Q, and of cytochrome c in the inner mito- 
chondrial membrane show that the rates differ considerably, indicating that these 
complexes do not move together in the membrane. Kinetic studies with reconsti- 
tuted systems show that electron transport does not operate by means of connected 
sets of the four complexes. 


The Model of Electron Transport Is a Dynamic One The model that emerges for 
electron transport is shown in Figure 20.19. The four complexes are independently 
mobile in the membrane. Coenzyme Q collects electrons from NADH-UQ reduc- 
tase and succinate—UQ reductase and delivers them (by diffusion through the mem- 
brane core) to UQ-cyt c reductase. Cytochrome cis water soluble and moves freely 
in the intermembrane space, carrying electrons from UQ-cyt ¢ reductase to cyto- 
chrome c oxidase. In the process of these electron transfers, protons are driven 
across the inner membrane (from the matrix side to the intermembrane space). 
The proton gradient generated by electron transport represents an enormous 
source of potential energy. As seen in the next section, this potential energy is used 
to synthesize ATP as protons flow back into the matrix. 


Electron Transfer Energy Stored in a Proton Gradient: 
The Mitchell Hypothesis 


In 1961, Peter Mitchell, a British biochemist, proposed that the energy stored in a pro- 
ton gradient across the inner mitochondrial membrane by electron transport drives 
the synthesis of ATP in cells. The proposal became known as Mitchell’s chemiosmotic 
hypothesis. In this hypothesis, protons are driven across the membrane from the 
matrix to the intermembrane space and cytosol by the events of electron transport. 
This mechanism stores the energy of electron transport in an electrochemical 
potential. As protons are driven out of the matrix, the pH rises and the matrix 
becomes negatively charged with respect to the cytosol (Figure 20.20). Electron 
transport-driven proton pumping thus creates a pH gradient and an electrical gra- 
dient across the inner membrane, both of which tend to attract protons back into 
the matrix from the cytoplasm. Flow of protons down this electrochemical gradient, 
an energetically favorable process, drives the synthesis of ATP. 
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FIGURE 20.19 A model for the electron-transport pathway in the mitochondrial inner membrane. UQ/UQH; 


and cytochrome c are mobile electron carriers and function by transferring electrons between the complexes. 
The proton transport driven by Complexes |, Ill, and IV is indicated. 


Cytosol 


Intermembrane space 
(high [H*], low pH) 






Matrix 
(low [H*], high pH) 


FIGURE 20.20 The proton and electrochemical gradients existing across the inner mitochondrial membrane. 
The electrochemical gradient is generated by the transport of protons across the membrane by Complexes |, 
lll, and IV in the inner mitochondrial membrane. 


20.5 How Does a Proton Gradient Drive the Synthesis of ATP? 


The ratio of protons transported per pair of electrons passed through the chain— 
the so-called H*/2e~ ratio—has been an object of great interest for many years. Nev- 
ertheless, the ratio has remained extremely difficult to determine. The consensus esti- 
mate for the electron-transport pathway from succinate to Og is 6H*/2¢-. The ratio for 
Complex I by itself remains uncertain, but recent best estimates place it as high as 
4H*/2e¢. On the basis of this value, the stoichiometry of transport for the pathway 
from NADH to O, is 10H*/2e-. Although this is the value assumed in Figure 20.19, it 
is important to realize that this represents a consensus drawn from many experiments. 


20:4: What Are the Thermodynamic Implications 
of Chemiosmotic Coupling? 


Mitchell’s chemiosmotic hypothesis revolutionized our thinking about the energy 
coupling that drives ATP synthesis by means of an electrochemical gradient. How 
much energy is stored in this electrochemical gradient? For the transmembrane 
flow of protons across the inner membrane (from inside [matrix] to outside), we 
could write 


Hin — A ot (20.23) 


The free energy difference for protons across the inner mitochondrial membrane 
includes a term for the concentration difference and a term for the electrical po- 
tential. This is expressed as 


[co] 
AG=RTln F7 
la] 
where cı and cs are the proton concentrations on the two sides of the membrane, 
Zis the charge on a proton, # is Faraday’s constant, and Ay is the potential dif- 
ference across the membrane. For the case at hand, this equation becomes 


[H"a] 
[H"a] 


+ ZFAW (20.24) 





AG=RTIn + ZFAw (20.25) 


In terms of the matrix and cytoplasm pH values, the free energy difference is 
AG = —2.303 RT(pHouw — pHin) + FAY (20.26) 


Reported values for Ay and ApH vary, but the membrane potential is always found 
to be positive outside and negative inside, and the pH is always more acidic outside 
and more basic inside. Taking typical values of AW = 0.18 V and ApH = 1 unit, the 
free energy change associated with the movement of one mole of protons from in- 
side to outside is 


AG= 2.3 RT + ¥(0.18 V) (20.27) 
With F = 96.485 kJ/V - mol, the value of AG at 37°C is 
AG= 5.9 kJ + 17.4 kJ = 23.3 kJ (20.28) 


which is the free energy change for movement of a mole of protons across an inner 
membrane. Note that the free energy terms for both the pH difference and the po- 
tential difference are unfavorable for the outward transport of protons, with the latter 
term making the greater contribution. On the other hand, the AG for inward flow of 
protons is — 23.3 kJ/mol. It is this energy that drives the synthesis of ATP, in accord with 
Mitchell’s model. Peter Mitchell was awarded the Nobel Prize in Chemistry in 1978. 


20:55 How Does a Proton Gradient Drive the Synthesis of ATP? 


The great French chemist Antoine Lavoisier showed in 1777 that foods undergo 
combustion in the body. Since then, chemists and biochemists have wondered how 
energy from food oxidation is captured by living things. Mitchell paved the way by 
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FIGURE 20.21 The ATP synthase, a rotating molecular 
motor. The c-, y-,and e-subunits constitute the rotating 
portion (the rotor) of the motor. Flow of protons from 
the a-subunit through the c-subunit turns the rotor and 
drives the cycle of conformational changes in a and B 
that synthesize ATP (pdb id = 1C17; 1E79; 2A7U; 2CLY; 
and 2BO5). 





suggesting that a proton gradient across the inner mitochondrial membrane could 
drive the synthesis of ATP. But how could the proton gradient be coupled to ATP 
production? The answer lies in a mitochondrial complex called ATP synthase, or 
sometimes F,F,>—-ATPase (for the reverse reaction it catalyzes). The F, portion of the 
ATP synthase was first identified in early electron micrographs of mitochondrial 
preparations as spherical, 8.5-nm projections or particles on the inner membrane. 
The purified particles catalyze ATP hydrolysis, the reverse reaction of the ATP syn- 
thase. Stripped of these particles, the membranes can still carry out electron transfer 
but cannot synthesize ATP. In one of the first reconstitution experiments with mem- 
brane proteins, Efraim Racker showed that adding the particles back to stripped 
membranes restored electron transfer-dependent ATP synthesis. 


ATP Synthase Is Composed of F, and Fy 


ATP synthase is a remarkable molecular machine. It is an enzyme, a proton pump, and 
a rotating molecular motor. Nearly all the ATP that fuels our cellular processes is made 
by this multifaceted molecular superstar. The spheres observed in electron micro- 
graphs make up the F; unit, which catalyzes ATP synthesis (Figure 20.21). These F, 
spheres are attached to an integral membrane protein aggregate called the Fp unit. F; 
consists of five polypeptide chains named a, B, y, 6, and e, with a subunit stoichiome- 
try asB3yde (Table 20.3 and Figure 20.22). Fp includes three hydrophobic subunits de- 
noted by a, b, and c, with an apparent stoichiometry of @¢945. Fo forms the trans- 
membrane pore or channel through which protons move to drive ATP synthesis. 

The a and b subunits of Fy form part of the stator—a stationary component 
anchored in the membrane—and a ring of 10 to 15 c-subunits (see Table 20.3) 
constitutes a major component of the rotor of the motor. Protons flowing through the 
a-ccomplex cause the c-ring to rotate in the membrane. Each csubunit is a folded pair 
of a-helices joined by a short loop, whereas the a-subunit is presumed to be a cluster 
of a-helices. The b-subunit, together with the d-and h-subunits and the oligomycin sen- 
sitivity-conferring protein (OSCP), form a long, slender stalk that connects Fp in the 
membrane with F,, which extends out into the matrix. The b, d, and h subunits form 
long a@-helical segments that comprise the stalk, and OSCP adds a helical bundle cap 
that sits at the bottom of an a-subunit of F; (Figure 20.21). The stalk is a stable link be- 
tween Fy and Fj, essentially joining the two, both structurally and functionally. 


The Catalytic Sites of ATP Synthase Adopt Three Different Conformations 


The F; structure appears at first to be a symmetric hexamer of a- and B-subunits. 
However, it is asymmetric in several ways. The a- and 6-subunits, arranged in an al- 
ternating pattern in the hexamer, are similar but not identical. The hexamer con- 











TABLE 20.3 
Protein Subunit Mass 

Complex Function (kD) Stoichiometry 

Fy a 55.4 3 Stator 
B 51.3 3 Stator 
y 30.6 1 Rotor 
ô 14.6 1 Rotort 
E€ 6.6 1 Rotor 

Fo a 27.9 1 Stator 
b 23.3 1 Stator 
c 7.8 10-15* Rotor 
d 19.7 1 Stator 
h 10.4 1 Stator 

OSCP 20.9 1 Stator 











*The number of c subunits varies among organisms: yeast mitochondria, 10; Ilyobacter tartaricus, 11; Escherichia coli, 12; 
spinach chloroplasts, 14; Spirulina platensis, 15. 

‘The subunit nomenclature can be confusing. E. coli ATP synthase lacks a -subunit in its rotor; its 5-subunit is analogous 
structrually and functionally to the mitochondrial OSCP. 
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(b) 





tains six ATP-binding sites, each of them arranged at the interface of adjacent sub- 
units. Three of these, each located mostly on a -subunit but with some residues 
contributed by an a-subunit, are catalytic sites for ATP synthesis. The other three, 
each located mostly on an a-subunit but with residues contributed by a 6-subunit, 
are noncatalytic and inactive. The noncatalytic a-sites have similar structures, 
but the three catalytic B-sites have three quite different conformations. In the crystal struc- 
ture first characterized by John Walker, one of the -subunit ATP sites contains 
AMP-PNP (a nonhydrolyzable analog of ATP), another contains ADP, and the third 
site is empty. 





NHo 
N o B Y 
N ) \ O (0) (0) 
N I lanl 
N N CHO — P—O— P P= O7 
| | 
O o7 o- l- (20.29) 
H H 
H H Nonhydrolyzable 
OH OH B-y bond 


b. y-Imidoadenosine 5'-triphosphate (AMP-PNP) 


Walker’s work provided structural verification for a novel hypothesis first advanced 
by Paul Boyer, the binding change mechanism for ATP synthesis. Walker and Boyer, 
whose efforts provided complementary insights into the workings of this molecular 
motor, shared in the Nobel Prize for Chemistry in 1997. 


Boyer’s 180 Exchange Experiment Identified the Energy-Requiring Step 


The elegant studies by Boyer of !8O exchange in ATP synthase provided important in- 
sights into the mechanism of the enzyme. Boyer and his colleagues studied the ability 
of the synthase to incorporate labeled oxygen from Hy'8O into P;. This reaction (Fig- 
ure 20.23) occurs via synthesis of ATP from ADP and P,, followed by hydrolysis of ATP 
with incorporation of oxygen atoms from the solvent. Although net production of ATP 
requires coupling with a proton gradient, Boyer observed that this exchange reaction 
occurs readily, even in the absence of a proton gradient. The exchange reaction was so 
facile that, eventually, all four oxygens of phosphate were labeled with '°O. This im- 
portant observation indicated that the formation of enzyme-bound ATP does not re- 
quire energy. The experiments that followed, by Boyer, Harvey Penefsky, and others, 
showed clearly that the energy-requiring step in the ATP synthase was actually the 





FIGURE 20.22 (a) An axial view of the F; unit of the FyFo- 
ATP synthase, showing alternating œ and B subunits in a 
hexameric array, with the y subunit (purple) visible in 
the center of the structure. (b) A side view of the F1 
unit, with one a subunit and one B subunit removed to 
show how the y subunit (red) extends through the cen- 
ter of the œB, hexamer. Also shown are the 6 subunit 
(aqua) and the e subunit (pink), which link the y subunit 
to the Fo unit (pdb id = 1E79). 
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FIGURE 20.23 ATP—ADP exchange in the absence of a proton gradient. Exchange leads to incorporation of 1'80 
in phosphate as shown. Boyer's experiments showed that 180 could be incorporated into all four positions of 
phosphate, demonstrating that the free energy change for ATP formation from enzyme-bound ADP + P; is 
close to zero. (From Parsons, D. F., 1963. Science 140:985.) 





release of newly synthesized ATP from the enzyme (Figure 20.24). Eventually, it would 
be shown that flow of protons through Fo drives the enzyme conformational changes 
that result in the binding of substrates on ATP synthase, ATP synthesis, and the release 
of products. 


Boyer’s Binding Change Mechanism Describes the Events 
of Rotational Catalysis 


Boyer proposed that these conformation changes occurred in a rotating fashion. His 
rotational catalysis model, the binding change mechanism (Figure 20.24), suggested 
that at any instant the three 6 subunits of F, existed in three different conformations, 
that these different states represented the three steps of ATP synthesis, and that each 
site stepped through the three states to make ATP. A site beginning with ADP and 
phosphate bound (the first state) would synthesize ATP (producing the second state) 
and then release ATP, leaving an empty site (the third state). In the binding change 
mechanism, the three catalytic sites thus cycle concertedly through the three interme- 
diate states of ATP synthesis. 


Proton Flow Through Fo Drives Rotation of the Motor 
and Synthesis of ATP 


How might the cycling proposed by Boyer’s binding change mechanism occur? Im- 
portant clues have emerged from several experiments that show that the y-subunit 
rotates with respect to the a6 complex. How such rotation might be linked to trans- 
membrane proton flow and ATP synthesis is shown in Figure 20.25. The ring of 
c-subunits is a rotor that turns with respect to the @subunit, a stator component con- 
sisting of five transmembrane a-helices with proton access channels on either side of 
the membrane. The y-subunit is the link between the functions of F; and Fo. In one 
complete rotation, the y-subunit drives conformational changes in each -subunit 
that lead to ATP synthesis. Thus, three ATPs are synthesized per turn. 

But how does the Fọ complex couple the events of proton transport and ATP 
synthesis? The a-subunit contains two half-channels, a proton inlet channel that 
opens to the intermembrane space and a proton outlet channel that opens to the 
matrix. The c-subunits are proton carriers that transfer protons from the inlet 
channel to the outlet channel only by rotation of the c-ring. Each c-subunit con- 
tains a protonatable residue, Asp®!. Protons flowing from the intermembrane 
space through the inlet halfchannel protonate the Asp®! of a passing c-subunit and 
ride the rotor around the ring until they reach the outlet channel and flow out into 
the matrix. 


CENGAGENOW” ANIMATED FIGURE 20.24 The binding change mechanism for ATP synthesis by ATP synthase. 
This model assumes that F, has three interacting and conformationally distinct active sites: an open (O) confor- 
mation with almost no affinity for ligands, a loose (L) conformation with low affinity for ligands, and a tight (T) 
conformation with high affinity for ligands. Synthesis of ATP is initiated (step 1) by binding of ADP and P; to an 
L site. In the second step, an energy-driven conformational change converts the L site to a T conformation and 
converts T to O and O to L. In the third step, ATP is synthesized at the T site and released from the O site. Two 
additional passes through this cycle produce two more ATPs and return the enzyme to its original state. See 
this figure animated at www.cengage.com/login. 
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CENGAGENOW" ANIMATED FIGURE 20.25 

(a) Protons entering the inlet half-channel in 
the a-subunit are transferred to binding sites 
on c-subunits. Rotation of the c-ring delivers 
protons to the outlet half-channel in the 
a-subunit. Flow of protons through the struc- 
ture turns the rotor and drives the cycle of con- 
formational changes in B that synthesize ATP. 
(b) Arg”? on the a-subunit lies between the 
end of the inlet half-channel (Asn2"*) and the 
end of the outlet half-channel (Ser?) (pdb id = 
1C17). (c) A view looking down into the plane 
of the membrane. Transported protons flow 
from the inlet half-channel to Asp®' residues on 
the c-ring, around the ring, and then into the 
outlet half-channel. When Asp®' is protonated, 
the outer helix of the c-subunit rotates clock- 
wise to bury the protonated carboxyl group for 
its trip around the c-ring. Counterclockwise ring 
rotation then brings another protonated Asp°! 
to the a-subunit, where an exiting proton is 
transferred to the outlet half-channel. See this 
figure animated atwww.cengage.com/login. 
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Mitochondrial 
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CENGAGENOW” ANIMATED FIGURE 20.26 The reconsti- 
tuted vesicles containing ATP synthase and bacterio- 
rhodopsin used by Stoeckenius and Racker to confirm 
the Mitchell chemiosmotic hypothesis. See this figure 
animated at www.cengage.com/login. 


The molecular details of this process are shown in Figure 20.25. Each c-subunit 
in the c-ring has an inner helix and an outer helix. Asp®! is located midway along 
the outer a-helix. When protonated, the Asp carboxyl faces into the adjacent sub- 
unit. Rotation of the entire outer a-helix exposes Asp® to the outside when it is de- 
protonated. Arg?!°, located midway on a transmembrane helix of the a-subunit, 
forms hydrogen bonds with Asp®! residues on two adjacent c-subunits. The half 
channels of the a-subunit extend up and down from Arg?!°. The inlet channel ter- 
minates in Asn?!4, whereas the outlet channel terminates at Ser20°, 

The structure of the c-subunit complex is exquisitely suited for proton transport. 
When a proton enters the a-subunit inlet channel and is transferred from a-subunit 
Asn?! to c-subunit Asp®, the a-helix of that c-subunit rotates clockwise to bury the Asp 
carboxyl group (Figure 20.25c). Each Asp®! remains protonated once it leaves the a-sub- 
unit interface, because the hydrophobic environment of the membrane interior makes 
deprotonation (and charge formation) highly unfavorable. However, when a protonated 
Asp residue approaches the a-subunit outlet channel, the proton is transferred to Ser? 
and exits through the outlet channel. The a-subunit Arg?" side chain orients adjacent 
Asp®! groups and promotes transfers of entering protons from a-subunit Asn?" to Asp®! 
and transfers of exiting protons from Asp®! to a-subunit Ser, Arg?!®, because it is pro- 
tonated, also prevents direct proton transfer from Asn?!* to Ser, which would circum- 
vent ring rotation and motor function. 

ATP synthesis occurs in concert with rotation of the c-ring, because the y-subunit 
is anchored to the rotating c-ring and rotates with it. Rotation causes the y-subunit 
to turn relative to the three 6-subunit nucleotide sites of F;, changing the confor- 
mation of each in sequence, so ADP is first bound, then phosphorylated, then re- 
leased, according to Boyer’s binding change mechanism. 


Racker and Stoeckenius Confirmed the Mitchell Model 
in a Reconstitution Experiment 


When Mitchell first described his chemiosmotic hypothesis in 1961, little evidence 
existed to support it and it was met with considerable skepticism by the scientific 
community. Eventually, however, considerable evidence accumulated to support this 
model. It is now clear that the electron-transport chain generates a proton gradient, 
and careful measurements have shown that ATP is synthesized when a pH gradient 
is applied to mitochondria that cannot carry out electron transport. Even more rel- 
evant is a simple but crucial experiment reported in 1974 by Efraim Racker and 
Walther Stoeckenius, which provided specific confirmation of the Mitchell hypothe- 
sis. In this experiment, the bovine mitochondrial ATP synthase was reconstituted in 
simple lipid vesicles with bacteriorhodopsin, a light-driven proton pump from 
Halobacterium halobium. As shown in Figure 20.26, upon illumination, bacterio- 
rhodopsin pumped protons into these vesicles, and the resulting proton gradient was 
sufficient to drive ATP synthesis by the ATP synthase. Because the only two kinds of 
proteins present were one that produced a proton gradient and one that used such 
a gradient to make ATP, this experiment essentially verified Mitchell’s chemiosmotic 
hypothesis. 


Inhibitors of Oxidative Phosphorylation Reveal Insights 
About the Mechanism 


Many details of electron transport and oxidative phosphorylation mechanisms have 
been gained from studying the effects of particular electron transport and oxidative 
phosphorylation inhibitors (Figure 20.27). The sites of inhibition by these agents 
are indicated in Figure 20.28. 


Inhibitors of Complexes I, Il, and III Block Electron Transport Rotenone is a com- 
mon insecticide that strongly inhibits the NADH—-UQ reductase. Rotenone is ob- 
tained from the roots of several species of plants. Natives in certain parts of the 
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FIGURE 20.27 The structures of several inhibitors of electron transport and oxidative phosphorylation. 


world have made a practice of beating the roots of trees along riverbanks to release 
rotenone into the water, where it paralyzes fish and makes them easy prey. Amytal and 
other barbiturates and the widely prescribed painkiller Demerol also inhibit Complex 
I. All these substances appear to inhibit reduction of coenzyme Q and the oxidation 
of the Fe-S clusters of NADH-UQ reductase. 


Cyanide, Azide, and Carbon Monoxide Inhibit Complex IV Complex IV, the cyto- 
chrome c oxidase, is specifically inhibited by cyanide, azide, and carbon monoxide 
(Figure 20.28). Cyanide and azide bind tightly to the ferric form of cytochrome as, 
whereas carbon monoxide binds only to the ferrous form. The inhibitory actions of 
cyanide and azide at this site are very potent, whereas the principal toxicity of car- 
bon monoxide arises from its affinity for the iron of hemoglobin. Herein lies an im- 
portant distinction between the poisonous effects of cyanide and carbon monoxide. 
Because animals (including humans) carry many, many hemoglobin molecules, 
they must inhale a large quantity of carbon monoxide to die from it. These same 
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FIGURE 20.28 The sites of action of several inhibitors of 
electron transport and/or oxidative phosphorylation. 
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FIGURE 20.29 Structures of several uncouplers, mole- 
cules that dissipate the proton gradient across the inner 
mitochondrial membrane and thereby destroy the tight 
coupling between electron transport and the ATP syn- 
thase reaction. 





organisms, however, possess comparatively few molecules of cytochrome as. Conse- 
quently, a limited exposure to cyanide can be lethal. The sudden action of cyanide 
attests to the organism’s constant and immediate need for the energy supplied by 
electron transport. 


Oligomycin Is an ATP Synthase Inhibitor Inhibitors of ATP synthase include 
oligomycin. Oligomycin is a polyketide antibiotic that acts directly on ATP synthase 
by binding to the OSCP subunit of Fo. Oligomycin also blocks the movement of pro- 
tons through Fo. 


Uncouplers Disrupt the Coupling of Electron Transport 
and ATP Synthase 


Another important class of reagents affects ATP synthesis, but in a manner that does 
not involve direct binding to any of the proteins of the electron-transport chain or the 
F\F,-ATPase. These agents are known as uncouplers because they disrupt the tight 
coupling between electron transport and the ATP synthase. Uncouplers act by dissi- 
pating the proton gradient across the inner mitochondrial membrane created by the 
electron-transport system. Typical examples include 2,4-dinitrophenol, dicumarol, 
and carbonyl cyanide-p-trifluoro-methoxyphenyl hydrazone (perhaps better known as 
fluorocarbonyl cyanide phenylhydrazone, or FCCP) (Figure 20.29). These com- 
pounds share two common features: hydrophobic character and a dissociable proton. 
As uncouplers, they function by carrying protons across the inner membrane. Their 
tendency is to acquire protons on the cytosolic surface of the membrane (where the 
proton concentration is high) and carry them to the matrix side, thereby destroying 
the proton gradient that couples electron transport and the ATP synthase. In mito- 
chondria treated with uncouplers, electron transport continues and protons are 
driven out through the inner membrane. However, they leak back in so rapidly via the 
uncouplers that ATP synthesis does not occur. Instead, the energy released in electron 
transport is dissipated as heat. 


ATP—ADP Translocase Mediates the Movement of ATP and ADP 
Across the Mitochondrial Membrane 


ATP, the cellular energy currency, must exit the mitochondria to carry energy 
throughout the cell, and ADP must be brought into the mitochondria for repro- 
cessing. Neither of these processes occurs spontaneously because the highly 
charged ATP and ADP molecules do not readily cross biological membranes. 
Instead, these processes are mediated by a single transport system, the ATP-ADP 
translocase. This protein tightly couples the exit of ATP with the entry of ADP so 
that the mitochondrial nucleotide levels remain approximately constant. For each 
ATP transported out, one ADP is transported into the matrix. The translocase, 
which accounts for approximately 14% of the total mitochondrial membrane pro- 
tein, isa homodimer of 30-kD subunits. The structure of the bovine translocase con- 
sists of six transmembrane a-helices. The helices are all tilted with respect to the 
membrane, and the first, third, and fifth helices are bent or kinked at proline 
residues in the middle of the membrane (Figure 20.30). Transport occurs via a sin- 
gle nucleotide-binding site, which alternately faces the matrix and the intermem- 
brane space. It binds ATP on the matrix side, reorients to face the outside, and ex- 
changes ATP for ADP, with subsequent rearrangement to face the matrix side of the 
inner membrane. 


Outward Movement of ATP Is Favored over Outward ADP Movement The charge 
on ATP at pH 7.2 or so is about —4, and the charge on ADP at the same pH is 
about —3. Thus, net exchange of an ATP (out) for an ADP (in) results in the net 
movement of one negative charge from the matrix to the cytosol. (This process is 





HUMAN BIOCHEMISTRY 
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Endogenous Uncouplers Enable Organisms to Generate Heat 


Certain cold-adapted animals, hibernating animals, and newborn 
animals generate large amounts of heat by uncoupling oxidative 
phosphorylation. These organisms have a type of fat known as 
brown adipose tissue, so called for the color imparted by the many 
mitochondria this adipose tissue contains. The inner membrane 
of brown adipose tissue mitochondria contains large amounts 
of an endogenous protein called thermogenin (literally, “heat 
maker”) or uncoupling protein 1 (UCP1). UCP1 creates a passive 
proton channel through which protons flow from the cytosol to 
the matrix. Mice that lack UCP1 cannot maintain their body tem- 
perature in cold conditions, whereas normal animals produce 
larger amounts of UCP1 when they are cold-adapted. Two other 
mitochondrial proteins, designated UCP2 and UCP3, have se- 
quences similar to UCP1. 

Because the function of UCP1 is so closely linked to energy uti- 
lization, there has been great interest in the possible roles of 
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UCP1, UCP2, and UCP3 as metabolic regulators and as factors in 
obesity. Under fasting conditions, expression of UCP1 mRNA is 
decreased, but expression of UCP2 and UCP3 is increased. There 
is no indication, however, that UCP2 and UCP3 actually function 
as uncouplers. There has also been interest in the possible roles 
of UCP2 and UCP3 in the development of obesity, especially be- 
cause the genes for these proteins lie on chromosome 7 of the 
mouse, close to other genes linked to obesity. 

Certain plants use the heat of uncoupled proton transport for a 
special purpose. Skunk cabbage and related plants contain floral 
spikes that are maintained as much as 20° above ambient tempera- 
ture in this way. The warmth of the spikes serves to vaporize odif- 
erous molecules, which attract insects that fertilize the flowers. Red 
tomatoes have very small mitochondrial membrane proton gradi- 
ents compared with green tomatoes—evidence that uncouplers 
are more active in red tomatoes. 
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equivalent to the movement of a proton from the cytosol to the matrix.) Recall that 
the inner membrane is positive outside (see Figure 20.20), and it becomes clear that 
outward movement of ATP is favored over outward ADP transport, ensuring that 
ATP will be transported out (Figure 20.30). Inward movement of ADP is favored over 
inward movement of ATP for the same reason. Thus, the membrane electrochemi- 


(a) Intermembrane space (b) 


Matrix 








FIGURE 20.30 (a) The bovine ATP-ADP translocase (pdb id = 2C3E). (b) Outward transport of ATP (via the 
ATP-ADP translocase) is favored by the membrane electrochemical potential. 
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cal potential itself controls the specificity of the ATP-ADP translocase. However, the 
electrochemical potential is diminished by the ATP-ADP translocase cycle and there- 
fore operates with an energy cost to the cell. The cell must compensate by passing 
yet more electrons down the electron-transport chain. 

What is the cost of ATP-ADP exchange relative to the energy cost of ATP synthe- 
sis itself? We already noted that moving one ATP out and one ADP in is the equiva- 
lent of one proton moving from the cytosol to the matrix. Synthesis of an ATP results 
from the movement of approximately three protons from the cytosol into the matrix 
through Fo. Altogether this means that approximately four protons are transported 
into the matrix per ATP synthesized. Thus, approximately one-fourth of the energy 
derived from the respiratory chain (electron transport and oxidative phosphoryla- 
tion) is expended as the electrochemical energy devoted to mitochondrial ATP-ADP 
transport. 


20:6: What Is the P/O Ratio for Mitochondrial Oxidative 
Phosphorylation? 


The P/O ratio is the number of molecules of ATP formed in oxidative phosphor- 
ylation per two electrons flowing through a defined segment of the electron- 
transport chain. In spite of intense study of this ratio, its actual value remains a matter 
of contention. 

The P/O ratio depends on the ratio of H* transported out of the matrix per 2 e 
passed from NADH to O» in the electron-transport chain and on the number of H~ 
that pass through the ATP synthase to synthesize an ATP. The latter number depends 
on the number of c-subunits in the Fo ring of the synthase. As noted in Table 20.3, the 
number of c-subunits in the ATP synthase ranges from 10 to 15, depending on the or- 
ganism. This would correspond to ratios of H* consumed per ATP from about 3 to 5, 
respectively, since each rotation of the ATP synthase rotor drives the formation of 
three ATP. Adding one H* for the action of the ATP-ADP translocase raises these val- 
ues to about 4 and 6, respectively. 

If we accept the value of 10 H* transported out of the matrix per 2 & passed from 
NADH to Oy, through the electron-transport chain, and agree that 4 H* are trans- 
ported into the matrix per ATP synthesized (and translocated), then the mitochon- 
drial P/O ratio is 10/4, or 2.5, for the case of electrons entering the electron- 
transport chain as NADH. This is somewhat lower than earlier estimates, which placed 
the P/O ratio at 3 for mitochondrial oxidation of NADH. For the portion of the chain 
from succinate to Os, the H*/2¢ ratio is 6 (as noted previously), and the P/O ratio in 
this case would be 6/4, or 1.5; earlier estimates placed this number at 2. The consen- 
sus of more recent experimental measurements of P/O ratios for these two cases has 
been closer to the values of 2.5 and 1.5. Many chemists and biochemists, accustomed 
to the integral stoichiometries of chemical and metabolic reactions, were once reluc- 
tant to accept the notion of nonintegral P/O ratios. At some point, as we learn more 
about these complex coupled processes, it may be necessary to reassess the numbers. 


2027/5 How Are the Electrons of Cytosolic NADH Fed 
into Electron Transport? 


Most of the NADH used in electron transport is produced in the mitochondrial 
matrix, an appropriate site because NADH is oxidized by Complex I on the ma- 
trix side of the inner membrane. Furthermore, the inner mitochondrial mem- 
brane is impermeable to NADH. Recall, however, that NADH is produced in 
glycolysis by glyceraldehyde-3-P dehydrogenase in the cytosol. If this NADH 
were not oxidized to regenerate NADY, the glycolytic pathway would cease to 
function due to NAD* limitation. Eukaryotic cells have a number of shuttle 
systems that harvest the electrons of cytosolic NADH for delivery to mitochon- 
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dria without actually transporting NADH across the inner membrane (Figures 
20.31 and 20.32). 


The Glycerophosphate Shuttle Ensures Efficient Use of Cytosolic NADH 


In the glycerophosphate shuttle, two different glycerophosphate dehydroge- 
nases, one in the cytosol and one on the outer face of the mitochondrial inner 
membrane, work together to carry electrons into the mitochondrial matrix (see 
Figure 20.31). NADH produced in the cytosol transfers its electrons to 
dihydroxyacetone phosphate, thus reducing it to glycerol-3-phosphate. This 
metabolite is reoxidized by the FAD*-dependent mitochondrial membrane en- 
zyme to reform dihydroxyacetone phosphate and enzyme-bound FADHo. The 
two electrons of [FADHg] are passed directly to UQ, forming UQHs. Thus, via 
this shuttle, cytosolic NADH can be used to produce mitochondrial [FADHo] 
and, subsequently, UQHb. As a result, cytosolic NADH oxidized via this shuttle 
route yields only 1.5 molecules of ATP. The cell “pays” with a potential ATP mol- 
ecule for the convenience of getting cytosolic NADH into the mitochondria. Al- 
though this may seem wasteful, there is an important payoff. The glycerophos- 
phate shuttle is essentially irreversible, and even when NADH levels are very low 
relative to NAD*, the cycle operates effectively. 


The Malate—Aspartate Shuttle Is Reversible 


The second electron shuttle system, called the malate—aspartate shuttle, is shown 
in Figure 20.32. Oxaloacetate is reduced in the cytosol, acquiring the electrons of 
NADH (which is oxidized to NAD*). Malate is transported across the inner mem- 
brane, where it is reoxidized by malate dehydrogenase, converting NAD* to NADH 
in the matrix. This mitochondrial NADH readily enters the electron-transport 
chain. The oxaloacetate produced in this reaction cannot cross the inner mem- 
brane and must be transaminated to form aspartate, which can be transported 
across the membrane to the cytosolic side. Transamination in the cytosol recycles 
aspartate back to oxaloacetate. In contrast to the glycerol phosphate shuttle, the 
malate—aspartate cycle is reversible, and it operates as shown in Figure 20.32 only 
if the NADH/NAD* ratio in the cytosol is higher than the ratio in the matrix. Be- 
cause this shuttle produces NADH in the matrix, the full 2.5 ATPs per NADH are 
recovered. 
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FIGURE 20.31 The glycerophosphate shuttle (also 
known as the glycerol phosphate shuttle) couples the 
cytosolic oxidation of NADH with mitochondrial reduc- 
tion of [FAD]. 
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FIGURE 20.32 The malate (oxaloacetate)—aspartate 
shuttle, which operates across the inner mitochondrial 
membrane. 


The Net Yield of ATP from Glucose Oxidation Depends 
on the Shuttle Used 


The complete route for the conversion of the metabolic energy of glucose to ATP 
has now been described in Chapters 18 through 20. Assuming appropriate P/O 
ratios, the number of ATP molecules produced by the complete oxidation of a 
molecule of glucose can be estimated. Keeping in mind that P/O ratios must be 
viewed as approximate, for all the reasons previously cited, we will assume the val- 
ues of 2.5 and 1.5 for the mitochondrial oxidation of NADH and succinate, re- 
spectively. In eukaryotic cells, the combined pathways of glycolysis, the TCA 
cycle, electron transport, and oxidative phosphorylation then yield a net of ap- 
proximately 30 to 32 molecules of ATP per molecule of glucose oxidized, de- 
pending on the shuttle route used (Table 20.4). 

The net stoichiometric equation for the oxidation of glucose, using the glycerol 
phosphate shuttle, is 


Glucose + 6 Oo + ~30 ADP + ~30 P;—> 


6 CO2 + ~30 ATP + ~36 HO (20.30) 


Because the 2 NADH formed in glycolysis are “transported” by the glycerol phos- 
phate shuttle in this case, they each yield only 1.5 ATP, as already described. On the 


20.7 How Are the Electrons of Cytosolic NADH Fed into Electron Transport? 









TABLE 20.4 J 


ATP Yield 














per Glucose 
Glycerol- Malate- 
Phosphate Aspartate 
Pathway Shuttle Shuttle 
Glycolysis: glucose to pyruvate (cytosol) 
Phosphorylation of glucose =] =1 
Phosphorylation of fructose-6-phosphate =] =i 
Dephosphorylation of 2 molecules of 1,3-BPG +2 +2 
Dephosphorylation of 2 molecules of PEP +2 +2 
Oxidation of 2 molecules of glyceraldehyde-3- 
phosphate yields 2 NADH 
Pyruvate conversion to acetyl-CoA (mitochondria) 
2 NADH 
Citric acid cycle (mitochondria) 
2 molecules of GTP from 2 molecules +2 +2 


of succinyl-CoA 


Oxidation of 2 molecules each of isocitrate, 
a-ketoglutarate, and malate yields 6 NADH 


Oxidation of 2 molecules of succinate yields 2 [FADHg] 


Oxidative phosphorylation (mitochondria) 


2 NADH from glycolysis yield 1.5 ATPs each if NADH +3 +5 
is oxidized by glycerol-phosphate shuttle; 2.5 ATP 
by malate—aspartate shuttle 


Oxidative decarboxylation of 2 pyruvate to 2 acetyl-CoA: 








2 NADH produce 2.5 ATPs each +5 +5 

2 [FADH.] from each citric acid cycle produce +3 +3 
1.5 ATPs each 

6 NADH from citric acid cycle produce 2.5 ATPs each +15 +15 

Net Yield ~ 30 32 











Note: These P/O ratios of 2.5 and 1.5 for mitochondrial oxidation of NADH and [FADH,] are “consensus values.” Because 
they may not reflect actual values and because these ratios may change depending on metabolic conditions, these esti- 
mates of ATP yield from glucose oxidation are approximate. 


other hand, if these 2 NADH take part in the malate—aspartate shuttle, each yields 
2.5 ATP, giving a total (in this case) of 32 ATP formed per glucose oxidized. Most of 
the ATP—26 out of 30 or 28 out of 32—is produced by oxidative phosphorylation; 
only 4 ATP molecules result from direct synthesis during glycolysis and the TCA 
cycle. 

The situation in bacteria is somewhat different. Prokaryotic cells need not carry 
out ATP-ADP exchange. Thus, bacteria have the potential to produce approxi- 
mately 38 ATP per glucose. 


3.5 Billion Years of Evolution Have Resulted in a Very Efficient System 


Hypothetically speaking, how much energy does a eukaryotic cell extract from the 
glucose molecule? Taking a value of 50 kJ/mol for the hydrolysis of ATP under cel- 
lular conditions (see Chapter 3), the production of 32 ATPs per glucose oxidized 
yields 1600 kJ/mol of glucose. The cellular oxidation (combustion) of glucose 
yields AG = —2937 kJ/mol. We can calculate an efficiency for the pathways of gly- 
colysis, the TCA cycle, electron transport, and oxidative phosphorylation of 
1600/2937 X 100% = 54%. 
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ROS: Reactive oxygen species, such as oxygen 
ions, free radicals, and peroxides. 


FIGURE 20.33 (a) Cytochrome c is anchored at the inner 
mitochondrial membrane by association with cardio- 
lipin (diphosphatidylglycerol). The peroxidase activity of 
cytochrome c oxidizes a cardiolipin lipid chain, releasing 
cytochrome c from the membrane. (b) The opening of 
pores in the outer membrane, induced by a variety of 
triggering agents, releases cytochrome c to the cytosol, 
where it initiates the events of apoptosis. 


20:8: How Do Mitochondria Mediate Apoptosis? 


Mitochondria not only are the home of the TCA cycle and oxidative phosphoryla- 
tion but also are a crossroads for several cell signaling pathways. Mitochondria take 
up Ca?* ions released from the endoplasmic reticulum, thus helping control intra- 
cellular Ca?* signals. They produce reactive oxygen species (ROS) that play signal- 
ing roles in cells, although ROS can also cause cellular damage. Mitochondria also 
participate in the programmed death of cells, a process known as apoptosis (the sec- 
ond “p” is silent in this word). 

Apoptosis is a mechanism through which certain cells are eliminated from 
higher organisms. It is central to the development and homeostasis of multicellular 
organisms, and it is the route by which unwanted or harmful cells are eliminated. 
Under normal circumstances, apoptosis is suppressed through compartmentation 
of the involved activators and enzymes. Mitochondria play a major role in this sub- 
cellular partitioning of the apoptotic activator molecules. One such activator is cyto- 
chrome c, which normally resides in the intermembrane space, bound tightly to a 
lipid chain of cardiolipin in the membrane (Figure 20.33). A variety of triggering 
agents, including Ca?*, ROS, certain lipid molecules, and certain protein kinases, 
can induce the opening of pores in the mitochondrial membrane. For example, us- 
ing hydrogen peroxide as a substrate, cytochrome c can oxidize its bound cardio- 
lipin chain, releasing itself from the membrane. When the outer membrane is made 
permeable by other apoptotic signals, cytochrome c can enter the cytosol. 

Permeabilization events, which occur at points where outer and inner mito- 
chondrial membranes are in contact, involve association of the ATP—ADP translo- 
case in the inner membrane and the voltage-dependent anion channel (VDAC) in 
the outer membrane. This interaction leads to the opening of protein-permeable 
pores. Cytochrome c, as well as several other proteins, can pass through these pores. 
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Pore formation is carefully regulated by the BCL-2 family of proteins, which in- 
cludes both proapoptotic members (proteins known as Bax, Bid, and Bad) and 
antiapoptotic members (BCL-2 itself, as well as BCL-X; and BCL-W). 


Cytochrome c Triggers Apoptosome Assembly 


But how is the release of cytochrome c translated into the activation of the death 
cascade, a point of no return for the cell? The answer lies in the assembly, in the cy- 
tosol, of a signaling platform called the apoptosome (Figure 20.34). The function 
of the apoptosome is to activate a cascade of proteases called caspases. (Here, “c” is 
for cysteine and “asp” is for aspartic acid. Caspases have Cys at the active site and 
cleave their peptide substrates after Asp residues.) The apoptosome is a wheel- 
shaped, heptameric platform that looks like (and in some ways behaves like) an 
earth-orbiting space station. It is assembled from seven subunits of the apoptotic 
protease-activating factor 1 (Apaf-1), a multidomain protein. Apaf-1 contains an 
ATPase domain (which prefers dATP over ATP in some organisms), a caspase- 
recruitment domain (CARD), and a WD40 repeat domain. Normally (before the 
death-signaling cytochrome c is released from mitochondria), these three domains 
are folded against each other (Figure 20.34b), with dATP tightly bound, and Apaf- 
1 is “locked” in an inactive monomeric state. Binding of cytochrome c to the WD40 
domain, followed by dATP hydrolysis, converts Apaf-1 to an extended conforma- 
tion. Then, exchange of dADP for a new molecule of dATP prompts assembly of the 
heptameric platform (Figure 20.34), which goes on to activate the death-dealing 
caspase cascade. 
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FIGURE 20.34 (a) Apaf-1 is a multidomain protein, consisting of an N-terminal CARD, a nucleotide-binding 
and oligomerization domain (NOD), and several WD40 domains. (b) Binding of cytochrome c to the WD40 
domains and ATP hydrolysis unlocks Apaf-1 to form the semiopen conformation. Nucleotide exchange leads 
to oligomerization and apoptosome formation. (c) A model of the apoptosome, a wheellike structure with 
molecules of cytochrome c bound to the WD40 domains, which extend outward like spokes. 
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Mitochondria-mediated apoptosis is the mode of cell death for many neurons in 
the brain during strokes and other brain-trauma injuries. When a stroke occurs, the 
neurons at the site of oxygen deprivation die within minutes by a nonspecific 
process of necrosis, but cells adjacent to the immediate site of injury die more slowly 
by apoptosis. These latter cells have been saved by a variety of therapeutic inter- 
ventions that suppress apoptosis in laboratory studies, raising the hope that strokes 
and other neurodegenerative conditions may someday be treated clinically in simi- 


lar ways. 


SUMMARY 


20.1 Where in the Cell Do Electron Transport and Oxidative Phospho- 
rylation Occur? The processes of electron transport and oxidative 
phosphorylation are membrane associated. In prokaryotes, the conver- 
sion of energy from NADH and [FADHb] to the energy of ATP via elec- 
tron transport and oxidative phosphorylation is carried out at (and 
across) the plasma membrane. In eukaryotic cells, electron transport 
and oxidative phosphorylation are localized in mitochondria. Mito- 
chondria are surrounded by a simple outer membrane and a more com- 
plex inner membrane. The space between the inner and outer mem- 
branes is referred to as the intermembrane space. 


20.2 What Are Reduction Potentials, and How Are They Used to Ac- 
count for Free Energy Changes in Redox Reactions? Just as the group 
transfer potential is used to quantitate the energy of phosphoryl transfer, 
the standard reduction potential, denoted by @,’, quantitates the ten- 
dency of chemical species to be reduced or oxidized. Standard reduction 
potentials are determined by measuring the voltages generated in reac- 
tion halfcells. A half-cell consists of a solution containing 1 M concen- 
trations of both the oxidized and reduced forms of the substance whose 
reduction potential is being measured and a simple electrode. 


20.3 How Is the Electron-Transport Chain Organized? The com- 
ponents of the electron-transport chain can be purified from the mito- 
chondrial inner membrane as four distinct protein complexes: 
(I) NADH-coenzyme Q reductase, (II) succinate-coenzyme Q reduc- 
tase, (III) coenzyme Q-cytochrome c reductase, and (IV) cytochrome c 
oxidase. In complexes I, II, and IV, electron transfer drives the move- 
ment of protons from the mitochondrial matrix to the intermembrane 
space. 

Complex I (NADH dehydrogenase) involves more than 45 polypep- 
tide chains, 1 molecule of flavin mononucleotide (FMN), and as many 
as nine Fe-S clusters, together containing a total of 20 to 26 iron atoms. 
The complex transfers electrons from NADH to FMN, then to a series 
of FeS proteins, and finally to coenzyme Q. 

Complex II (succinate dehydrogenase) oxidizes succinate to fumarate, 
with concomitant reduction of bound FAD to FADH. This FADH; trans- 
fers its electrons immediately to Fe-S centers, which pass them on to UQ, 
Electrons flow from succinate to UQ. 

Complex III drives electron transport from coenzyme Q to cytochrome 
c via a unique redox pathway known as the Q cycle. UQ—cytochrome 
c reductase (UQ-cyt c reductase), as this complex is known, involves three 
different cytochromes and an Fe-S protein. In the cytochromes of these 
and similar complexes, the iron atom at the center of the porphyrin ring 
cycles between the reduced Fe?* (ferrous) and oxidized Fe** (ferric) 
states. 

Complex IV transfers electrons from cytochrome c to reduce oxygen 
on the matrix side. Complex IV (cytochrome c oxidase) accepts elec- 
trons from cytochrome c and directs them to the four-electron reduc- 
tion of Os to form 2H2O via Cu, sites, the heme iron of cytochrome a, 
Cug, and the heme iron of as. 


20.4 What Are the Thermodynamic Implications of Chemiosmotic 
Coupling? Peter Mitchell was the first to propose that electron transport 
leads to formation of a proton gradient that drives ATP synthesis. The 
free energy difference for protons across the inner mitochondrial mem- 


brane includes a term for the concentration difference and a term for the 
electrical potential. It is this energy that drives the synthesis of ATP, in ac- 
cord with Mitchell’s model. 


20.5 How Does a Proton Gradient Drive the Synthesis of ATP? The mi- 
tochondrial complex that carries out ATP synthesis is ATP synthase 
(F)}Fo-ATPase). ATP synthase consists of two principal complexes, desig- 
nated F; and Fp. Protons taken up from the cytosol by one of the proton 
access channels in the a-subunit of Fy ride the rotor of c-subunits until 
they reach the other proton access channel on a, from which they are re- 
leased into the matrix. Such rotation causes the y-subunit of F; to turn 
relative to the three 6-subunit nucleotide sites of F;, changing the con- 
formation of each in sequence, so ADP is first bound, then phosphory- 
lated, then released, according to Boyer’s binding change mechanism. 

The inhibitors of oxidative phosphorylation include rotenone, a 
common insecticide that strongly inhibits the NADH-UQ reductase. 
Complex IV is specifically inhibited by cyanide (CN7), azide (N37), 
and carbon monoxide (CO). Cyanide and azide bind tightly to the 
ferric form of cytochrome as, whereas carbon monoxide binds only to 
the ferrous form. 

Uncouplers disrupt the coupling of electron transport and ATP syn- 
thase. Uncouplers share two common features: hydrophobic character 
and a dissociable proton. They function by carrying protons across the 
inner membrane, acquiring protons on the outer surface of the mem- 
brane (where the proton concentration is high) and carrying them to 
the matrix side. Uncouplers destroy the proton gradient that couples 
electron transport and the ATP synthase. 

ATP-ADP translocase mediates the movement of ATP and ADP across 
the mitochondrial membrane. The ATP-ADP translocase is an inner 
membrane protein that tightly couples the exit of ATP with the entry of 
ADP so that the mitochondrial nucleotide levels remain approximately 
constant. For each ATP transported out, one ADP is transported into the 
matrix. ATP—ADP translocase binds ATP on the matrix side, reorients to 
face the intermembrane space, and exchanges ATP for ADP, with subse- 
quent reorientation back to the matrix face of the inner membrane. 


20.6 What Is the P/O Ratio for Mitochondrial Oxidative Phosphoryla- 
tion? The P/O ratio is the number of molecules of ATP formed in ox- 
idative phosphorylation per two electrons flowing through a defined 
segment of the electron-transport chain. The consensus value for the 
mitochondrial P/O ratio is 10/4, or 2.5, for electrons entering the elec- 
tron-transport chain as NADH. For succinate to Os, the P/O ratio in 
this case would be 6/4, or 1.5. 


20.7 How Are the Electrons of Cytosolic NADH Fed into Electron Trans- 
port? Eukaryotic cells have a number of shuttle systems that collect the 
electrons of cytosolic NADH for delivery to mitochondria without actu- 
ally transporting NADH across the inner membrane. In the glyc- 
erophosphate shuttle, two different glycerophosphate dehydrogenases, 
one in the cytosol and one on the outer face of the mitochondrial inner 
membrane, work together to carry electrons into the mitochondrial ma- 
trix. In the malate—aspartate shuttle, oxaloacetate is reduced in the cy- 
tosol, acquiring the electrons of NADH (which is oxidized to NAD*). 
Malate is transported across the inner membrane, where it is reoxidized 
by malate dehydrogenase, converting NAD* to NADH in the matrix. 


20.8 How Do Mitochondria Mediate Apoptosis? Mitochondria are a 
crossroads for several cell signaling pathways. Mitochondria take up 
Ca?* ions released from the endoplasmic reticulum, helping control in- 
tracellular Ca?* signals. They produce ROS that play signaling roles in 
cells. They also participate in apoptosis, the programmed death of cells. 
Triggering agents, including Ca**, ROS, and certain lipid molecules 
and protein kinases, can induce the opening of pores in the mitochon- 
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drial membrane, releasing cytochrome c, which then binds to the WD40 
domain of Apaf-1, activating formation of the heptameric apoptosome 
platform. Mitochondria-mediated apoptosis is the mode of cell death of 
many neurons in the brain during strokes and other brain-trauma in- 
juries, and interventions that suppress apoptosis may eventually be use- 
ful in clinical settings. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 


1. For the following reaction, 
[FAD] + 2 cyt c (Fe?*) + 2 H*——> [FADHbp] + 2 cyt c (Fe**) 


determine which of the redox couples is the electron acceptor and 
which is the electron donor under standard-state conditions, calcu- 
late the value of A@,,’, and determine the free energy change for the 
reaction. 


2. Calculate the value of AC,’ for the glyceraldehyde-3-phosphate 
dehydrogenase reaction, and calculate the free energy change for 
the reaction under standard-state conditions. 


3. For the following redox reaction, 
NAD* + 2 H* + 2 €—>NADH + H+ 


suggest an equation (analogous to Equation 20.12) that predicts 
the pH dependence of this reaction, and calculate the reduction 
potential for this reaction at pH 8. 


4. Sodium nitrite (NaNOz) is used by emergency medical personnel as 
an antidote for cyanide poisoning (for this purpose, it must be 
administered immediately). Based on the discussion of cyanide 
poisoning in Section 20.5, suggest a mechanism for the lifesaving ef- 
fect of sodium nitrite. 


5. A wealthy investor has come to you for advice. She has been ap- 
proached by a biochemist who seeks financial backing for a com- 
pany that would market dinitrophenol and dicumarol as weight-loss 
medications. The biochemist has explained to her that these agents 
are uncouplers and that they would dissipate metabolic energy as 
heat. The investor wants to know if you think she should invest in 
the biochemist’s company. How do you respond? 


6. Assuming that 3 H* are transported per ATP synthesized in the mi- 
tochondrial matrix, the membrane potential difference is 0.18 V 
(negative inside), and the pH difference is 1 unit (acid outside, 
basic inside), calculate the largest ratio of [ATP]/ [ADP] [P;] under 
which synthesis of ATP can occur. 


7. Of the dehydrogenase reactions in glycolysis and the TCA cycle, all but 
one use NAD* as the electron acceptor. The lone exception is the suc- 
cinate dehydrogenase reaction, which uses covalently bound FAD of a 
flavoprotein as the electron acceptor. The standard reduction poten- 
tial for this bound FAD is in the range of 0.003 to 0.091 V (see Table 
20.1). Compared with the other dehydrogenase reactions of glycolysis 
and the TCA cycle, what is unique about succinate dehydrogenase? 
Why is bound FAD a more suitable electron acceptor in this case? 


8. a. Whatis the standard free energy change (A G°’) for the reduction 
of coenzyme Q by NADH as carried out by Complex I 
(NADH-coenzyme Q reductase) of the electron-transport path- 
way if €,’ (NAD*/NADH) = —0.320 V and é,’ (CoQ/CoQHs) = 
+0.060 V. 

b. What is the equilibrium constant (Keq) for this reaction? 

c. Assume that (1) the actual free energy release accompanying 
the NADH-coenzyme Q reductase reaction is equal to the 
amount released under standard conditions (as calculated in 
part a), (2) this energy can be converted into the synthesis of 
ATP with an efficiency = 0.75 (that is, 75% of the energy re- 


leased upon NADH oxidation is captured in ATP synthesis), and 
(3) the oxidation of 1 equivalent of NADH by coenzyme Q leads 
to the phosphorylation of 1 equivalent of ATP. 
Under these conditions, what is the maximum ratio of [ATP]/ 
[ADP] attainable by oxidative phosphorylation when [P;] = 1 mM? 
(Assume AG°’ for ATP synthesis = +30.5 kJ/mol.) 
9. Consider the oxidation of succinate by molecular oxygen as carried 
out via the electron-transport pathway 


Succinate + 30.—> fumarate + HO 


a. What is the standard free energy change (A G°’) for this reaction if 
E (Fum/Succ) = +0.031 V and €,' (;02/H,O) = +0.816 V. 

b. What is the equilibrium constant (Keq) for this reaction? 

c. Assume that (1) the actual free energy release accompanying 
succinate oxidation by the electron-transport pathway is equal to 
the amount released under standard conditions (as calculated in 
part a), (2) this energy can be converted into the synthesis of ATP 
with an efficiency = 0.7 (that is, 70% of the energy released upon 
succinate oxidation is captured in ATP synthesis), and (3) the oxi- 
dation of 1 succinate leads to the phosphorylation of 2 equivalents 
of ATP. 

Under these conditions, what is the maximum ratio of [ATP]/ 

[ADP] attainable by oxidative phosphorylation when [P;] = 1 mM? 

(Assume AG°’ for ATP synthesis = +30.5 kJ/mol.) 

10. Consider the oxidation of NADH by molecular oxygen as carried 
out via the electron-transport pathway 
NADH + H+ + 50,.—>NAD* + H,O 

a. What is the standard free energy change (AG°’) for this reac- 
tion if é’ (NAD*/NADH) = —0.320 V and ©,” (Oo/HsO) = 
+0.816 V. 

b. What is the equilibrium constant (Keq) for this reaction? 

c. Assume that (1) the actual free energy release accompanying 
NADH oxidation by the electron-transport pathway is equal to the 
amount released under standard conditions (as calculated in 
part a), (2) this energy can be converted into the synthesis of ATP 
with an efficiency = 0.75 (that is, 75% of the energy released upon 
NADH oxidation is captured in ATP synthesis), and (3) the oxi- 
dation of 1 NADH leads to the phosphorylation of 3 equivalents 
of ATP. 


Under these conditions, what is the maximum ratio of [ATP]/ 
[ADP] attainable by oxidative phosphorylation when [P;] = 2 mM? 
(Assume AG°’ for ATP synthesis = +30.5 kJ/mol.) 

11. Write a balanced equation for the reduction of molecular oxygen by 
reduced cytochrome c as carried out by Complex IV (cytochrome 
oxidase) of the electron-transport pathway. 

a. What is the standard free energy change (A G°’) for this reaction if 


AZ,’ cyt c(Fe**) /cyt c(Fe**) = +0.254 volts and 
Eo’ (O2/H2O) = 0.816 volts 
b. What is the equilibrium constant (Keq) for this reaction? 
c. Assume that (1) the actual free energy release accompanying 
cytochrome c oxidation by the electron-transport pathway is equal 
to the amount released under standard conditions (as calculated 
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in part a), (2) this energy can be converted into the synthesis of 
ATP with an efficiency = 0.6 (that is, 60% of the energy released 
upon cytochrome c oxidation is captured in ATP synthesis), and 
(3) the reduction of 1 molecule of Og by reduced cytochrome c 
leads to the phosphorylation of 2 equivalents of ATP. 
Under these conditions, what is the maximum ratio of [ATP]/ 
[ADP] attainable by oxidative phosphorylation when [P;] = 3 mM? 
(Assume AG°’ for ATP synthesis = +30.5 kJ/mol.) 

12. The standard reduction potential for (NAD*/NADH) is —0.320 V, 
and the standard reduction potential for (pyruvate/lactate) is 
—0.185 V. 

a. What is the standard free energy change (A G°’) for the lactate de- 
hydrogenase reaction: 


NADH + H+ + pyruvate—- lactate + NAD* 


b. What is the equilibrium constant (Keq) for this reaction? 

c. If [pyruvate] = 0.05 mM and [lactate] = 2.9 mM and AG for the 
lactate dehydrogenase reaction = —15 kJ/mol in erythrocytes, 
what is the [NAD*]/[NADH] ratio under these conditions? 

13. Assume that the free energy change (AG) associated with the move- 
ment of 1 mole of protons from the outside to the inside of a bac- 
terial cell is —23 kJ/mol and 3 H* must cross the bacterial plasma 
membrane per ATP formed by the bacterial FıFọ—-ATP synthase. 
ATP synthesis thus takes place by the coupled process: 


3 H*ou + ADP + P, 3 H+ + ATP + HO 


a. If the overall free energy change (A Gəveran) associated with ATP 
synthesis in these cells by the coupled process is —21 kJ/mol, what 
is the equilibrium constant (Keq) for the process? 

b. What is A Gynthesiss the free energy change for ATP synthesis, in 
these bacteria under these conditions? 

c. The standard free energy change for ATP hydrolysis (A G*’hydrotysis) 
is — 30.5 kJ/mol. If [P;] = 2 mMin these bacterial cells, what is the 
[ATP]/[ADP] ratio in these cells? 


14. Describe in your own words the path of electrons through the Q 
cycle of Complex III. 


15. Describe in your own words the path of electrons through the cop- 
per and iron centers of Complex IV. 


16. In the course of events triggering apoptosis, a fatty acid chain of car- 
diolipin undergoes peroxidation to release the associated cyto- 
chrome c. Lipid peroxidation occurs at a double bond. Suggest a 
mechanism for the reaction of hydrogen peroxide with an unsatura- 
tion in a lipid chain, and identify a likely product of the reaction. 


17. In problem 18 at the end of Chapter 19, you might have calculated 
the number of molecules of oxaloacetate in a typical mitochon- 
drion. What about protons? A typical mitochondrion can be 
thought of as a cylinder 1 wm in diameter and 2 um in length. If the 
pH in the matrix is 7.8, how many protons are contained in the 
mitochondrial matrix? 


18. Considering that all other dehydrogenases of glycolysis and the 
TCA cycle use NADH as the electron donor, why does succinate de- 
hydrogenase, a component of the TCA cycle and the electron trans- 
fer chain, use FAD as the electron acceptor from succinate, rather 
than NAD*? Note that there are two justifications for the choice of 
FAD here—one based on energetics and one based on the mecha- 
nism of electron transfer for FAD versus NAD*. 


Preparing for the MCAT Exam 


19. Based on your reading on the F)Fo-ATPase, what would you con- 
clude about the mechanism of ATP synthesis: 
a. The reaction proceeds by nucleophilic substitution via the S\y2 
mechanism. 
b. The reaction proceeds by nucleophilic substitution via the Syl 
mechanism. 
c. The reaction proceeds by electrophilic substitution via the El 
mechanism. 
d. The reaction proceeds by electrophilic substitution via the E2 
mechanism. 


20. Imagine that you are working with isolated mitochondria and you 
manage to double the ratio of protons outside to protons inside. In or- 
der to maintain the overall A Gat its original value (whatever it is), how 
would you have to change the mitochondria membrane potential? 
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Field of goldenrod. 





In a sun-flecked lane, 

Beside a path where cattle trod, 
Blown by wind and rain, 

Drawing substance from air and sod; 


In ruggedness, it stands aloof, 

The ragged grass and puerile leaves, 
Lending a hand to fill the woof 

In the pattern that beauty makes. 


What mystery this, hath been wrought; 
Beauty from sunshine, air, and sod! 
Could we thus gain the ends we sought- 
Tell us thy secret, Goldenrod. 


Rosa Staubus 
Oklahoma pioneer (1886-1966) 
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Photosynthesis 


ESSENTIAL QUESTIONS 


Photosynthesis is the primary source of energy for all life forms (except chemolitho- 
trophic prokaryotes). Much of the energy of photosynthesis is used to drive the syn- 
thesis of organic molecules from atmospheric CO3. 


How is solar energy captured and transformed into metabolically useful chemi- 
cal energy? How is the chemical energy produced by photosynthesis used to 
create organic molecules from carbon dioxide? 


The vast majority of energy consumed by living organisms stems from solar energy 
captured by the process of photosynthesis. Only chemolithotrophic prokaryotes 
are independent of this energy source. Of the 1.5 X 10? kJ of energy reaching the 
earth each day from the sun, 1% is absorbed by photosynthetic organisms and 
transduced into chemical energy.' This energy, in the form of biomolecules, be- 
comes available to other members of the biosphere through food chains. The 
transduction of solar, or light, energy into chemical energy is often expressed in 
terms of carbon dioxide fixation, in which hexose is formed from carbon dioxide 
and oxygen is evolved: 


Light 


6 CO, + 6 HO 2 C,H i905 + 6 Os (21.1) 


Estimates indicate that 10" tons of carbon dioxide are fixed globally per year, of 
which one-third is fixed in the oceans, primarily by photosynthetic marine micro- 
organisms. 

Although photosynthesis is traditionally equated with CO» fixation, light energy 
(or rather the chemical energy derived from it) drives all endergonic processes in 
phototrophic cells. The assimilation of inorganic forms of nitrogen and sulfur into 
organic molecules (see Chapter 25) represents two other metabolic conversions 
closely coupled to light energy in green plants. Our previous considerations of aer- 
obic metabolism (Chapters 18 through 20) treated cellular respiration (precisely 
the reverse of Equation 21.1) as the central energy-releasing process in life. It nec- 
essarily follows that the formation of hexose from carbon dioxide and water, the 
products of cellular respiration, must be endergonic. The necessary energy comes 
from light. Note that in the carbon dioxide fixation reaction described, light is used 
to drive a chemical reaction against its thermodynamic potential. 


2 \wls What Are the General Properties of Photosynthesis? 


Photosynthesis Occurs in Membranes 


Organisms capable of photosynthesis are very diverse, ranging from simple 
prokaryotic forms to the largest organisms of all, Sequoia gigantea, the giant red- 
wood trees of California. Despite this diversity, we find certain generalities regard- 
ing photosynthesis. An important one is that photosynthesis occurs in membranes. In 
photosynthetic prokaryotes, the photosynthetic membranes fill up the cell interior; 
in photosynthetic eukaryotes, the photosynthetic membranes are localized in large 
organelles known as chloroplasts (Figures 21.1 and 21.2). Chloroplasts are one 


'Of the remaining 99%, two-thirds is absorbed by the earth and oceans, thereby heating the planet; 
the remaining one-third is lost as light reflected back into space. 
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member in a family of related plant-specific organelles known as plastids. Chloro- 
plasts themselves show a range of diversity, from the single, spiral chloroplast that 
gives Spirogyra its name to the multitude of ellipsoidal plastids typical of higher 
plant cells (Figure 21.3). 

Characteristic of all chloroplasts, however, is the organization of the inner mem- 
brane system, the so-called thylakoid membrane. The thylakoid membrane is orga- 
nized into paired folds that extend throughout the organelle, as in Figure 21.2. 
These paired folds, or lamellae, give rise to flattened sacs or discs, thylakoid vesicles 
(from the Greek thylakos, meaning “sack”), which occur in stacks called grana. A sin- 
gle stack, or granum, may contain dozens of thylakoid vesicles, and different grana 
are joined by lamellae that run through the soluble portion, or stroma, of the or- 
ganelle. Chloroplasts thus possess three membrane-bound aqueous compartments: 
the intermembrane space, the stroma, and the interior of the thylakoid vesicles, the 
so-called thylakoid space (also known as the thylakoid lumen). As we shall see, this 
third compartment serves an important function in the transduction of light energy 
into ATP formation. The thylakoid membrane has a highly characteristic lipid com- 
position and, like the inner membrane of the mitochondrion, is impermeable to 


Inner membrane 


FIGURE 21.1 Electron micrograph of a representative 
chloroplast. 


FIGURE 21.2 Schematic diagram of an idealized 
chloroplast. 
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FIGURE 21.3 (a) Spirogyra—a freshwater green alga. (b) A higher plant cell. 


most ions and molecules. Chloroplasts, like their mitochondrial counterparts, pos- 
sess DNA, RNA, and ribosomes and consequently display a considerable amount of 
autonomy. However, many critical chloroplast components are encoded by nuclear 
genes, so autonomy is far from absolute. 


Photosynthesis Consists of Both Light Reactions and Dark Reactions 


If a chloroplast suspension is illuminated in the absence of carbon dioxide, oxygen 
is evolved. Furthermore, if the illuminated chloroplasts are now placed in the dark 
and supplied with COs, net hexose synthesis can be observed (Figure 21.4). Thus, 
the evolution of oxygen can be temporally separated from CO» fixation and also has 
a light dependency that CO, fixation lacks. The light reactions of photosynthesis, of 
which Og evolution is only one part, are associated with the thylakoid membranes. 
In contrast, the light-independent reactions, or so-called dark reactions, notably 
CO; fixation, are located in the stroma. A concise summary of the photosynthetic 
process is that radiant electromagnetic energy (light) is transformed by a specific 
photochemical system located in the thylakoids to yield chemical energy in the form 
of reducing potential (NADPH) and high-energy phosphate (ATP). NADPH and 
ATP can then be used to drive the endergonic process of hexose formation from 





Oy evolved CO, fixation 
into sugars 


CENGAGENOW” ANIMATED FIGURE 21.4 The light-dependent and light-independent reactions of photosyn- 
thesis. See this figure animated at www.cengage.com/login. 


21.2 How ls Solar Energy Captured by Chlorophyll? 


CO» by a series of enzymatic reactions found in the stroma (see Equation 21.3, 
which follows). 


Water Is the Ultimate e~ Donor for Photosynthetic NADP* Reduction 


In green plants, water serves as the ultimate electron donor for the photosynthetic 
generation of reducing equivalents. The reaction sequence 
2H,O + 2 NADP+ + xADP + xP,» 

Os + 2 NADPH + 2H* + xATP + xHO (21.2) 


describes the process, where nhv symbolizes light energy (n is some number of pho- 
tons of energy hv, where his Planck’s constant and v is the frequency of the light). 
Light energy is necessary to make the unfavorable reduction of NADP+ by HO 
(A. = —1.136 V; AG?’ = +219 kJ/mol NADP+) thermodynamically favorable. 
Thus, the light energy input, nhv, must exceed 219 kJ/mol NADP*. The stoi- 
chiometry of ATP formation depends on the pattern of photophosphorylation op- 
erating in the cell at the time and on the ATP yield in terms of the chemiosmotic 
ratio, ATP/H”*, as we will see later. Nevertheless, the stoichiometry of the metabolic 
pathway of CO» fixation is certain: 


12 NADPH + 12 H+ + 18 ATP + 6 CO. + 12 HO — 
CHO + 12 NADP+ + 18 ADP + 18 P; (21.3) 


A More Generalized Equation for Photosynthesis In 1931, comparative study of 
photosynthesis in bacteria led van Niel to a more general formulation of the over- 
all reaction: 


CO, + 2H,A = (CHO) + 2A + HO (21.4) 
Hydrogen Hydrogen Reduced Oxidized 
acceptor donor acceptor donor 


In photosynthetic bacteria, HA is variously HS (photosynthetic green and pur- 
ple sulfur bacteria), isopropanol, or some similar oxidizable substrate. [ (CHO) sym- 
bolizes a carbohydrate unit.] 


CO, + 2 HS —> (CH2O) + HO + 2 S 


O 
| 
COs + 2;CH;—CHOH—CHs —> (CHs0) + H2O + 2 CHs—C—CHs 


In cyanobacteria and the eukaryotic photosynthetic cells of algae and higher 
plants, HA is H2O, as implied earlier, and 2 A is Os. The accumulation of Ox to 
constitute 21% of the earth’s atmosphere is the direct result of eons of global oxy- 
genic photosynthesis. 


212 How Is Solar Energy Captured by Chlorophyll? 


Photosynthesis depends on the photoreactivity of chlorophyll. Chlorophylls are 
magnesium-containing substituted tetrapyrroles whose basic structure is reminis- 
cent of heme, the iron-containing porphyrin (see Chapters 5 and 20). Chloro- 
phylls differ from heme in a number of properties: Magnesium instead of iron is 
coordinated in the center of the planar conjugated ring structure; a long-chain 
alcohol, phytol, is esterified to a pyrrole ring substituent; and the methine bridge 
linking pyrroles HI and IV is substituted and crosslinked to ring II, leading to the 
formation of a fifth five-membered ring. The structures of chlorophyll a and b are 
shown in Figure 21.5a. 

Chlorophylls are excellent light absorbers because of their aromaticity. That is, 
they possess delocalized 7 electrons above and below the planar ring structure. The 
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FIGURE 21.5 Structures (a) and absorption spectra (b) of 
chlorophyll a and 6. The phytyl side chain of ring IV pro- 
vides a hydrophobic tail to anchor the chlorophyll in 
membrane protein complexes. 
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energy differences between electronic states in these 7 orbitals correspond to the 
energies of visible light photons. When light energy is absorbed, an electron is pro- 
moted to a higher orbital, enhancing the potential for transfer of this electron to 
a suitable acceptor. Loss of such a photoexcited electron to an acceptor is an 
oxidation—reduction reaction. The net result is the transduction of light energy into 
the chemical energy of a redox reaction. 


Chlorophylls and Accessory Light-Harvesting Pigments Absorb 


Light of Different Wavelengths 


The absorption spectra of chlorophylls aand b (Figure 21.5b) differ somewhat. Plants 
that possess both chlorophylls can harvest a wider spectrum of incident energy. Other 
pigments in photosynthetic organisms, so-called accessory light-harvesting pigments 
(Figure 21.6), increase the possibility for absorption of incident light of wavelengths 
not absorbed by the chlorophylls. Carotenoids and phycocyanobilins, like chloro- 
phyll, possess many conjugated double bonds and thus absorb visible light. 
Carotenoids have two primary roles in photosynthesis—light harvesting and photo- 
protection through destruction of reactive oxygen species that arise as by-products of 
photoexcitation. 


The Light Energy Absorbed by Photosynthetic Pigments 
Has Several Possible Fates 


Each photon represents a quantum of light energy. A quantum of light energy ab- 
sorbed by a photosynthetic pigment has four possible fates (Figure 21.7): 


1. Loss as heat. The energy can be dissipated as heat through redistribution into 
atomic vibrations within the pigment molecule. 
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FIGURE 21.6 Structures of representative accessory 
light-harvesting pigments in photosynthetic cells. 

(a) B-Carotene, an accessory light-harvesting pigment 
in leaves. (b) Phycocyanobilin, a blue pigment found 
in cyanobacteria. 
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Energy transfer to FIGURE 21.7 Possible fates of the quantum of light energy 
neighboring P molecule absorbed by photosynthetic pigments. 
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FIGURE 21.8 Model for light absorption by chlorophyll 
and transduction of light energy into an oxidation- 
reduction reaction. l: Photoexcitation of Chl creates 
Chi*. Il: Electron transfer from ChI* to A yields oxidized 
Chl (ChI*) and reduced A (A7) Ill: An electron-transfer 
pathway from A~ to NADP* leads to NADPH formation 
and restoration of oxidized A (A). IV: ChI* accepts an 
electron from B, restoring Chl and generating oxidized 
B (B+). V: B* is reduced back to B by an electron origi- 
nating in H,O.Water oxidation is the source of O3 
formation. 


2. Loss as light. Energy of excitation reappears as fluorescence (light emission); a 
photon of fluorescence is emitted as the e returns to a lower orbital. This fate is 
common only in saturating light intensities. For thermodynamic reasons, the 
photon of fluorescence has a longer wavelength and hence lower energy than 
the quantum of excitation. 

3. Resonance energy transfer. The excitation energy can be transferred by reso- 
nance energy transfer to a neighboring molecule if the energy level difference be- 
tween the two corresponds to the quantum of excitation energy. In this process, 
the energy transferred raises an electron in the receptor molecule to a higher 
energy state as the photoexcited e in the original absorbing molecule returns to 
ground state. This so-called Förster resonance energy transfer is the mechanism 
whereby quanta of light falling anywhere within an array of pigment molecules 
can be transferred ultimately to specific photochemically reactive sites. 

4. Energy transduction. The energy of excitation, in raising an electron to a higher 
energy orbital, dramatically changes the standard reduction potential, €,’, of the 
pigment such that it becomes a much more effective electron donor. That is, the 
excited-state species, by virtue of having an electron at a higher energy level 
through light absorption, has become a more potent electron donor. Reaction of 
this excited-state electron donor with an electron acceptor situated in its vicinity 
leads to the transformation, or transduction, of light energy (photons) to chem- 
ical energy (reducing power, the potential for electron-transfer reactions). Trans- 
duction of light energy into chemical energy, the photochemical event, is the essence of 
photosynthesis. 


The Transduction of Light Energy into Chemical Energy Involves 
Oxidation—Reduction 


The diagram presented in Figure 21.8 illustrates the fundamental transduction of 
light energy into chemical energy (an oxidation-reduction reaction) that is the ba- 
sis of photosynthesis. Chlorophyll (Chl) resides in a membrane in close association 
with molecules competent in e transfer, symbolized here as A and B. Chl absorbs 
a photon of light, becoming activated to Chl* in the process. Electron transfer 
from Chl* to A leads to oxidized Chl (Chl-*, a cationic free radical) and reduced 
A (A` in the diagram). Subsequent oxidation of A~ eventually culminates in re- 
duction of NADP* to NADPH. The electron “hole” in oxidized Chl (Ch1-*) is filled 
by transfer of an electron from B to Chl-*, restoring Chl and creating B*. B* is re- 
stored to B by an e donated by water. Où is the product of water oxidation. Note 
that the system is restored to its original state once NADPH is formed and HO is 
oxidized. Proton translocations accompany these light-driven electron-transport 
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reactions. Such H* translocations establish a chemiosmotic gradient across the 
photosynthetic membrane that can drive ATP synthesis. 


Photosynthetic Units Consist of Many Chlorophyll Molecules 
but Only a Single Reaction Center 


In the early 1930s, Emerson and Arnold investigated the relationship between the 
amount of incident light energy, the amount of chlorophyll present, and the amount 
of oxygen evolved by illuminated algal cells. Emerson and Arnold were seeking to de- 
termine the quantum yield of photosynthesis: the number of electrons transferred 
per photon of light. Their studies gave an unexpected result: When algae were illu- 
minated with very brief light flashes that could excite every chlorophyll molecule at 
least once, only one molecule of Op was evolved per 2400 chlorophyll molecules. 
This result implied that not all chlorophyll molecules are photochemically reactive, 
and it led to the concept that photosynthesis occurs in functionally discrete units. 

Chlorophyll serves two roles in photosynthesis. It is involved in light harvesting 
and the transfer of light energy to photoreactive sites by exciton transfer, and it par- 
ticipates directly in the photochemical events whereby light energy becomes chem- 
ical energy. A photosynthetic unit (Figure 21.9) can be envisioned as an antenna 
of several hundred light-harvesting chlorophyll molecules (green) plus a special 
pair of photochemically reactive chlorophyll a molecules called the reaction center 
(orange). The purpose of the vast majority of chlorophyll in a photosynthetic unit 
is to harvest light incident within the unit and funnel it, via resonance energy trans- 
fer, to the reaction center chlorophyll dimers that are photochemically active. Most 
chlorophyll thus acts as a large light-collecting antenna, and it is at the reaction cen- 
ters that the photochemical event occurs. Oxidation of chlorophyll leaves a cationic 
free radical, Chl-*, whose properties as an electron acceptor have important con- 
sequences for photosynthesis. Note that the Mg*t ion does not change in valence 
during these redox reactions. 


213" What Kinds of Photosystems Are Used 
to Capture Light Energy? 


All photosynthetic cells contain some form of photosystem. Photosynthetic bacteria 
have only one photosystem; furthermore, they lack the ability to use light energy to 
split HO and release Oy. Cyanobacteria, green algae, and higher plants are oxygenic 
phototrophs because they can generate Os from water. Oxygenic phototrophs have two 
distinct photosystems: photosystem I (PSI) and photosystem II (PSII). Type I photo- 
systems use ferredoxins as terminal electron acceptors; type II photosystems use 
quinones as terminal electron acceptors. PSI is defined by reaction center chloro- 
phylls with maximal red light absorption at 700 nm; PSII uses reaction centers that 
exhibit maximal red light absorption at 680 nm. The reaction center Chl of PSI is re- 
ferred to as P700 because it absorbs light of 700-nm wavelength; the reaction center 
Chl of PSII is called P680 for analogous reasons. Both P700 and P680 are chlorophyll 
a dimers situated within specialized protein complexes. A distinct property of PSII is 
its role in light-driven Og evolution. Interestingly, the photosystems of photosynthetic 
bacteria are type II photosystems that resemble eukaryotic PSII more than PSI, even 
though these bacteria lack Os-evolving capacity. 


Chlorophyll Exists in Plant Membranes in Association with Proteins 


Detergent treatment of a suspension of thylakoids dissolves the membranes, releasing 
complexes containing both chlorophyll and protein. These chlorophyll—protein com- 
plexes represent integral components of the thylakoid membrane, and their organi- 
zation reflects their roles as either light-harvesting complexes (LHC), PSI complexes, 
or PSII complexes. All chlorophyll is apparently localized within these three macro- 
molecular assemblies. 
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CENGAGENOW” ANIMATED FIGURE 21.9 Schematic 
diagram of a photosynthetic unit. See this figure 
animated at www.cengage.com/login. 
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FIGURE 21.10 Roles of the two photosystems, PSI 
and PSII. 
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PSI and PSII Participate in the Overall Process of Photosynthesis 


What are the roles of the two photosystems, and what is their relationship to each 
other? PSI provides reducing power in the form of NADPH. PSII splits water, pro- 
ducing Os, and feeds the electrons released into an electron-transport chain that 
couples PSII to PSI. Electron transfer between PSII and PSI pumps protons for 
chemiosmotic ATP synthesis. As summarized by Equation 21.2, photosynthesis in- 
volves the reduction of NADP*, using electrons derived from water and activated by 
light, hv. ATP is generated in the process. The standard reduction potential for the 
NADP*/NADPH couple is —0.32 V. Thus, a strong reductant with an ©,’ more neg- 
ative than —0.32 V is required to reduce NADP* under standard conditions. By sim- 
ilar reasoning, a very strong oxidant will be required to oxidize water to oxygen be- 
cause €,'(5O2/H,O) is +0.82 V. Separation of the oxidizing and reducing aspects 
of Equation 21.2 is accomplished in nature by devoting PSI to NADP* reduction 
and PSII to water oxidation. PSI and PSII are linked via an electron-transport chain 
so that the weak reductant generated by PSII can provide an electron to reduce the 
weak oxidant side of P700 (Figure 21.10). Thus, electrons flow from H,O to NADP*, 
driven by light energy absorbed at the reaction centers. Oxygen is a by-product of 
the photolysis, literally “light-splitting,” of water. Accompanying electron flow is pro- 
duction of a proton gradient and ATP synthesis (see Section 21.6). This light- 
driven phosphorylation is termed photophosphorylation. 


The Pathway of Photosynthetic Electron Transfer Is Called 
the ZScheme 


Photosystems I and II contain unique complements of electron carriers, and these 
carriers mediate the stepwise transfer of electrons from water to NADP*. When the 
individual redox components of PSI and PSII are arranged as an e transport chain 
according to their standard reduction potentials, the zigzag result resembles the let- 
ter Z laid sideways (Figure 21.11). The various electron carriers are indicated as fol- 
lows: “Mn complex” symbolizes the manganese-containing oxygen-evolving complex; 
D is its e acceptor and the immediate ¢ donor to P6807; Q a and Qs, represent spe- 
cial plastoquinone molecules (see Figure 21.13) and PQ the plastoquinone pool; 
Fe-S stands for the Rieske iron-sulfur center, and cyt f, cytochrome f. PC is the 
abbreviation for plastocyanin, the immediate ¢ donor to P700*; and Fy, Fp, and Fx 
represent the membrane-associated ferredoxins downstream from Ap (a specialized 
Chl a) and A; (a specialized PSI quinone). Fd is the soluble ferredoxin pool that 
serves as the e donor to the flavoprotein (Fp), called ferredoxin-NADP* reductase, 
which catalyzes reduction of NADP* to NADPH. Cyt(d¢) n, (be) p symbolizes the cyto- 
chrome bs moieties of the cytochrome bf complex. PQ and the cytochrome bęf com- 
plex also serve to transfer e from F/F back to P700* during cyclic photophos- 
phorylation (the pathway symbolized by the dashed arrow). 

Overall photosynthetic electron transfer is accomplished by three membrane- 
spanning supramolecular complexes composed of intrinsic and extrinsic poly- 
peptides (shown as shaded boxes bounded by solid black lines in Figure 22.11). 
These complexes are the PSII complex, the cytochrome bęf complex, and the PSI 
complex. The PSII complex is aptly described as a light-driven water : plastoquinone 
oxidoreductase; it is the enzyme system responsible for photolysis of water, and as 
such, it is also referred to as the oxygen-evolving complex, or OEC. PSII possesses a 
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CENGAGENOW” ACTIVE FIGURE 21.11 The Z scheme of photosynthesis. (a) The Z scheme is a diagrammatic rep- 
resentation of photosynthetic electron flow from H20 to NADP*.The energy relationships can be derived from the 
€,' scale beside the Z diagram. Energy input as light is indicated by two broad arrows, one photon appearing in 
P680 and the other in P700. P680* and P700* represent photoexcited states. The three supramolecular complexes 
(PSI, PSII, and the cytochrome bsf complex) are in shaded boxes. Proton translocations that establish the proton- 
motive force driving ATP synthesis are illustrated as well. (b) The functional relationships among PSII, the cyto- 
chrome bf complex, PSI, and the photosynthetic CF;CFo-ATP synthase within the thylakoid membrane. Test 
yourself on the concepts in this figure at www.cengage.com/login. 
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FIGURE 21.12 Oxygen evolution requires the accumula- 
tion of four oxidizing equivalents in PSII. (a) O2 evolution 


after brief light flashes. (b) The cycling of the PSII reac- 


tion center through five different oxidation states, Sp to Sy. 
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FIGURE 21.13 The structures of plastoquinone A and 


its reduced form, plastohydroquinone (or plastoquinol). 


Plastoquinone A has nine isoprene units and is the 
most abundant plastoquinone in plants and algae. 


metal cluster containing 4 Mn?* atoms that coordinate two water molecules. As P680 
undergoes four cycles of light-induced oxidation, four protons and four electrons 
are removed from the two water molecules and their O atoms are joined to form Os. 
A tyrosyl side chain of the PSIT complex (see following discussion) mediates electron 
transfer between the Mn?* cluster and P680. The Oo-evolving reaction requires Ca?* 
and CI“ ions in addition to the (Mn?*), cluster. 


Oxygen Evolution Requires the Accumulation of Four Oxidizing 
Equivalents in PSII 


When isolated chloroplasts that have been held in the dark are illuminated with 
very brief flashes of light, Os evolution reaches a peak on the third flash and every 
fourth flash thereafter (Figure 21.12a). The oscillation in O evolution dampens 
over repeated flashes and converges to an average value. These data are interpreted 
to mean that the P680 reaction center complex cycles through five different oxida- 
tion states, numbered So to S4. One electron and one proton are removed photo- 
chemically in each step. When S; is attained, an Os molecule is released (Figure 
21.12b) as PSII returns to oxidation state Sọ and two new water molecules bind. 
(The reason the first pulse of Os release occurred on the third flash [Figure 21.12a] 
is that the PSII reaction centers in the isolated chloroplasts were already poised at 
Sı reduction level.) 


Electrons Are Taken from H,0 to Replace Electrons Lost from P680 


The events intervening between HO and P680 involve D, the name assigned to a 
specific protein tyrosine residue that mediates e transfer from H20 via the Mn com- 
plex to P680* (see Figure 21.11). The oxidized form of D is a tyrosyl free radical 
species, D-*. To begin the cycle, an exciton of energy excites P680 to P680*, where- 
upon P680* transfers an electron to a nearby Chl a molecule, which is the direct elec- 
tron acceptor from P680". This Chl a then reduces a molecule of pheophytin, sym- 
bolized by “Pheo” in Figure 21.11. Pheophytin is like chlorophyll a, except 2 H+ 
replace the centrally coordinated Mg** ion. This special pheophytin is the direct 
electron acceptor from P680*. Loss of an electron from P680* creates P680*, the 
electron acceptor for D. Electrons flow from Pheo via specialized molecules of 
plastoquinone, represented by “Q” in Figure 21.11, to a pool of plastoquinone (PQ) 
within the membrane. Because of its lipid nature, plastoquinone is mobile within the 
membrane and hence serves to shuttle electrons from the PSII supramolecular com- 
plex to the cytochrome bęf complex. Alternate oxidation—reduction of plasto- 
quinone to its hydroquinone form involves the uptake of protons (Figure 21.13). 
The asymmetry of the thylakoid membrane is designed to exploit this proton uptake 
and release so that protons (H*) accumulate within the lumen of thylakoid vesicles, 
establishing an electrochemical gradient. Note that plastoquinone is an analog of 
coenzyme Q, the mitochondrial electron carrier (see Chapter 20). 


Electrons from PSII Are Transferred to PSI via 
the Cytochrome b,f Complex 


The cytochrome bf or plastoquinol: plastocyanin oxidoreductase is a large (210 kD) 
multimeric protein possessing 26 transmembrane a-helices. This membrane protein 
complex is structurally and functionally homologous to the cytochrome bc; complex 
(Complex III) of mitochondria (see Chapter 20). It includes the two heme-containing 
electron transfer proteins for which it is named, as well as iron-sulfur clusters, which also 
participate in electron transport. The purpose of this complex is to mediate the trans- 
fer of electrons from PSII to PSI and to pump protons across the thylakoid membrane 
via a plastoquinone-mediated Q cycle, analogous to that found in mitochondrial 
€ transport (see Chapter 20). Cytochrome f (f from the Latin folium, meaning 
“foliage”) is a c-type cytochrome, with a reduction potential of +0.365 V. Cytochrome 
bs in two forms (low- and high-potential) participates in the oxidation of plastoquinol 
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and the Q cycle of the baf complex. The cytochrome bf complex can also serve in an 
alternative cyclic electron transfer pathway. Under certain conditions, electrons de- 
rived from P700* are not passed on to NADP* but instead cycle down an alternative 
path, whereby reduced ferredoxin contributes its electron to PQ. This electron is then 
passed to the cytochrome 4fcomplex, and then back to P700*. This cyclic flow yields 
no Os evolution or NADP* reduction but can lead to ATP synthesis via so-called cyclic 
photophosphorylation, discussed later. 


Plastocyanin Transfers Electrons from the Cytochrome b,f 
Complex to PSI 


Plastocyanin (PC in Figure 21.11) is an electron carrier capable of diffusion along the 
inside of the thylakoid and migration in and out of the membrane, aptly suited to its 
role in shuttling electrons between the cytochrome bsf complex and PSI. Plastocyanin 
is a low-molecular-weight (10.4 kD) protein containing a single copper atom. PC func- 
tions as a single-electron carrier (@,’ = +0.32 V) as its copper atom undergoes alter- 
nate oxidation-reduction between the cuprous (Cu*) and cupric (Cu?*) states. PSI is 
a light-driven plastocyanin : ferredoxin oxidoreductase. When P700, the specialized 
chlorophyll a dimer of PSI, is excited by light and oxidized by transferring its æ to an 
adjacent chlorophyll a molecule that serves as its immediate ¢ acceptor, P700* is 
formed. (The standard reduction potential for the P700*/P700 couple is about 
+0.45 V.) P700* readily gains an electron from plastocyanin. 

The immediate electron acceptor for P700* is a special molecule of chlorophyll. 
This unique Chl a (Ao) rapidly passes the electron to a specialized quinone (A1), 
which in turn passes the e to the first in a series of membrane-bound ferredoxins. This 
Fd series ends with a soluble form of ferredoxin, Fd,, which serves as the immediate 
electron donor to the flavoprotein (Fp) that catalyzes NADP* reduction, namely, 
ferredoxin: NADP* reductase. 


2lc4 What Is the Molecular Architecture of Photosynthetic 
Reaction Centers? 


What molecular architecture couples the absorption of light energy to rapid 
electron-transfer events, in turn coupling these e transfers to proton translocations 
so that ATP synthesis is possible? Part of the answer to this question lies in the 
membrane-associated nature of the photosystems. A major breakthrough occurred 


Structure of the cyanobacterial cytochrome bef com- 
plex. The heme groups of cytochromes ben, beP, and f 
are shown in red; the iron-sulfur clusters are blue 

(pdb id = 1BF5).The upper bundle of «-helices defines 
the transmembrane domain. 
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FIGURE 21.14 The R. viridis reaction center (RC). (a) Diagram of the RC showing light activation and path of 
e` transfer. (b) Molecular graphic of the R. viridis RC. M and L are yellow and blue; H is orange; the cytochrome 
is green. (c) Deletion of the R. viridis RC protein chains reveals the spatial relationship between its heme, 
chlorophyll, and quinone prosthetic groups. The iron atom is represented by a sphere (pdb id = 1PRC). 


in 1984, when Johann Deisenhofer, Hartmut Michel, and Robert Huber reported 
the first X-ray crystallographic analysis of a membrane protein. To the great benefit 
of photosynthesis research, this protein was the reaction center from the photosyn- 
thetic purple bacterium Rhodopseudomonas viridis. This research earned these three 
scientists the 1984 Nobel Prize in Chemistry. 


The R. viridis Photosynthetic Reaction Center Is an Integral 
Membrane Protein 


R. viridisis a photosynthetic prokaryote with a single photosystem that resembles PSII 
(even though it lacks an OEC and the capacity to oxidize water). The reaction center 
(145 kD) of the R. viridis photosystem is localized in the plasma membrane of these 
photosynthetic bacteria and is composed of four different polypeptides, designated L 
(273 amino acid residues), M (323 residues), H (258 residues), and cytochrome (333 
amino acid residues) (Figure 21.14a). L and M each consist of five membrane- 
spanning a-helical segments; H has one such helix, the majority of the protein form- 
ing a globular domain in the cytoplasm (Figure 21.14b). The cytochrome subunit 
contains four heme groups; the N-terminal amino acid of this protein is cysteine. This 
cytochrome is anchored to the periplasmic face of the membrane via the hydropho- 
bic chains of two fatty acid groups that are esterified to a glyceryl moiety joined via a 
thioether bond to the Cys (Figure 21.14a). L and M each bear two bacteriochlorophyll 
molecules (the bacterial version of Chl) and one bacteriopheophytin. L also has a bound 
quinone molecule, Q 4. Together, L and M coordinate an Fe atom. The photochemi- 
cally active species of the R. viridis reaction center, P870, is composed of two bacterio- 
chlorophylls, one contributed by L and the other by M. 


Photosynthetic Electron Transfer by the R. viridis Reaction Center 


Leads to ATP Synthesis 


The prosthetic groups of the R. viridis reaction center (P870, BChl, BPheo, and 
the bound quinones) are fixed in a spatial relationship to one another that favors 
photosynthetic e transfer (Figure 21.14a,c). Photoexcitation of P870 (creation of 
P870*) leads to e loss (P870*) via electron transfer to the nearby bacteriochloro- 
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phyll (BChl). The e is then transferred via the L bacteriopheophytin (BPheo) to 
Qa, which is also an L prosthetic group. The corresponding site on Mis occupied by 
a loosely bound quinone, Qg, and electron transfer from Q4 to Qg takes place. An 
interesting aspect of the system is that no electron transfer occurs through M, even 
though it has components apparently symmetric to and identical to the L e transfer 
pathway. 

The reduced quinone formed at the Qp site is free to diffuse to a neighboring 
cytochrome bc, membrane complex, where its oxidation is coupled to H* transloca- 
tion and, hence, ultimately to ATP synthesis. The use of light energy to drive ATP 
synthesis by the concerted action of these membrane proteins is called photophos- 
phorylation (Figure 21.15). 

Cytochrome co, a periplasmic protein, serves to cycle electrons back to P870* via 
the four hemes of the reaction center cytochrome subunit. A specific tyrosine 
residue of L (Tyr!®) is situated between P870 and the closest cytochrome heme. 
This Tyr is the immediate e donor to P870* and completes the light-driven elec- 
tron transfer cycle. 


The Molecular Architecture of PSII Resembles the R. viridis Reaction 
Center Architecture 


Type II photosystems of higher plants, green algae, and cyanobacteria contain 
more than 20 subunits and are considerably more complex than the R. viridis 
reaction center. The structure of PSII from the thermophilic cyanobacterium 





CENGAGENOW” ACTIVE FIGURE 21.15 Photophosphorylation. Photoexcitation of the R. viridis RC leads to 
reduction of a quinone, Q, to form QH;. Oxidation of QH, by the cytochrome bc, complex leads to H* trans- 
location for ATP synthesis by the R. viridis F;Fo-ATP synthase. Test yourself on the concepts in this figure at 
www.cengage.com/login. 
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CENGAGENOW” Go to CengageNOW and click 
Cengagelnteractive to explore the R. viridis reaction 
center, a complex scaffold for transduction of light 
energy. 


FIGURE 21.16 Molecular architecture of the Synechococ- 
cus elongatus PSII dimer. (a) The arrow shows the path of 
electron transfer from P680* to Chip; to Phedp; to Qa 

on D2 and then, via the Fe atom, to Qgon D1. The Tyr'®! 
residue of D1, symbolized by Y, is situated between 
P680 and the (Mn), cluster. (b) Structure of S. elongatus 
PSII (pdb id = 1S5L). Chlorophylls of the reaction center 
and electron transfer path are shown in green; pheo- 
phytins, in blue. The OECs are shown in brick red. 
(Adapted from Barber, J., 2003. Photosystem II: The engine of life. 
Quarterly Review of Biophysics 36:71-89.) 


Synechococcus elongatus has been revealed by X-ray crystallography, providing insight 
into PSII structures in general. Interestingly, both type II and type I photosystems 
show significant similarity to the R. viridis reaction center, thus establishing a 
strong evolutionary connection between reaction centers. 

S. elongatus PSII is a homodimeric structure. Each “monomer” has a mass of 
almost 350 kD and 23 different protein subunits, the 4 largest being the reaction 
center pair of subunits (D1 and D2) and two chlorophyll-containing inner antenna 
subunits (CP43 and CP47) that bracket D1 and D2 (Figure 21.16). Together, CP43 
and CP47 have a total of 26 Chl a molecules, and exciton energy is collected and 
transferred from them to P680. Collectively, the protein subunits in a PSII 
“monomer” have at least 34 transmembrane a-helical segments, 22 of which are 
found in the D1-D2-CP43-CP47 “core” structure. D1 and D2 each have five 
membrane-spanning a-helices. Structurally and functionally, these two subunits are 
a direct counterpart of the Land M subunits of the R. viridis reaction center. P680 
consists of a pair of Chl a molecules, with D1 and D2 each contributing one. D1 and 
D2 each have two other Chl a molecules, one near each P680 (Chlp; and Chips, re- 
spectively) and another that interacts with CP43/CP47 (Chlzp, and Chlz.po, respec- 
tively) (Figure 21.16). Two equivalents of pheophytin (Pheo) are located on D1 and 
D2. The tyrosine species D is Tyr!*! in the D1 amino acid sequence. Complexed to 
D2 is a tightly bound plastoquinone molecule, Q,. Electrons flow from P680* to 
Chlp; and on to Pheop;. Pheop; then transfers the electron to Q, on D2, where it 
then moves to a second plastoquinone situated in the Qx site on D1 (Figure 21.16). 
Electron transfer from Q4 and Qp is assisted by the iron atom located between 
them. Each plastoquinone that enters the Qp site accepts two electrons derived 
from water and two H* from the stroma before it is released into the membrane as 
the hydroquinone PQH». Thus, two photons are required to reduce each PQ that 
enters the Qp site. The stoichiometry of the overall reaction catalyzed by PSII is 
2 HO + 2 PQ + 4 hu—> Os + 2 PQH». The (Mn), complex is located on the 
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lumenal side of the thylakoid membrane. Thus, protons liberated from HO mole- 
cules at the Mn site are deposited directly into the lumen. 


How Does PSII Generate 0, from H,0? 


PSII catalyzes what is probably the most thermodynamically difficult reaction in na- 
ture, the light-driven oxidation of water to molecular oxygen. The protons and elec- 
trons released in this reaction are used to reduce NADP* to NADPH and to estab- 
lish a proton gradient across the photosynthetic membrane that can be tapped to 
drive chemiosmotic ATP synthesis (see Figure 21.20). Accumulation of molecular 
oxygen in the atmosphere as a by-product of the photo-oxidation of water has trans- 
formed the planet since the evolutionary appearance of this reaction some 2 billion 
years ago in cyanobacteria. How does PSII evolve oxygen? 


The Structure of the Oxygen-Evolving Complex The architecture of the S. elon- 
gatus photosynthetic OEC reveals a large globular protein domain juxtaposed on the 
lumenal side of the D1 subunit of PSII (Figure 21.16). The active site of the OEC 
contains a cubelike metal cluster that consists of four manganese ions, one calcium 
ion, and five oxygen atoms bridging the Mn atoms, as shown in Figure 21.17. This 
metal cluster is held by Glu!®, Asp***, His***, and His??? of the PSII D1 subunit and 
Glu**! of CP43. Chloride ion (Cl-) is required for Og evolution, and CI” is believed 
to be a Ca?* ligand. Note also that Tyr!®! (Y,) is situated near the metal cluster, ide- 
ally poised to serve in electron transfer between HO and P680*. When four e have 
been removed from the cluster (one from each Mn atom) through e transfer to 
PSII via Tyr!®!, two H¿O molecules provide the four e needed to re-reduce the Mn 
atoms and Osis evolved. The four H* released contribute to the proton gradient. 


The Molecular Architecture of PSI Resembles the R. viridis Reaction 
Center and PSII Architecture 


The structure of PSI from the cyanobacterium S. elongatus also has been solved 
by X-ray crystallography, completing our view of reaction center structure and 
confirming the fundamental similarities in organization that exist in these energy- 
transducing integral membrane proteins. Because of direct correlations with infor- 
mation about eukaryotic PSI, this cyanobacterial PSI provides a general model for all 
P700-dependent photosystems. 

S. elongatus PSI exists as a cloverleaf-shaped trimeric structure. Each “monomer” 
(356 kD) consists of 12 different protein subunits and 127 cofactors: 96 chlorophyll a 
molecules, 2 phylloquinones, 3 Fe4S, clusters, 22 carotenoids, and 4 lipids that are an 
intrinsic part of the protein complex (Figure 21.18). All of the electron-transferring 
prosthetic groups essential to PSI function are localized to just three polypeptides: 
PsaA, PsaB, and PsaC. PsaA and PsaB (83 kD each) compose the reaction center het- 
erodimer, a structural pattern now seen as universal in photosynthetic reaction centers. 
PsaA and PsaB each have 11 transmembrane a-helices, with the 5 most C-terminal 
a-helices of each serving as the scaffold for the reaction center photosynthetic electron- 
transfer apparatus. PsaC interacts with the stromal face of the PsaA—PsaB heterodimer. 
PsaC carries the two Fe,S, clusters, F, and Fẹ, and interacts with PsaD. Together they 
provide a docking site for ferredoxin. The electron-transfer system of PSI consists of 
three pairs of chlorophyll molecules: P700 (a heterodimer of Chl a and an epimeric 
form, Chl a’) and two additional Chl a pairs (symbolized by Ao) that mediate e` trans- 
fer to the quinone acceptor. The S. elongatus quinone acceptor (A) is phylloquinone 
(also known as vitamin Kı). The Fx Fe,S, cluster bridges PsaA and PsaB; two of its four 
cysteine ligands come from PsaA, the other two from PsaB. Photochemistry begins 
with exciton absorption at P700, almost instantaneous electron transfer and charge 
separation (P700*: Ao), followed by transfer of the electron from Ap to A; and on 
to Fx and then F, and Fp, where it goes on to reduce a ferredoxin molecule at the 
“stromal” side of the membrane. The positive charge at P700* and the ¢ at Fa/Fk 
represent a charge separation across the membrane, an energized condition created 





FIGURE 21.17 Structure of the PSII oxygen-evolving 
complex (OEC). Four Mn atoms (red, lettered A-D) and a 
Ca atom (green) form the water-splitting metal cluster 
of the OEC. Bridging O atoms are purple. (Adapted from 
Figure 4 in Yano, J, et al., 2006. Where water is oxidized to dioxy- 


gen: Structure of the photosynthetic Mn,Ca cluster. 
Science 314:821-825.) 
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Ferredoxin 
docking 





Plastocyanin 


FIGURE 21.19 View of the plant PSI-LHC1 supercom- 
plex, from the stromal side of the thylakoid membrane. 
Chl molecules are shown in green and carotenoids 
and lipids in red. The 16 protein subunits are shown as 
ribbon diagrams in the background, with the positions 
of PsaG, PsaH, PsaK, and LHC1-4 subunits indicated. 
(Adapted from Figure 4 in Nelson, N., and Yocum, C. F, Structure 
and function of photosystems | and II. Annual Review of Plant 
Biology 57: 521-566 (2006). Figure courtesy of the authors.) 


Lumen 


(b) 





FIGURE 21.18 The molecular architecture of PSI. (a) Subunit organization. (Adapted 
from Golbeck, J. H., 1992. Structure and function of photosystem I. Annual Review of Plant Physi- 
ology and Plant Molecular Biology 43:293-324; and Fromme, P, Jordan, P, and Krausse, N., 2001. 
Structure of photosystem |. Biochimica Biophysica Acta 1507:5-31.) (b) Molecular graphic of 
theoretical model for PSI. PsaA is orange; PgaB is magenta; PsaC is yellow. The iron- 
sulfur clusters are red (pdb id = 1YO9). 





21.5 What Is the Quantum Yield of Photosynthesis? 


by light. Plastocyanin (or in cyanobacteria, a lumenal cytochrome designated cyto- 
chrome cs) delivers an electron to fill the electron hole in P700*. 


How Do Green Plants Carry Out Photosynthesis? 


Do the higher plant photosystems follow the structural and functional pattern 
first revealed in the bacterial RC and recapitulated in the cyanobacterial PSI and 
PSII? With new structural information for the higher plant PSI (from Pisum 
sativum, garden pea) and PSII (from spinach), the fundamental organization 
pattern for photosystems seen earlier is confirmed for higher plants. Further, the 
structure of a plant membrane protein supercomplex consisting of the PSI reac- 
tion center and its light-harvesting antenna LHCI1 (light-harvesting complex 1) 
has been described. This supercomplex exists as a “monomer” composed of 16 
distinct protein subunits and about 200 prosthetic groups, including 167 chloro- 
phylls, 2 phylloquinones, and 3 Fe,S, clusters (Figure 21.19). The four LHC1 
subunits form an arc around one side of the PSI RC. A second light-harvesting 
complex (LHC2) binds to another side. This plant PSI system, like all photosys- 
tems, is remarkably efficient, showing a quantum efficiency of nearly 1 (one elec- 
tron transferred per photon falling anywhere within the supercomplex). The 
many Chl and other light-harvesting molecules of the supercomplex form an in- 
tegrated network for highly efficient transfer of light energy into P700. 


2ledy What Is the Quantum Yield of Photosynthesis? 


The quantum yield of photosynthesis can be defined as the amount of product 
formed per equivalent of light input. In terms of exciton delivery to reaction center 
Chl dimers and subsequent e transfer, the quantum yield of photosynthesis typically 
approaches the theoretical limit of 1. The quantum yield of photosynthesis can also 
be expressed as the ratio of COs fixed or Os evolved per photon absorbed. Interest- 
ingly, an overall stoichiometry of three H* translocated into the thylakoid vesicle has 
been observed for each electron passing from HO to NADP*. Two photons per cen- 
ter would allow a pair of electrons to flow from HO to NADP* (see Figure 21.11), 
resulting in the formation of 1 NADPH and 50>. More appropriately, 4 hv per cen- 
ter (8 quanta total) would drive the evolution of 1 Os, the reduction of 2 NADP*, 
and the translocation of 12 H*. Current estimates suggest that 3 ATPs are formed for 
every 14 H* translocated, so (12/14)3 = 2.57 ATP would be synthesized from an in- 
put of 8 quanta. 

The energy of a photon depends on its wavelength, according to the equation 
E= hv = hc/À, where E is energy, c is the speed of light, and A is its wavelength. Ex- 
pressed in molar terms, an Einstein is the amount of energy in Avogadro’s number 
of photons: E = Nhc/X. Light of 700-nm wavelength is the longest-wavelength and 
the lowest-energy light acting in the eukaryotic photosystems. An Einstein of 
700-nm light is equivalent in energy to approximately 170 kJ. Eight Einsteins of 
this light, 1360 kJ, theoretically generate 2 moles of NADPH, 2.57 moles of ATP, and 
1 mole of Os. 


Calculation of the Photosynthetic Energy Requirements for Hexose 
Synthesis Depends on H*/hv and ATP/H* Ratios 


The fixation of carbon dioxide to form hexose, the dark reactions of photosynthesis, 
requires considerable energy. The overall stoichiometry of this process (see Equation 
21.3) involves 12 NADPH and 18 ATPs. Thus, the ATP/NADPH ratio for CO,» fixation 
is 1.5. To generate 12 equivalents of NADPH necessitates the consumption of 48 Ein- 
steins of light, minimally 170 kJ each. However, if the preceding ratio of 1.29 ATPs per 
NADPH is correct, only 15.5 or so ATPs would be produced for CO» fixation. To make 
up the deficit of 2.5 ATPs would require 35 H* or about 12 more ¢ transferred from 
HO to NADP* (an additional 24 Einsteins). From 72 Einsteins, or 12,240 kJ, 1 mole 
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of hexose would be synthesized. The standard free energy change, A G°’, for hexose 
formation from carbon dioxide and water (the exact reverse of cellular respiration) 
is +2870 kJ/mol. Note that many assumptions underlie these calculations, including 
assumptions about the ATP/H™ ratio, the H*/e ratio, and ultimately, the relation- 
ship between quantum input and overall yields of NADPH and ATP. Also, cyclic pho- 
tophosphorylation (see later section titled Cyclic Photophosphorylation Generates 
ATP but Not NADPH or Oz) leads to ATP synthesis and may aid in making up the ATP 
deficit just mentioned. 


2x6 How Does Light Drive the Synthesis of ATP? 


Light-driven ATP synthesis, or photophosphorylation, is a fundamental part of the 
photosynthetic process. The conversion of light energy to chemical energy results 
in electron-transfer reactions, which lead to the generation of reducing power (re- 
duced quinones or NADPH). Coupled with these electron transfers, protons are 
driven across the thylakoid membranes from the stromal side to the lumenal side. 
These proton translocations occur in a manner analogous to the proton transloca- 
tions accompanying mitochondrial electron transport that provide the driving force 
for oxidative phosphorylation (see Chapter 20). Figure 21.11 indicates that proton 
translocations can occur at a number of sites. For example, protons are produced 
in the thylakoid lumen upon photolysis of water by PSII. The oxidation—reduction 
events as electrons pass through the plastoquinone pool and the Q cycle are an- 
other source of proton translocations. The proton transfer accompanying NADP+ 
reduction also can be envisioned as protons being taken from the stromal side of 
the thylakoid vesicle. The current view is that three protons are translocated for 
each electron that flows from HO to NADP*. Because this electron transfer re- 
quires two photons, one falling at PSII and one at PSI, the overall yield is 1.5 pro- 
tons per quantum of light. 


The Mechanism of Photophosphorylation Is Chemiosmotic 


The thylakoid membrane is asymmetrically organized, or “sided,” like the mito- 
chondrial membrane. It also shares the property of being a barrier to the passive 
diffusion of H* ions. Photosynthetic electron transport thus establishes an electro- 
chemical gradient, or proton-motive force, across the thylakoid membrane with the 
interior, or lumen, side accumulating H* ions relative to the stroma of the chloro- 
plast. Like oxidative phosphorylation, the mechanism of photophosphorylation is 
chemiosmotic. 

A proton-motive force of approximately —250 mV is needed to achieve ATP syn- 
thesis. This proton-motive force, Ap, is composed of a membrane potential, Ay, and 
a pH gradient, ApH (see Chapter 20). The proton-motive force is defined as the 
free energy difference, AG, divided by F, Faraday’s constant: 


Ap = AG/¥ = A — (2.3 RT/F)ApH (21.5) 


In chloroplasts, the value of Aw is typically —50 to —100 mV and the pH gradient is 
equivalent to about 3 pH units, so — (2.3 RT/F)ApH = —170 mV. This situation con- 
trasts with the mitochondrial proton-motive force, where the membrane potential 
contributes relatively more to Ap than does the pH gradient. 


CF,CF,—-ATP Synthase Is the Chloroplast Equivalent 
of the Mitochondrial F-F,—ATP Synthase 


The transduction of the electrochemical gradient into the chemical energy repre- 
sented by ATP is carried out by the chloroplast ATP synthase, which is highly analo- 
gous to the mitochondrial FıFọ—-ATP synthase. The chloroplast enzyme complex is 
called CF,CF)-ATP synthase, “C” symbolizing chloroplast. Like the mitochondrial 
complex, CF,}CFj-ATP synthase is a heteromultimer of a-, B-, y-, 6-, €-, a-, b-, and 


21.6 How Does Light Drive the Synthesis of ATP? 649 






Ferredoxin 
docking 





docking 





ATP synthase 





FIGURE 21.20 The mechanism of photophosphorylation. 
. A i Photosynthetic electron transport establishes a proton 
c-subunits (see Chapter 20), consisting of a knoblike structure some 9 nm in diame- gradient that is tapped by the CF:CFo-ATP synthase to 


ter (CF) attached to a stalked base (CFọ) embedded in the thylakoid membrane. The drive ATP synthesis. 
mechanism of action of CFıCFo—ATP synthase in coupling ATP synthesis to the col- 

lapse of the pH gradient is similar to that of the mitochondrial ATP synthase de- 

scribed in Chapter 20. However, higher plant CF,CF)-ATP synthase is believed to have 

14 c-subunits in its Fy rotor, implying that one turn of Fy would require 14 H+ and lead 

to synthesis of 3 ATPs. The mechanism of photophosphorylation is summarized 

schematically in Figure 21.20. 


Photophosphorylation Can Occur in Either a Noncyclic or a Cyclic Mode 


Photosynthetic electron transport, which pumps H* into the thylakoid lumen, can 
occur in two modes, both of which lead to the establishment of a transmembrane 
proton-motive force. Thus, both modes are coupled to ATP synthesis and are con- 
sidered alternative mechanisms of photophosphorylation, even though they are dis- 
tinguished by differences in their electron transfer pathways. The two modes are 
cyclic and noncyclic photophosphorylation. Noncyclic photophosphorylation has 
been the focus of our discussion and is represented by the scheme in Figure 21.20, 
where electrons activated by quanta at PSII and PSI flow from HO to NADP*, with 
concomitant establishment of the proton-motive force driving ATP synthesis. Note 
that in noncyclic photophosphorylation, O% is evolved and NADP* is reduced. 


Cyclic Photophosphorylation Generates ATP but Not NADPH or 0, 


In cyclic photophosphorylation, the “electron hole” in P700* created by electron 
loss from P700 is filled not by an electron derived from HO via PSII but by a cyclic 
pathway in which the photoexcited electron returns ultimately to P700*. This path- 
way is schematically represented in Figure 21.11 by the dashed line connecting 
ferredoxin (Fd) and plastoquinone (PQ) within the membrane. This pathway di- 
verts the activated e lost from PSI back through the PQ pool, the cytochrome bf 
complex, and plastocyanin to re-reduce P700* (Figure 21.21). 
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FIGURE 21.21 The pathway of cyclic photophosphoryla- 
tion by PSI. (Adapted from Arnon, D. I, 1984. The discovery of 
photosynthetic phosphorylation. Trends in Biochemical Sciences 
9:258-262.) 





Proton translocations accompany these cyclic electron transfer events so that ATP 
synthesis can be achieved. In cyclic photophosphorylation, ATP is the sole product 
of energy conversion. No NADPH is generated, and because PSII is not involved, no 
oxygen is evolved. Cyclic photophosphorylation theoretically yields 2 H* per e 
(2 H*/hv) from the operation of the cytochrome bf complex. Thus, cyclic photo- 
phosphorylation provides a mechanism for overcoming the ATP deficit for CO» fix- 
ation (see the earlier section titled Calculation of the Photosynthetic Energy Re- 
quirements for Hexose Synthesis Depends on H*/hv and ATP/H* Ratios, page 
647). Estimates indicate that cyclic photophosphorylation may contribute about 10% 
of total chloroplast ATP synthesis and thereby elevate the ATP/NADPH ratio. 


217s How Is Carbon Dioxide Used to Make 
Organic Molecules? 


As we began this chapter, we saw that photosynthesis traditionally is equated with the 
process of CO» fixation, that is, the net synthesis of carbohydrate from COs». Indeed, 
the capacity to perform net accumulation of carbohydrate from CO» distinguishes the 
phototrophic (and autotrophic) organisms from heterotrophs. Although animals 
possess enzymes capable of linking CO» to organic acceptors, they cannot achieve a 
net accumulation of organic material by these reactions. For example, fatty acid 
biosynthesis is primed by covalent attachment of CO, to acetyl-CoA to form malonyl- 
CoA (see Chapter 24). Nevertheless, this “fixed CO.” is liberated in the very next re- 
action, so no net COs incorporation occurs. 

Elucidation of the pathway of CO» fixation represents one of the earliest appli- 
cations of radioisotope tracers to the study of biology. In 1945, Melvin Calvin and 
his colleagues at the University of California, Berkeley, were investigating photo- 
synthetic CO, fixation in Chlorella. Using ''COs, they traced the incorporation of 
radioactive C into organic products and found that the earliest labeled product 
was 3-phosphoglycerate (see Figure 17.14). Although this result suggested that the 
CO» acceptor was a two-carbon compound, further investigation revealed that, in 
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reality, 2 equivalents of 3-phosphoglycerate were formed following addition of CO, 
to a five-carbon (pentose) sugar: 


CO» + 5-carbon acceptor—> [6-carbon intermediate] —> 
two 3-phosphoglycerates 


Ribulose-1,5-Bisphosphate Is the CO, Acceptor in CO, Fixation 


The five-carbon COs acceptor was identified as ribulose-1,5-bisphosphate (RuBP), 
and the enzyme catalyzing this key reaction of CO, fixation is ribulose bisphosphate 
carboxylase/oxygenase, or, in the jargon used by workers in this field, rubisco. The 
name ribulose bisphosphate carboxylase/oxygenase reflects the fact that rubisco catalyzes the 
reaction of either CO, or, alternatively, Og with RuBP. Rubisco is found in the chloro- 
plast stroma. It is a very abundant enzyme, constituting more than 15% of the total 
chloroplast protein. Given the preponderance of plant material in the biosphere, 
rubisco is probably the world’s most abundant protein. Rubisco is large: In higher 
plants, rubisco is a 550-kD heteromultimeric (@g6g) complex consisting of eight iden- 
tical large subunits (55 kD) and eight small subunits (15 kD) (Figure 21.22). The large 
subunit is the catalytic unit of the enzyme. It binds both substrates (CO, and RuBP) 
and Mg?* (a divalent cation essential for enzymatic activity). The small subunit modu- 
lates the catalytic efficiency of the enzyme, increasing kca more than 100-fold. The ru- 
bisco large subunit is encoded by a gene within the chloroplast DNA, whereas the 
small subunit is encoded by a multigene family in the nuclear DNA. Assembly of ac- 
tive rubisco heteromultimers occurs within chloroplasts following transit of the 
small subunit polypeptide across the chloroplast membrane. 





2-Carboxy-3-Keto-Arabinitol Is an Intermediate 
e e E Ri e FIGURE 21.22 Molecular graphic of ribulose bisphos- 
in the Ribulose 1,5 Bisphosphate Carboxylase Reaction phate carboxylase. The enzyme consists of 8 equivalents 


The addition of COs to ribulose-1,5-bisphosphate results in the formation of an — each of two subunits. Clusters of four small subunits 
enzyme-bound intermediate, 2-carboxy-3-keto-arabinitol (Figure 21.23, II). This in- (orange and red) are located at each end of the sym- 

F z ; ; j metric octamer formed by the L subunits (light and dark 
termediate arises when CO, adds to the enediol intermediate generated from green); The ačtiveisites sre revealed In ihéribbonidia- 
ribulose-1,5-bisphosphate. Hydrolysis of the Cə—Cs bond of the intermediate gen- gram by bound ribulose-1,5-bisphosphate (yellow) 
erates two molecules of 3-phosphoglycerate. The COs ends up as the carboxyl group (pdb id = 1RXO). 


of one of the two molecules. 


Ribulose-1,5-Bisphosphate Carboxylase Exists in Inactive 
and Active Forms 


Rubisco exists in three forms: an inactive form, designated FE; a carbamylated, but in- 
active, form, designated EC; and an active form, ECM, which is carbamylated and has 
Mg** at its active sites as well. Carbamylation of rubisco takes place by addition of CO, 
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FIGURE 21.23 The ribulose bisphosphate carboxylase reaction. Mg?* at the active site aids in stabilizing the 
2,3-enediol transition state (I) for CO, addition and in facilitating the carbon-carbon bond cleavage that leads to 
product formation. 
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to its Lys’! e-NH» groups (to give e—NH—COO° derivatives). The CO, molecules 
used to carbamylate Lys residues do not become substrates. The carbamylation reac- 
tion occurs spontaneously at slightly alkaline pH (pH 8). Carbamylation of rubisco 
completes the formation of a binding site for the Mg?* that participates in the catalytic 
reaction. Once Mg** binds to EC, rubisco achieves its active ECM form. Activated 
rubisco displays a K„ for CO» of 10 to 20 uM. The relative abundance of CO» in the 
atmosphere is low, about 0.03%. The concentration of COs dissolved in aqueous so- 
lutions equilibrated with air is about 10 uM. 

Substrate RuBP binds much more tightly to the inactive E form of rubisco (Kp = 
20 nM) than to the active ECM form (K,, for RuBP = 20 uM). Thus, RuBP is also a 
potent inhibitor of rubisco activity. Release of RuBP from the active site of rubisco is 
mediated by rubisco activase. Rubisco activase is a regulatory protein; it binds to E-form 
rubisco and, in an ATP-dependent reaction, promotes the release of RuBP. Rubisco 
then becomes activated by carbamylation and Mg** binding. Rubisco activase itself 
is activated in an indirect manner by light. Thus, light is the ultimate activator of 
rubisco. 


CO, Fixation into Carbohydrate Proceeds Via the Calvin—Benson Cycle 


The immediate product of CO, fixation, 3-phosphoglycerate, must undergo a series 
of transformations before the net synthesis of carbohydrate is realized. Among car- 
bohydrates, hexoses (particularly glucose) occupy center stage. Glucose is the build- 
ing block for both cellulose and starch synthesis. These plant polymers constitute the 
most abundant organic material in the living world, and thus, the central focus on 
glucose as the ultimate end product of CO, fixation is amply justified. Also, sucrose 
(a-D-glucopyranosyl-(1—2)-6-D-fructofuranoside) is the major carbon form translo- 
cated out of leaves to other plant tissues. In nonphotosynthetic tissues, sucrose is me- 
tabolized via glycolysis and the TCA cycle to produce ATP. 

The set of reactions that transforms 3-phosphoglycerate into hexose is named 
the Calvin-Benson cycle (often referred to simply as the Calvin cycle) for its dis- 
coverers. The reaction series is indeed cyclic because not only must carbohydrate 
appear as an end product, but the five-carbon acceptor, RuBP, must be regenerated 
to provide for continual CO, fixation. Balanced equations that schematically repre- 
sent this situation are 


6(1) + 6(5) > 12(3) 
12(3) —>1(6) + 6(5) 


Net: 6(1) —> 1(6) 


Each number in parentheses represents the number of carbon atoms in a com- 
pound, and the number preceding the parentheses indicates the stoichiometry 
of the reaction. Thus, 6(1), or 6 COs, condense with 6(5) or 6 RuBP to give 
12 3-phosphoglycerates. These 12(3)s are then rearranged in the Calvin cycle to 
form one hexose, 1(6), and regenerate the six 5-carbon (RuBP) acceptors. 


The Enzymes of the Calvin Cycle Serve Three Metabolic Purposes 


The Calvin cycle enzymes serve three important ends: 


1. They constitute the predominant COs fixation pathway in nature. 

2. They accomplish the reduction of 3-phosphoglycerate, the primary product of 
CO» fixation, to glyceraldehyde-3-phosphate so that carbohydrate synthesis be- 
comes feasible. 

3. They catalyze reactions that transform three-carbon compounds into four-, five-, 
six-, and seven-carbon compounds. 


Most of the enzymes mediating the reactions of the Calvin cycle also participate 
in either glycolysis (see Chapter 18) or the pentose phosphate pathway (see Chap- 
ter 22). The aim of the Calvin scheme is to account for hexose formation from 
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3-phosphoglycerate. In the course of this metabolic sequence, the NADPH and ATP 
produced in the light reactions are consumed, as indicated earlier in Equation 21.3. 
The Calvin cycle of reactions starts with ribulose bisphosphate carboxylase catalyzing 
formation of 3-phosphoglycerate from CO, and RuBP and concludes with ribulose-5- 
phosphate kinase (also called phosphoribulose kinase), which forms RuBP (Figure 21.24 
and Table 21.1). The carbon balance is given at the right side of Table 21.1. Several 
features of the reactions in this table merit discussion. Note that a total of 18 equiva- 
lents of ATP consumed in hexose formation are expended (reactions 2 and 15): 12 
to form 12 equivalents of 1,3-bisphosphoglycerate from 3-phosphoglycerate by a 
reversal of the normal glycolytic reaction catalyzed by 3-phosphoglycerate kinase 
and six to phosphorylate Ru-5-P to regenerate 6 RuBP. All 12 NADPH equivalents 
are used in reaction 3. Plants possess an NADPH-specific glyceraldehyde-3-phosphate 
dehydrogenase, which contrasts with its glycolytic counterpart in its specificity for 
NADP over NAD and in the direction in which the reaction normally proceeds. 


The Calvin Cycle Reactions Can Account for Net Hexose Synthesis 


When carbon rearrangements are balanced to account for net hexose synthesis, five 
of the glyceraldehyde-3-phosphate molecules are converted to dihydroxyacetone 
phosphate (DHAP). Three of these DHAPs then condense with three glyceraldehyde- 
3-P via the aldolase reaction to yield three hexoses in the form of fructose bisphos- 
phate (Figure 21.24). (Recall that the A G°’ for the aldolase reaction in the glycolytic 
direction is +23.9 kJ/mol. Thus, the aldolase reaction running “in reverse” in the 
Calvin cycle would be thermodynamically favored under standard-state conditions.) 
Taking one FBP to glucose, the desired product of this scheme, leaves 30 carbons, dis- 
tributed as 2 fructose-6-phosphates, 4 glyceraldehyde-3-phosphates, and 2 DHAP. 
These 30 Cs are reorganized into 6 RuBP by reactions 9 through 15. Step 9 and steps 
12 through 14 involve carbohydrate rearrangements like those in the pentose 
phosphate pathway (see Chapter 22). Reaction 11 is mediated by sedoheptulose-1,7- 
bisphosphatase. This phosphatase is unique to plants; it generates sedoheptulose-7-P, 


TABLE 21.1 $ 





Reactions 1 through 15 constitute the cycle that leads to the formation of one equivalent of glucose. The enzyme catalyzing each step, a 
concise reaction, and the overall carbon balance are given. Numbers in parentheses show the numbers of carbon atoms in the substrate 
and product molecules. Prefix numbers indicate in a stoichiometric fashion how many times each step is carried out in order to provide 
a balanced net reaction. 





1. Ribulose bisphosphate carboxylase: 6 CO, + 6 HO + 6 RuBP —> 12 3-PG 6(1) + 6(5) — 12(3) 
2. 3-Phosphoglycerate kinase: 12 3-PG + 12 ATP —> 12 1,3-BPG + 12 ADP 12(3) =——> 12(3) 
3. NADP*-glyceraldehyde-3-P dehydrogenase: 

12 1,3-BPG + 12 NADPH —> 12 NADP?t + 12 G-3-P + 12 P; 12(3) — 12(3) 
4. Triose-P isomerase: 5 G-3-P —> 5 DHAP 5 (3): (3) 
5. Aldolase: 3 G-3-P + 3 DHAP —> 3 FBP 3(3) + 3(3) —> 3(6) 
6. Fructose bisphosphatase: 3 FBP + 3 H:O —> 3 F-6-P + 3 P; 3(6) —> 3(6) 
7. Phosphoglucoisomerase: 1 F-6-P —> 1 G-6-P 1(6) —> 1(6) 
8. Glucose phosphatase: 1 G-6-P + 1 HO —> 1 GLUCOSE + 1 P; 1(6) —> 1(6) 


The remainder of the pathway involves regenerating six RuBP acceptors (= 30 C) from the 
leftover two F-6-P (12 C), four G-3-P (12 C), and two DHAP (6 C). 





9. Transketolase: 2 F-6-P + 2 G-3-P —> 2 Xu-5-P + 2 E-4-P 2(6) + 2(3) ——> 2(5) + 2(4) 
10. Aldolase: 2 E-4-P + 2 DHAP —> 2 sedoheptulose-1,7-bisphosphate (SBP) 2(4) + 2(3) —> 2(7) 
11.Sedoheptulose bisphosphatase: 2 SBP + 2 HO —> 2 S-7-P + 2 P; 2(7) => 2(7) 

12. Transketolase: 2 S-7-P + 2 G-3-P —> 2 Xu-5-P + 2 R-5-P 2(7) + 2(3) —> 4(5) 
13. Phosphopentose epimerase: 4 Xu-5-P —> 4 Ru-5-P 4(5) — 4(5) 
14. Phosphopentose isomerase: 2 R-5-P —> 2 Ru-5-P 2(5). 2d) 
15. Phosphoribulose kinase: 6 Ru-5-P + 6 ATP —> 6 RuBP + 6 ADP 6(5) — 6(5) 





Net: 6 CO, + 18 ATP + 12 NADPH + 12 H+ + 12 HO —> glucose + 18 ADP + 18 P; + 12 NADP+ 6(1) — 1(6) 
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21.7 How Is Carbon Dioxide Used to Make Organic Molecules? 


< CENGAGENOW” ACTIVE FIGURE 21.24 The Calvin—Benson cycle of reactions. The number associated with 
the arrow at each step indicates the number of molecules reacting in a turn of the cycle that produces one 
molecule of glucose. Reactions are numbered as in Table 21.1. Test yourself on the concepts in this figure 
at www.cengage.com/login. 


the seven-carbon sugar serving as the transketolase substrate. Likewise, phosphoribu- 
lose kinase carries out the unique plant function of providing RuBP from Ru-5-P 
(reaction 15). The net conversion accounts for the fixation of 6 equivalents of carbon 
dioxide into 1 hexose at the expense of 18 ATP and 12 NADPH. 


The Carbon Dioxide Fixation Pathway Is Indirectly Activated by Light 


Plant cells contain mitochondria and can carry out cellular respiration (glycolysis, 
the citric acid cycle, and oxidative phosphorylation) to provide energy in the 
dark. Futile cycling of carbohydrate to CO» by glycolysis and the citric acid cycle 
in one direction, and CO, to carbohydrate by the CO,» fixation pathway in the 
opposite direction, is thwarted through regulation of the Calvin cycle (Figure 
21.25). In this regulation, the activities of key Calvin cycle enzymes are coordi- 
nated with the output of photosynthesis. In effect, these enzymes respond indi- 
rectly to light activation. Thus, when light energy is available to generate ATP and 
NADPH for CO, fixation, the Calvin cycle proceeds. In the dark, when ATP and 
NADPH cannot be produced by photosynthesis, fixation of COs ceases. The light- 
induced changes in the chloroplast which regulate key Calvin cycle enzymes in- 
clude (1) changes in stromal pH, (2) generation of reducing power, and (3) Mg?* efflux 
from the thylakoid lumen. 


Light Induces pH Changes in Chloroplast Compartments As discussed in Sec- 
tion 21.6, illumination of chloroplasts leads to light-driven pumping of pro- 
tons into the thylakoid lumen, which causes pH changes in both the stroma and the 
thylakoid lumen (Figure 21.26). The stromal pH rises, typically to pH 8. Because 
rubisco and rubisco activase are more active at pH 8, COs fixation is activated as stro- 
mal pH rises. Fructose-1,6-bisphosphatase, ribulose-5-phosphate kinase, and glyceraldehyde-3- 
phosphate dehydrogenase all have alkaline pH optima. Thus, their activities increase as a 
result of the light-induced pH increase in the stroma. 


Light Energy Generates Reducing Power Illumination of chloroplasts initiates pho- 
tosynthetic electron transport, which generates reducing power in the form of re- 
duced ferredoxin. In turn, Fd,.q leads to reduced thioredoxin via ferredoxin—thioredoxin 
reductase (FTR) (Figure 21.27). Thioredoxin is a small (12 kD) protein possessing in 
its reduced state a pair of sulfhydryls (—SH HS—), which upon oxidation form a 
disulfide bridge (—S—S—). Several Calvin cycle enzymes have pairs of cysteine 
residues that are involved in a disulfide-sulfhydryl transition between an inactive 
(—S—S—) and an active (—SH HS—) form. These enzymes include /fructose-1,6- 
bisphosphatase (residues Cys!” and Cys!”), NADP*-malate dehydrogenase (residues Cys!” 
and Cys!®), and ribulose-5-phosphate kinase (residues Cys!® and Cys”). Thus, light acti- 
vates these key enzymes through this ferredoxin-dependent, thioredoxin-mediated 
pathway (Figure 21.27). 
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FIGURE 21.25 Light regulation of CO, fixation prevents 
a substrate cycle between cellular respiration and 
hexose synthesis by CO, fixation. 
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CENGAGENOW” ANIMATED FIGURE 21.26 Light- 
induced pH changes in chloroplast compartments. 
These pH changes modulate the activity of key Calvin 
cycle enzymes. See this figure animated at www 
.cengage.com/login. 


FIGURE 21.27 The pathway for light-regulated reduction 

of Calvin cycle enzymes. Light-generated reducing power 
(Fdreg = reduced ferredoxin) provides e~ for reduction of 

thioredoxin (T) by FTR. 


656 Chapter 21 


Photosynthesis 


Light Induces Movement of Mg?* lons from the Thylakoid Vesicles into the Stroma 
When light-driven proton pumping across the thylakoid membrane occurs, a con- 
comitant efflux of Mg** ions from vesicles into the stroma is observed. This efflux of 
Mg?* somewhat counteracts the charge accumulation due to H* influx and is one rea- 
son why the membrane potential change in response to proton pumping is less in 
chloroplasts than in mitochondria (see Equation 21.5). Both ribulose bisphosphate car- 
boxylase and_fructose-1,6-bisphosphatase are Mg**-activated enzymes, and Mg** flux into 
the stroma as a result of light-driven proton pumping stimulates the CO» fixation path- 
way at these key steps. Activity measurements have indicated that fructose bisphos- 
phatase may be the rate-limiting step in the Calvin cycle. The recurring theme of fruc- 
tose bisphosphatase as the target of the light-induced changes in the chloroplasts 
implicates this enzyme as a key point of control in the Calvin cycle. 


Protein-Protein Interactions Mediated by an Intrinsically 
Unstructured Protein Also Regulate Calvin—Benson Cycle Activity 


The 8.5-kD chloroplast protein CP12 is an instrinsically unstructured protein (see 
Chapter 6) that interacts with ribulose-5-P kinase (R5PK) and glyceraldehyde-3-P de- 
hydrogenase (GAPDH) to form a complex. When complexed with CP12, R5PK and 
GAPDH are relatively inactive. Note that the first of these enzymes is ATP-dependent 
and the second is NADPH-dependent. Thus, formation of this complex constrains the 
use of ATP and NADPH, the principal products of the light reactions of photosynthe- 
sis. Reduced thioredoxin, through interactions with CP12, leads to the dissociation of 
the GAPDH-CP12-R5PK complex. Uncomplexed R5PK and GAPDH are inherently 
more active, and operation of the Calvin—Benson cycle is enhanced. 


2:1e8: How Does Photorespiration Limit CO, Fixation? 


As indicated, ribulose bisphosphate carboxylase /oxygenase catalyzes an alternative 
reaction in which Où replaces COs as the substrate added to RuBP (Figure 21.28a). 
The ribulose-1,5-bisphosphate oxygenase reaction diminishes plant productivity because 
it leads to loss of RuBP, the essential CO, acceptor. The K,,, for Où in this oxygenase 
reaction is about 200 uM. Given the relative abundance of CO, and Oz in the at- 
mosphere and their relative K„ values in these rubisco-mediated reactions, the ra- 
tio of carboxylase to oxygenase activity in vivo is about 3 or 4 to 1. 

The products of ribulose bisphosphate oxygenase activity are 3-phosphoglycerate 
and phosphoglycolate. Dephosphorylation and oxidation convert phosphoglycolate to 
glyoxylate, the a-keto acid of glycine (Figure 21.28b). Transamination yields 
glycine. In mitochondria, two glycines from photorespiration are converted into 
one serine and CQs. This step is the source of the COs evolved during photorespi- 
ration. Transamination of glyoxylate to glycine by the product serine yields hydroxy- 
pyruvate; reduction of hydroxypyruvate yields glycerate, which can be phosphory- 
lated to 3-phosphoglycerate. 3-Phosphoglycerate can fuel resynthesis of ribulose 
bisphosphate by the Calvin cycle (see Figure 21.24). 

Other fates of phosphoglycolate are also possible, including oxidation to COs, with 
the released energy being dissipated as heat. Obviously, agricultural productivity is 
dramatically lowered by this phenomenon, which, because it is a light-related uptake 
of Oo and release of COs, is termed photorespiration. As we shall see, certain plants, 
particularly tropical grasses, have evolved means to circumvent photorespiration. 
These plants are more efficient users of light for carbohydrate synthesis. 


Tropical Grasses Use the Hatch—Slack Pathway to Capture Carbon 
Dioxide for CO, Fixation 


Tropical grasses are less susceptible to the effects of photorespiration, as noted earlier. 
Studies using ''CO, as a tracer indicated that the first organic intermediate labeled in 
these plants was not a three-carbon compound but a four-carbon compound. Hatch 
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and Slack, two Australian biochemists, first discovered this C-4 product of CO», fixa- 
tion, and the C-4 pathway of COs» incorporation is named the Hatch—Slack pathway 
after them. The C-4 pathway is not an alternative to the Calvin cycle series of reactions 
or even a net COs fixation scheme. Instead, it functions as a COs delivery system, car- 
rying CO» from the relatively oxygen-rich surface of the leaf to interior cells where 
oxygen is lower in concentration and hence less effective in competing with CO» in 
the rubisco reaction. Thus, the C-4 pathway is a means of avoiding photorespiration 
by sheltering the rubisco reaction in a cellular compartment away from high [Og]. 
The C-4 compounds serving as COs» transporters are malate or aspartate. 
Compartmentation of these reactions to prevent photorespiration involves the in- 
teraction of two cell types: mesophyll cells and bundle sheath cells. The mesophyll cells take 
up CO» at the leaf surface, where Og is abundant, and use it to carboxylate phospho- 
enolpyruvate to yield OAA in a reaction catalyzed by PEP carboxylase (Figure 21.29). 
This fourcarbon dicarboxylic acid is then either reduced to malate by an NADPH- 
specific malate dehydrogenase or transaminated to give aspartate in the mesophyll 
cells. The 4-C CO, carrier (malate or aspartate) is then transported to the bundle 
sheath cells, where it is decarboxylated to yield CO% and a 3-C product. The COs is then 
fixed into organic carbon by the Calvin cycle localized within the bundle sheath cells, 
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FIGURE 21.28 The oxygenase reaction of rubisco. 

(a) The reaction of ribulose bisphosphate carboxylase 
with O, and ribulose bisphosphate yields 
3-phosphoglycerate and phosphoglycolate. (b) Conver- 
sion of two phosphoglycolates to serine + COs. 
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FIGURE 21.29 Essential features of the compartmentation 
and biochemistry of the Hatch-Slack pathway of carbon 
dioxide uptake in C4 plants. 
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and the 3-C product is returned to the mesophyll cells, where it is reconverted to PEP 
in preparation to accept another CO, (Figure 21.29). Plants that use the C-4 pathway 
are termed C4 plants, in contrast to those plants with the conventional pathway of CO» 
uptake (C3 plants). 


Intercellular Transport of Each CO, Via a C-4 Intermediate Costs 2 ATPs The trans- 
port of each COs requires the expenditure of two high-energy phosphate bonds. The 
energy of these bonds is expended in the phosphorylation of pyruvate to PEP (phos- 
phoenolpyruvate) by the plant enzyme pyruvate-P; dikinase; the products are PEP, 
AMP, and pyrophosphate (PP;). This represents a unique phosphotransferase reaction 
in that both the £- and y-phosphates of a single ATP are used to phosphorylate the two 
substrates, pyruvate and P;. The reaction mechanism involves an enzyme phosphohisti- 
dine intermediate. The y-phosphate of ATP is transferred to P;, whereas formation of 
E-His-P occurs by addition of the B-phosphate from ATP: 





E—His + AMP,—P,—P, + P; —> E—His—P, + AMP, + P,P; 
E—His—P, + pyruvate —> PEP + E—His 





Net: ATP + pyruvate + Pi —> AMP + PEP + PP; 


Pyruvate-P; dikinase is regulated by reversible phosphorylation of a threonine 
residue, the nonphosphorylated form being active. Interestingly, ADP is the phos- 
phate donor in this interconvertible regulation. Despite the added metabolic ex- 
pense of two phosphodiester bonds for each equivalent of carbon dioxide taken up, 
CO» fixation is more efficient in C4 plants, provided that light intensities and tem- 
peratures are both high. (As temperature rises, photorespiration in C3 plants rises 
and efficiency of COs fixation falls.) Tropical grasses that are C4 plants include 
sugarcane, maize, and crabgrass. In terms of photosynthetic efficiency, cultivated 
fields of sugarcane represent the pinnacle of light-harvesting efficiency. Approxi- 
mately 8% of the incident light energy on a sugarcane field appears as chemical en- 
ergy in the form of CO, fixed into carbohydrate. This efficiency compares dramat- 
ically with the estimated photosynthetic efficiency of 0.2% for uncultivated plant 
areas. Research on photorespiration is actively pursued in hopes of enhancing the 


efficiency of agriculture by controlling this wasteful process. Only 1% of the 230,000 
different plant species known are C4 plants; most are in hot climates. 


Cacti and Other Desert Plants Capture CO, at Night 


In contrast to C4 plants, which have separated COs, uptake and fixation into distinct 
cells in order to minimize photorespiration, succulent plants native to semiarid and 
tropical environments separate CO, uptake and fixation in time. Carbon dioxide 
(as well as Og) enters the leaf through microscopic pores known as stomata, and wa- 
ter vapor escapes from plants via these same openings. In nonsucculent plants, the 
stomata are open during the day, when light can drive photosynthetic CO» fixation, 
and closed at night. Succulent plants, such as the Cactaceae (cacti) and Crassulaceae, 
cannot open their stomata during the heat of day because any loss of precious HyO 
in their arid habitats would doom them. Instead, these plants open their stomata to 
take up CO» only at night, when temperatures are lower and water loss is less likely. 
This carbon dioxide is immediately incorporated into PEP to form OAA by PEP car- 
boxylase; OAA is then reduced to malate by malate dehydrogenase and stored 
within vacuoles until morning. During the day, the malate is released from the vac- 
uoles and decarboxylated to yield CO, and a 3-C product. The CO, is then fixed 
into organic carbon by rubisco and the reactions of the Calvin cycle. Because this 
process involves the accumulation of organic acids (OAA, malate) and is common 
to succulents of the Crassulaceae family, it is referred to as crassulacean acid metabo- 
lism, and plants capable of it are called CAM plants. 
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SUMMARY 


21.1 What Are the General Properties of Photosynthesis? Photosyn- to P680, restoring its ability to absorb another photon, become P680*, 


thesis takes place in membranes. In photosynthetic eukaryotes, the pho- 
tosynthetic membranes form an inner membrane system within chloro- 
plasts that is called the thylakoid membrane system. Photosynthesis is 
traditionally broken down into two sets of reactions: the light reactions, 
whereby light energy is used to generate NADPH and ATP concomitant 
with Os evolution, and the dark reactions in which NADPH and ATP 
provide the chemical energy for fixation of COs into glucose. Water is 
the ultimate e donor for NADP* reduction. 


21.2 How Is Solar Energy Captured by Chlorophyll? Chlorophyll 
and various accessory light-harvesting pigments absorb light through- 
out the visible spectrum and use the light energy to initiate electron- 
transfer reactions. The absorption of a photon of light by a pigment 
molecule promotes an electron of the pigment molecule to a higher 
orbital (and higher energy level). As a result, the pigment molecule is 
a much better electron donor. Photosynthetic units consist of arrays of 
hundreds of chlorophyll molecules and accessory light-harvesting pig- 
ments, but only a single reaction center. The reaction center is formed 
from a pair of Chl molecules. 


21.3 What Kinds of Photosystems Are Used to Capture Light Energy? 
Photosynthetic bacteria have a single photosystem, but eukaryotic pho- 
totrophs have two distinct photosystems. Type I photosystems use pro- 
teins with Fe,S, clusters as terminal e acceptors; type I photosystems 
reduce quinones, such as plastoquinone or phylloquinone. In oxygenic 
phototrophs (cyanobacteria, green algae, and higher plants), photo- 
system II (PSII) generates a strong oxidant that functions in Og evolu- 
tion through the photolysis of water and a weak reductant that reduces 
plastoquinone to plastohydroquinone (PQHs). Photosystem I (PSI) 
generates a weak oxidant that accepts electrons from plastohydro- 
quinone via the cytochrome bf complex and a strong reductant capa- 
ble of reducing NADP* to NADPH. Overall photosynthetic electron 
transfer is accomplished by three supramolecular membrane-spanning 
complexes: PSII, the cytochrome bgf complex, and PSI. Oxygen evolu- 
tion requires the accumulation of four oxidizing equivalents in PSII. 
The electrons withdrawn from water are used to re-reduce P680* back 


and transfer an e once again. Electrons from P680* traverse PSII and 
reduce plastoquinone. Plastohydroquinone is oxidized via the cyto- 
chrome bf complex, with plastocyanin serving as e acceptor. The cy- 
tochrome gfcomplex catalyzes a Q cycle: It translocates 4 H* from the 
stroma to the thylakoid lumen for each molecule of PQHo that it oxi- 
dizes. PSI is a light-driven plastocyanin : ferredoxin oxidoreductase hav- 
ing P700 as its reaction center Chl dimer. Electrons from P700* are 
transferred to the Fe,S, cluster of ferredoxin. Reduced ferredoxin 
reduces NADP* via the ferredoxin: NADP* reductase flavoprotein. The 
electron “hole” in P700* is filled by reduced plastocyanin. 


21.4 What Is the Molecular Architecture of Photosynthetic Reaction 
Centers? All known photosynthetic reaction centers have a universal 
molecular architecture. The “core” structure is a pair of protein sub- 
units having (at least) five transmembrane a-helical segments that pro- 
vide a scaffold upon which the reaction center Chl pair and its associ- 
ated chain of electron transfer cofactors are arrayed in a characteristic 
spatial pattern that facilitates rapid removal of an electron from the 
photoactivated RC and efficient transfer of the e across the membrane 
to a terminal acceptor (such as a quinone or a ferredoxin molecule). 
Photosynthetic electron transport is always coupled to H* translocation 
across the membrane, creating the potential for ATP synthesis by F|Fo- 
type ATP synthases. 


21.5 What Is the Quantum Yield of Photosynthesis? The absorption 
of light energy by the photosynthetic apparatus is very efficient. The 
quantum yield of chemical energy, either in the form of ATP and 
NADPH, or in the form of glucose, depends on a number of factors that 
are still subject to investigation, including the H*/e ratio and the 
ATP/H* ratio. 


21.6 How Does Light Drive the Synthesis of ATP? Photosynthetic elec- 
tron transport leads to proton translocation across the photosynthetic 
membrane and creation of an H* gradient that can be used by an F)Fo- 
type ATP synthase to drive ATP formation from ADP and P;. Pho- 
tophosphorylation occurs by either of two modes: noncyclic and cyclic. 
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Noncyclic photophosphorylation depends on both PSI and PSII and 
leads to Oy evolution, NADP* reduction, and ATP synthesis. In cyclic 
photophosphorylation, only PSI is used, no NADP” is reduced, and no 
Oz is evolved. However, the electron-transfer events of cyclic pho- 
tophosphorylation lead to H* translocation and ATP synthesis. 


21.7 How Is Carbon Dioxide Used to Make Organic Molecules? 
Ribulose-1,5-bisphosphate is the CO% acceptor in the key reaction for 
conversion of carbon dioxide into organic compounds. The reaction is 
catalyzed by rubisco (ribulose bisphosphate carboxylase/oxygenase) ; 
the products of CO» fixation by the rubisco reaction are 2 equivalents of 
3-phosphoglycerate. 

The Calvin—Benson cycle is a series of reactions that converts the 
3-phosphoglycerates formed by rubisco into carbohydrates such as 
glyceraldehyde-3-P, dihydroxyacetone-P, and glucose. COs fixation is 
activated by light through a variety of mechanisms, including changes 
in stromal pH, generation of reducing power in the form of ferre- 
doxin by photosynthetic electron transport, and increased Mg?" efflux 
from the thylakoid lumen to the stroma. 


21.8 How Does Photorespiration Limit CO, Fixation? When Os 
replaces CO, in the rubisco, or ribulose bisphosphate carboxylase/ 


oxygenase, reaction, ribulose-1,5-bisP is destroyed through conversion 
into 3-phosphoglyerate and phosphoglycolate. Phosphoglycolate is oxi- 
dized to form COs, with loss of organic substance from the cell. Because 
Oy is taken up and COs is released in these reactions, the process is 
called photorespiration. 

Tropical grasses carry out the Calvin—Benson cycle of reactions in cells 
shielded from high Oy levels. COg is first incorporated into PEP by PEP 
carboxylase to form oxaloacetate (OAA) in the mesophyll cells on the leaf 
surface. OAA is then reduced to malate and transported to bundle sheath 
cells. There, CO» is released and taken up by the rubisco reaction to form 
3-phosphoglyerate, initiating the Calvin—Benson cycle. Plants capable of 
doing this are called C4 plants. 

Succulent plants of semiarid and tropical regions such as Cactaceae 
and Crassulaceae exchange gases through their stomata only at night in 
order to avoid precious water loss. CO, taken up at night is added to 
PEP by PEP carboxylase to form OAA, which is then reduced to malate 
in the dark. During the day, malate is decarboxylated to yield COs, 
which then enters the Calvin—Benson cycle. This metabolic variation is 
referred to as crassulacean acid metabolism. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. In photosystem I, P700 in its ground state has an é,’ = +0.4 V. Exci- 
tation of P700 by a photon of 700-nm light alters the €,’ of P700* 
to —0.6 V. What is the efficiency of energy capture in this light 
reaction of P700? 

2. What is the ©,’ for the light-generated primary oxidant of photo- 
system II if the light-induced oxidation of water (which leads to O% 
evolution) proceeds with a A G°’ of —25 kJ/mol? 

3. (Integrates with Chapters 3 and 20.) Assuming that the concen- 
trations of ATP, ADP, and P; in chloroplasts are 3, 0.1, and 10 mM, 
respectively, what is the AG for ATP synthesis under these condi- 
tions? Photosynthetic electron transport establishes the proton- 
motive force driving photophosphorylation. What redox potential 
difference is necessary to achieve ATP synthesis under the forego- 
ing conditions, assuming 1.3 ATP equivalents are synthesized for 
each electron pair transferred? 

4. (Integrates with Chapter 20.) Write a balanced equation for the Q 
cycle as catalyzed by the cytochrome bf complex of chloroplasts. 

5. If noncyclic photosynthetic electron transport leads to the translo- 
cation of 3 H*/e and cyclic photosynthetic electron transport 
leads to the translocation of 2 H+/ e, what is the relative photosyn- 
thetic efficiency of ATP synthesis (expressed as the number of pho- 
tons absorbed per ATP synthesized) for noncyclic versus cyclic 
photophosphorylation? (Assume that the CF)CF)—-ATP synthase 
yields 3 ATP/14 H+.) 

6. (Integrates with Chapter 20.) In mitochondria, the membrane po- 
tential (Aw) contributes relatively more to Ap (proton-motive force) 
than does the pH gradient (ApH). The reverse is true in chloro- 
plasts. Why do you suppose that the proton-motive force in chloro- 
plasts can depend more on ApH than mitochondria can? Why is 
(Ay) less in chloroplasts than in mitochondria? 

7. Predict the consequences of a Y161F mutation in the amino acid se- 
quence of the D1 subunit of PSII. 

8. (Integrates with Chapter 20.) Calculate (in Einsteins and in kJ/mol) 
how many photons would be required by the Rhodopseudomonas viridis 
photophosphorylation system to synthesize 3 ATPs. (Assume that the 
R. viridis F\Fo-ATP synthase c-subunit rotor contains 12 c-subunits 
and that the R. viridis cytochrome bc; complex translocates 2 H+/ e.) 

9. (Integrates with Chapters 18 and 20.) Calculate AG°’ for the 
NADP*-specific glyceraldehyde-3-P dehydrogenase reaction of the 
Calvin—Benson cycle. 


10. Write a balanced equation for the synthesis of a glucose molecule 
from ribulose-1,5-bisphosphate and CO» that involves the first three 
reactions of the Calvin cycle and subsequent conversion of the two 
glyceraldehyde-3-P molecules into glucose. 

11. C-labeled carbon dioxide is administered to a green plant, and 
shortly thereafter the following compounds are isolated from 
the plant: 3-phosphoglycerate, glucose, erythrose-4-phosphate, 
sedoheptulose-1,7-bisphosphate, and ribose-5-phosphate. In which 
carbon atoms will radioactivity be found? 

12. The photosynthetic CO, fixation pathway is regulated in response to 
specific effects induced in chloroplasts by light. What is the nature 
of these effects, and how do they regulate this metabolic pathway? 

13. Write a balanced equation for the conversion of phosphoglycolate to 
glycerate-3-P by the reactions of photorespiration. Does this balanced 
equation demonstrate that photorespiration is a wasteful process? 


14. The overall equation for photosynthetic COs fixation is 
6 COs + 6 H:O — CgHj20¢ + 6 Ov 


All the O atoms evolved as Op come from water; none comes from car- 
bon dioxide. But 12 O atoms are evolved as 6 Og, and only 6 O atoms 
appear as 6 HO in the equation. Also, 6 CO, have 12 O atoms, yet 
there are only 6 O atoms in CgH),0,. How can you account for these 
discrepancies? (Hint: Consider the partial reactions of photosynthe- 
sis: ATP synthesis, NADP* reduction, photolysis of water, and the 
overall reaction for hexose synthesis in the Calvin—Benson cycle.) 


15. The number of e¢subunits in F\Fo-type ATP synthases shows some 
variation from organism to organism. For example, the yeast ATP 
synthase contains 10 esubunits, the spinach CF,CF)—-ATP synthase 
has 14, and the cyanobacterium Spirulina platensis enzyme appar- 
ently has 15. 

a. What is the consequence of c-subunit stoichiometry for the 
H*/ATP ratio? 

b. What is the relationship between c-subunit stoichiometry and the 
magnitude of Ap (the proton-motive force)? 

16. The reduction of membrane-associated quinones, such as coen- 
zyme Q and plastoquinones, is a common feature of photosystems 
(see Figures 21.15, 21.20, and 21.21). Assume ©,’ for PQ/PQH» = 
0.07 V and the potential of the ground-state chlorophyll molecule 
= 0.5 V, calculate AG for the reduction of plastoquinone by 
a. 870-nm light. 

b. 700-nm light. 
c. 680-nm light. 


17. Plastoquinone oxidation by cytochrome ba and cytochrome bf 
complexes apparently leads to the translocation of 4*/2e. If €,’ for 
cytochrome f = 0.365 V (Table 20.1) and E,’ for PQ/PQH»s = 
0.07 V, calculate AG for the coupled reaction: 


2 hv + 4H+i,— 4 Aton 
(Assume a value of 23 kJ/mol for the free energy change (AG) as- 
sociated with moving protons from inside to outside.) 


18. What is the overall free energy change (AG) for noncyclic photo- 
synthetic electron transport? 
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4 (700-nm photons) + 4 (680-nm photons) + 2 H:O + 2 NADP* —> 
O + 2 NADPH + 2 H* 


Preparing for the MCAT Exam 

19. From Figure 21.5, predict the spectral properties of accessory light- 
harvesting pigments found in plants. 

20. Draw a figure analogous to Figure 21.26, plotting [Mg**] in the 
stroma and thylakoid lumen on the y-axis and dark-light-dark on the 
x-axis. 
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Gluconeogenesis, Glycogen 
Metabolism, and the Pentose 
Phosphate Pathway 


ESSENTIAL QUESTIONS 


As shown in Chapters 18 and 19, the metabolism of sugars is an important source of 
energy for cells. Animals, including humans, typically obtain significant amounts of 
glucose and other sugars from the breakdown of starch in their diets. Glucose can 
also be supplied via breakdown of cellular reserves of glycogen (in animals) or 
starch (in plants). 


What is the nature of gluconeogenesis, the pathway that synthesizes glucose 
from noncarbohydrate precursors; how is glycogen synthesized from glucose; 
and how are electrons from glucose used in biosynthesis? 


22l What Is Gluconeogenesis, and How Does It Operate? 


The ability to synthesize glucose from common metabolites is very important to 
most organisms. Human metabolism, for example, consumes about 160 + 20 grams 
of glucose per day, about 75% of this in the brain. Body fluids carry only about 
20 grams of free glucose, and glycogen stores normally can provide only about 
180 to 200 grams of free glucose. Thus, the body carries only a little more than a 
1-day supply of glucose. If glucose is not obtained in the diet, the body must pro- 
duce new glucose from noncarbohydrate precursors. The term for this activity is 
gluconeogenesis, which means the generation (genesis) of new (neo) glucose. 

Furthermore, muscles consume large amounts of glucose via glycolysis, produc- 
ing large amounts of pyruvate. In vigorous exercise, muscle cells become anaerobic 
and pyruvate is converted to lactate. Gluconeogenesis salvages this pyruvate and lac- 
tate and reconverts it to glucose. 

Another pathway of glucose catabolism, the pentose phosphate pathway, is the pri- 
mary source of NADPH, the reduced coenzyme essential to most reductive biosyn- 
thetic processes. For example, NADPH is crucial to the biosynthesis of fatty acids 
(see Chapter 24) and amino acids (see Chapter 25). The pentose phosphate path- 
way also results in the production of ribose-5-phosphate, an essential component 
of ATP, NAD*, FAD, coenzyme A, and particularly DNA and RNA. This important 
pathway will also be considered in this chapter. 


The Substrates for Gluconeogenesis Include Pyruvate, Lactate, 
and Amino Acids 


In addition to pyruvate and lactate, other noncarbohydrate precursors can be used 
as substrates for gluconeogenesis in animals. These include most of the amino 
acids, as well as glycerol and all the TCA cycle intermediates. On the other hand, 
fatty acids are not substrates for gluconeogenesis in animals, because most fatty 
acids yield only acetyl-CoA upon degradation, and animals cannot carry out net 
synthesis of sugars from acetyl-CoA. Lysine and leucine are the only amino acids 
that are not substrates for gluconeogenesis. These amino acids produce only 
acetyl-CoA upon degradation. Note also that acetyl-CoA can be a substrate for glu- 
coneogenesis in plants when the glyoxylate cycle is operating (see Chapter 19). 


Nearly All Gluconeogenesis Occurs in the Liver and Kidneys in Animals 


Interestingly, the mammalian organs that consume the most glucose, namely, brain 
and muscle, carry out very little glucose synthesis. The major sites of gluconeogen- 
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HUMAN BIOCHEMISTRY 


The Chemistry of Glucose Monitoring Devices 


Individuals with diabetes must measure their serum glucose con- 
centration frequently, often several times a day. The advent of com- 
puterized, automated devices for glucose monitoring has made this 
necessary chore easier, far more accurate, and more convenient 
than it once was. These devices all use a simple chemical scheme for 
glucose measurement that involves oxidation of glucose to gluconic 
acid by glucose oxidase. This reaction produces two molecules of 
hydrogen peroxide per molecule of glucose oxidized. The H2O% is 
then used to oxidize a dye, such as o-dianisidine, to a colored prod- 
uct that can be measured: 


Glucose + 2 HO + O —> gluconic acid + 2 H20% 
o-dianisidine (colorless) + HO —> oxidized o-anisidine 
(colored) + HO 


The amount of colored dye produced is directly proportional to 
the amount of glucose in the sample. 

The patient typically applies a drop of blood (from a finger- 
prick*) to a plastic test strip that is then inserted into the glucose 
monitor. Within half a minute, a digital readout indicates the 
blood glucose value. Modern glucose monitors store several days 
of glucose measurements, and the data can be easily transferred to 
a computer for analysis and graphing. 





*How does the monitor deal with getting just the right amount of blood? 
The blood flows up an absorbent “wick” by capillary action. It is impossible 
to overfill this device, but the monitor will give an error signal if not 
enough blood flows up the strip. 








esis are the liver and kidneys, which account for about 90% and 10% of the body’s 
gluconeogenic activity, respectively. Glucose produced by gluconeogenesis in the 
liver and kidneys is released into the blood and is subsequently absorbed by brain, 
heart, muscle, and red blood cells to meet their metabolic needs. In turn, pyruvate 
and lactate produced in these tissues are returned to the liver and kidneys to be 
used as gluconeogenic substrates. 


Gluconeogenesis Is Not Merely the Reverse of Glycolysis 


In some ways, gluconeogenesis is the reverse, or antithesis, of glycolysis. Glucose 
is synthesized, not catabolized; ATP is consumed, not produced; and NADH is ox- 
idized to NAD*, rather than the other way around. However, gluconeogenesis 
cannot be merely the reversal of glycolysis, for two reasons. First, glycolysis is exer- 
gonic, with a AG°’ of approximately —74 kJ/mol. If gluconeogenesis were merely 
the reverse, it would be a strongly endergonic process and could not occur spon- 
taneously. Somehow the energetics of the process must be augmented so that glu- 
coneogenesis can proceed spontaneously. Second, the processes of glycolysis and 
gluconeogenesis must be regulated in a reciprocal fashion so that when glycolysis 
is active, gluconeogenesis is inhibited, and when gluconeogenesis is proceeding, 
glycolysis is turned off. Both of these limitations are overcome by having unique 
reactions within the routes of glycolysis and gluconeogenesis, rather than a com- 
pletely shared pathway. 


Gluconeogenesis—Something Borrowed, Something New 


The complete route of gluconeogenesis is shown in Figure 22.1, side by side with the 
glycolytic pathway. Gluconeogenesis employs four different reactions, catalyzed by 
four different enzymes, for the three steps of glycolysis that are highly exergonic 
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FIGURE 22.1 The pathways of gluconeogenesis and 
glycolysis. Species in blue, green, and peach-colored 
shaded boxes indicate other entry points for gluconeo- 
genesis (in addition to pyruvate). 





(and highly regulated). In essence, seven of the ten steps of glycolysis are merely re- 
versed in gluconeogenesis. The six reactions between fructose-1,6-bisphosphate and 
PEP are shared by the two pathways, as is the isomerization of glucose-6-P to fructose- 
6-P. The three exergonic, regulated reactions—the hexokinase (glucokinase), phos- 
phofructokinase, and pyruvate kinase reactions—are replaced by alternative reac- 
tions in the gluconeogenic pathway. 

The conversion of pyruvate to PEP that initiates gluconeogenesis is accom- 
plished by two unique reactions. Pyruvate carboxylase catalyzes the first, convert- 
ing pyruvate to oxaloacetate. Then, PEP carboxykinase catalyzes the conversion 
of oxaloacetate to PEP. Conversion of fructose-1,6-bisphosphate to fructose-6- 
phosphate is catalyzed by a specific phosphatase, fructose-1,6-bisphosphatase. 
The final step to produce glucose, hydrolysis of glucose-6-phosphate, is mediated 


22.1 


by glucose-6-phosphatase. Each of these steps is considered in detail in the fol- 
lowing paragraphs. The overall conversion of pyruvate to PEP by pyruvate car- 
boxylase and PEP carboxykinase has a A G°’ close to zero but is pulled along by sub- 
sequent reactions. The conversion of fructose-1,6-bisphosphate to glucose in the 
last three steps of gluconeogenesis is strongly exergonic, with a AG°’ of about 
—30.5 kJ/mol. This sequence of two phosphatase reactions separated by an iso- 
merization accounts for most of the free energy release that makes the gluconeo- 
genesis pathway spontaneous. 


Four Reactions Are Unique to Gluconeogenesis 


1. Pyruvate Carboxylase—A Biotin-Dependent Enzyme Initiation of gluconeo- 
genesis occurs in the pyruvate carboxylase reaction—the conversion of pyruvate to 
oxaloacetate. 


O 
| + HCO; + ATP —> | + ADP + P, 
CH3C COOR Tooc ECCO COOm 
Pyruvate Bicarbonate Oxaloacetate 


The reaction takes place in two discrete steps, involves ATP and bicarbonate as sub- 
strates, and utilizes biotin as a coenzyme and acetyl-CoA as an allosteric activator. 
Pyruvate carboxylase is a tetrameric enzyme (with a molecular mass of about 
500 kD). Each monomer possesses a biotin covalently linked to the e-amino group 
of a lysine residue at the active site (Figure 22.2). The first step of the reaction 
involves nucleophilic attack of a bicarbonate oxygen at the y-P of ATP to form 
carbonylphosphate, an activated form of COs, and ADP (Figure 22.3). Reaction of 
carbonylphosphate with biotin occurs rapidly to form N-carboxybiotin, liberating 
inorganic phosphate. The third step involves abstraction of a proton from the C-3 
of pyruvate, forming a carbanion that can attack the carbon of N-carboxybiotin to 
form oxaloacetate. 


Pyruvate Carboxylase Is Allosterically Activated by Acetyl-Coenzyme A Two particu- 
larly interesting aspects of the pyruvate carboxylase reaction are (1) allosteric activa- 
tion of the enzyme by acyl-CoA derivatives and (2) compartmentation of the reaction 











Ta tot 
TOR Lo P— O — P— O— Adenosine 
| | | 
On ‘On On 
~o ADP O (0) P; oO 
x + Il | Il O 
Poa -0 — C— O — P— O7 meet RE 
HO | 
o- 
i i 
TCH ~ C— COO 
H H 5 
Biotin 
7 7 
Be | 
Lys H — CH,—C— COO7 


:B—® 1 


What Is Gluconeogenesis, and How Does It Operate? 


665 


CH,CH,CH,CH, — NH Lysine 


A 
CH, 
N 
CH, 
A 
CHo Biotin 
N 
CH, 
H 
N 
S oO 
N 
H 


FIGURE 22.2 Covalent linkage of biotin to an active-site 
lysine in pyruvate carboxylase. 





In most organisms, pyruvate carboxylase is a homo- 
tetramer of 130-kD subunits, with each subunit com- 
posed of three functional domains named biotin car- 
boxylase, carboxyl transferase, and biotin carboxyl carrier 
protein. Shown here is the biotin carboxylase domain of 
pyruvate carboxylase from Bacillus thermodenitrificans. 
(pdb id = 2DZD). 


Lys 


O 





~O—C — CH, — C — CoO 


FIGURE 22.3 A mechanism for the pyruvate carboxylase reaction. Bicarbonate must be activated for attack 
by the pyruvate carbanion. This activation is driven by ATP and involves formation of a carbonylphosphate 
intermediate—a mixed anhydride of carbonic and phosphoric acids. (Carbonylphosphate and carboxyphos- 
phate are synonyms.) 
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FIGURE 22.4 Pyruvate carboxylase is a compartmental- 
ized reaction. Pyruvate is converted to oxaloacetate in 
the mitochondria. Because oxaloacetate cannot be trans- 
ported across the mitochondrial membrane, it must be 
reduced to malate, transported to the cytosol, and then 
oxidized back to oxaloacetate before gluconeogenesis 
can continue. 
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PEP carboxykinase from Escherichia coli with ADP (blue), 
pyruvate (purple), and Mg?* (pdb id = 10S’). 


in the mitochondrial matrix. The carboxylation of biotin requires the presence (at 
an allosteric site) of acetyl-CoA or other acylated CoA derivatives. The second half of 
the carboxylase reaction—the attack by pyruvate to form oxaloacetate—is not af- 
fected by CoA derivatives. 

Activation of pyruvate carboxylase by acetyl-CoA provides an important physio- 
logical regulation. Acetyl-CoA is the primary substrate for the TCA cycle, and oxa- 
loacetate (formed by pyruvate carboxylase) is an important intermediate in both 
the TCA cycle and the gluconeogenesis pathway. If levels of ATP and/or acetyl-CoA 
(or other acyl-CoAs) are low, pyruvate is directed primarily into the TCA cycle, 
which eventually promotes the synthesis of ATP. If ATP and acetyl-CoA levels are 
high, pyruvate is converted to oxaloacetate and consumed in gluconeogenesis. 
Clearly, high levels of ATP and CoA derivatives are signs that energy is abundant 
and that metabolites will be converted to glucose (and perhaps even glycogen). If 
the energy status of the cell is low (in terms of ATP and CoA derivatives), pyruvate 
is consumed in the TCA cycle. Also, as noted in Chapter 19, pyruvate carboxylase is 
an important anaplerotic enzyme. Its activation by acetyl-CoA leads to oxaloacetate 
formation, replenishing the level of TCA cycle intermediates. 


Compartmentalized Pyruvate Carboxylase Depends on Metabolite Conversion and 
Transport The second interesting feature of pyruvate carboxylase is that it is found 
only in the matrix of the mitochondria. By contrast, the next enzyme in the gluco- 
neogenic pathway, PEP carboxykinase, may be localized in the cytosol, in the mito- 
chondria, or both. For example, rabbit liver PEP carboxykinase is predominantly 
mitochondrial, whereas the rat liver enzyme is strictly cytosolic. In human liver, PEP 
carboxykinase is found both in the cytosol and in the mitochondria. Pyruvate is trans- 
ported into the mitochondrial matrix (Figure 22.4), where it can be converted to 
acetyl-CoA (for use in the TCA cycle) and then to citrate (for fatty acid synthesis; see 
Figure 24.1). Alternatively, it may be converted directly to OAA by pyruvate carboxy- 
lase and used in gluconeogenesis. In tissues where PEP carboxykinase is found only 
in the mitochondria, oxaloacetate is converted to PEP, which is then transported to 
the cytosol for gluconeogenesis. However, in tissues that must convert some oxa- 
loacetate to PEP in the cytosol, a problem arises. Oxaloacetate cannot be transported 
directly across the mitochondrial membrane. Instead, it must first be transformed 
into malate or aspartate for transport across the mitochondrial inner membrane 
(Figure 22.4). Cytosolic malate and aspartate must be reconverted to oxaloacetate 
before continuing along the gluconeogenic route. 


2. PEP Carboxykinase The second reaction in the gluconeogenic pyruvate-PEP 
bypass is the conversion of oxaloacetate to PEP. 





oO S 
X DON O, PO penne 
C—COOT + GTP > C— COO” + GDP + 
| PEP carboxykinase | 
H.C — M CH, 
Oxaloacetate PEP 


Production of a high-energy metabolite such as PEP requires energy. The energetic 
requirements are handled in two ways here. First, the CO» added to pyruvate in the 
pyruvate carboxylase step is removed in the PEP carboxykinase reaction. Decarboxy- 
lation is a favorable process and helps drive the formation of the very high-energy 
enol phosphate in PEP. This decarboxylation drives a reaction that would otherwise 
be highly endergonic. Note the inherent metabolic logic in this pair of reactions: 
Pyruvate carboxylase consumed an ATP to drive a carboxylation so that the PEP 
carboxykinase could use the decarboxylation to facilitate formation of PEP. Second, 
another high-energy phosphate is consumed by the carboxykinase. Mammals and 
several other species use GTP in this reaction, rather than ATP. The use of GTP here 
is equivalent to the consumption of an ATP, due to the activity of the nucleoside 
diphosphate kinase (see Figure 19.2). The substantial free energy of hydrolysis of GTP 
is crucial to the synthesis of PEP in this step. The overall AG for the pyruvate car- 
boxylase and PEP carboxykinase reactions under physiological conditions in the liver 
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is —22.6 kJ/mol. Once PEP is formed in this way, the phosphoglycerate mutase, phos- 
phoglycerate kinase, glyceraldehyde-3-P dehydrogenase, aldolase, and triose phos- 
phate isomerase reactions act to eventually form fructose-1,6-bisphosphate, as shown 
in Figure 22.1. 


3. Fructose-1,6-Bisphosphatase The hydrolysis of fructose-1,6-bisphosphate to 
fructose-6-phosphate, 


79,POH,C ON cH,OPOe *OsPOHSC OO. cH,OH 
H HO + HO ———> H HO + ® 
H OH Fructose-1,6- H OH 
bisphosphatase 
HO H HO H 
AG” = -16.7 kJ/mol 
Fructose-1,6-bisphosphate Fructose-6-phosphate 


like all phosphate ester hydrolyses, is a thermodynamically favorable (exergonic) 
reaction under standard-state conditions (AG°’ = —16.7 kJ/mol). Under physio- 
logical conditions in the liver, the reaction is also exergonic (AG = —8.6 kJ/mol). 
Fructose-1,6-bisphosphatase is an allosterically regulated enzyme. Citrate stimu- 
lates bisphosphatase activity, but fructose-2,6-bisphosphate is a potent allosteric in- 
hibitor. AMP also inhibits the bisphosphatase; the inhibition by AMP is enhanced 
by fructose-2,6-bisphosphate. 


4. Glucose-6-Phosphatase The final step in the gluconeogenesis pathway is 
the conversion of glucose-6-phosphate to glucose by the action of glucose-6- 
phosphatase (Figure 22.5). This enzyme is present in the membranes of the endo- 
plasmic reticulum of liver and kidney cells but is absent in muscle and brain. For 
this reason, gluconeogenesis is not carried out in muscle and brain. The glucose- 
6-phosphatase system includes the phosphatase itself and three transport proteins, 
T1, T2, and T3. The glucose-6-phosphate transporter (T1) takes glucose-6-phos- 
phate into the endoplasmic reticulum, where it is hydrolyzed to glucose and P;. 
The T2 and T3 transporters export glucose and P;, respectively, to the cytosol, and Frietose te bisphosphatase fon pigiwih Paces: 
glucose is then exported (to the circulation) by the GLUT2 transporter. The phosphate (orange), AMP (blue), and Mg?" (dark blue) 
glucose-6-phosphatase reaction involves a phosphorylated enzyme intermediate, (pdb id = 1FBP). 

phosphohistidine (Figure 22.6). The AG for the glucose-6-phosphatase reaction in 

liver is —5.1 kJ/mol. 





Glucose-6-phosphate 


G-6-P a Glucose-6-phosphatase 


transporter 





FIGURE 22.5 Glucose-6-phosphatase is localized in 

the endoplasmic reticulum. Conversion of glucose-6- 
phosphate to glucose occurs following transport into 
the endoplasmic reticulum. Glucose-6-phosphatase and 
the three transporters, T1, T2, and T3, are known collec- 
transporter tively as the glucose-6-phosphatase system. 
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FIGURE 22.6 The glucose-6-phosphatase reaction in- 
volves formation of a phosphohistidine intermediate. 
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Coupling with Hydrolysis of ATP and GTP Drives Gluconeogenesis_ The net reaction 


for the conversion of pyruvate to glucose in gluconeogenesis is 


2 Pyruvate + 4 ATP + 2 GTP + 2 NADH + 2 H+ + 6 HXO—> 
glucose + 4 ADP + 2 GDP + 6 P; + 2 NAD* 





The net free energy change, A G°’, for this conversion is —37.7 kJ/mol. The con- 
sumption of a total of six nucleoside triphosphates drives this process forward. If 
glycolysis were merely reversed to achieve the net synthesis of glucose from pyru- 
vate, the net reaction would be 


2 Pyruvate + 2 ATP + 2 NADH +4 





2H* + 2H,O—> 
glucose + 





2 ADP + 2 P; + 2 NAD* 


HUMAN BIOCHEMISTRY 





Gluconeogenesis Inhibitors and Other Diabetes Therapy Strategies 


Diabetes, the inability to assimilate and metabolize blood glucose, 
afflicts millions of people. People with type 1 diabetes are unable 
to synthesize and secrete insulin. On the other hand, people with 
type 2 diabetes make sufficient insulin, but the molecular path- 
ways that respond to insulin are defective. Many type 2 diabetic 
people exhibit a condition termed insulin resistance even before 
the onset of diabetes. Metformin (see accompanying figure) is a 
drug that improves sensitivity to insulin, primarily by stimulating 
glucose uptake by glucose transporters in peripheral tissues. It also 
increases binding of insulin to insulin receptors, stimulates tyro- 
sine kinase activity (see Chapter 32) of the insulin receptor, and 
inhibits gluconeogenesis in the liver. 


Gluconeogenesis inhibitors may be the next wave in diabetes 
therapy. Drugs that block gluconeogenesis without affecting glycol- 
ysis would need to target one of the enzymes unique to gluconeo- 
genesis. 3-Mercaptopicolinate and hydrazine inhibit PEP carboxyk- 
inase, and chlorogenic acid, a natural product found in the skin of 
peaches, inhibits the transport activity of the glucose-6-phosphatase 
system (but not the glucose-6-phosphatase enzyme activity). The 
drug S-3483, a derivative of chlorogenic acid, also inhibits the glu- 
cose-6-phosphatase transport activity and binds a thousand times 
more tightly to the transporter than chlorogenic acid. Drugs of this 
type may be useful in the treatment of type 2 diabetes. 
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and the overall A G°’ would be about +74 kJ/mol. Such a process would be highly 
endergonic and therefore thermodynamically unfeasible. Hydrolysis of four addi- 
tional high-energy phosphate bonds makes gluconeogenesis thermodynamically 
favorable. Under physiological conditions, however, gluconeogenesis is somewhat 
less favorable than at standard state, with an overall AG of —15.6 kJ/mol for the 
conversion of pyruvate to glucose. 


Lactate Formed in Muscles Is Recycled to Glucose in the Liver A final point on the re- 
distribution of lactate and glucose in the body serves to emphasize the metabolic in- 
teractions between organs. Vigorous exercise can lead to oxygen shortage (anaero- 
bic conditions), and energy requirements must be met by increased levels of 
glycolysis. Under such conditions, glycolysis converts NAD* to NADH, yet O% is un- 
available for regeneration of NAD* via cellular respiration. Instead, large amounts 
of NADH are reoxidized by the reduction of pyruvate to lactate. The lactate thus 
produced can be transported from muscle to the liver, where it is reoxidized by liver 
lactate dehydrogenase to yield pyruvate, which is converted eventually to glucose. In 
this way, the liver shares in the metabolic stress created by vigorous exercise. It ex- 
ports glucose to muscle, which produces lactate, and lactate from muscle can be 
processed by the liver into new glucose. This is referred to as the Cori cycle (Figure 
22.7). Liver, with a typically high NAD*/NADH ratio (about 700), readily produces 
more glucose than it can use. Muscle that is vigorously exercising will enter anaer- 
obiosis and show a decreasing NAD*/NADH ratio, which favors reduction of pyru- 
vate to lactate. 


22:28 How Is Gluconeogenesis Regulated? 


Nearly all of the reactions of glycolysis and gluconeogenesis take place in the cyto- 
sol. If metabolic control were not exerted over these reactions, glycolytic degrada- 
tion of glucose and gluconeogenic synthesis of glucose could operate simultane- 
ously, with no net benefit to the cell and with considerable consumption of ATP. 
This is prevented by a sophisticated system of reciprocal control, which inhibits 
glycolysis when gluconeogenesis is active, and vice versa. Reciprocal regulation of 
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FIGURE 22.7 The Cori cycle. 
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CRITICAL DEVELOPMENTS IN BIOCHEMISTRY 


The Pioneering Studies of Carl and Gerty Cori 


The Cori cycle is named for Carl and Gerty Cori, who received the 
Nobel Prize in Physiology or Medicine in 1947 for their studies of 
glycogen metabolism and blood glucose regulation. Carl Ferdi- 
nand Cori and Gerty Theresa Radnitz were both born in Prague 
(then in Austria). They earned medical degrees from the German 
University of Prague in 1920 and were married later that year. 
They joined the faculty of the Washington University School of 
Medicine in St. Louis in 1931. Their remarkable collaboration re- 
sulted in many fundamental advances in carbohydrate and glyco- 
gen metabolism. They were credited with the discovery of glucose- 
1-phosphate, also known at the time as the “Cori ester.” They also 


showed that glucose-6-phosphate was produced from glucose-1-P 
by the action of phosphoglucomutase. They isolated and crystal- 
lized glycogen phosphorylase and elucidated the pathway of glyco- 
gen breakdown. In 1952, they showed that absence of glucose-6- 
phosphatase in the liver was the enzymatic defect in von Gierke’s 
disease, an inherited glycogen-storage disease. Six eventual Nobel 
laureates received training in their laboratory. Gerty Cori was the 
first American woman to receive a Nobel Prize. Carl Cori said of 
their remarkable collaboration: “Our efforts have been largely 
complementary and one without the other would not have gone 
so far...” 











these two pathways depends largely on the energy status of the cell. When the en- 
ergy status of the cell is low, glucose is rapidly degraded to produce needed energy. 
When the energy status is high, pyruvate and other metabolites are utilized for syn- 
thesis (and storage) of glucose. Moreover, when blood glucose levels are low, gluco- 
neogenesis is active. 

In glycolysis, the three regulated enzymes are those catalyzing the strongly 
exergonic reactions: hexokinase (glucokinase), phosphofructokinase, and pyruvate 
kinase. As noted, the gluconeogenic pathway replaces these three reactions with 
corresponding reactions that are exergonic in the direction of glucose synthesis: 
glucose-6-phosphatase, fructose-1,6-bisphosphatase, and the pyruvate carboxylase- 
PEP carboxykinase pair, respectively. These are the three most appropriate sites of 
regulation in gluconeogenesis. 


Gluconeogenesis Is Regulated by Allosteric and Substrate-Level 
Control Mechanisms 


The mechanisms of regulation of gluconeogenesis are shown in Figure 22.8. Control 
is exerted at all of the predicted sites, but in different ways. Glucose-6-phosphatase is 
not under allosteric control. However, the K,,, for the substrate, glucose-6-phosphate, 
is considerably higher than the normal range of substrate concentrations. As a 
result, glucose-6-phosphatase displays a nearlinear dependence of activity on sub- 
strate concentrations and is thus said to be under substrate-level control by glucose- 
6-phosphate. 

Acetyl-CoA is a potent allosteric effector of glycolysis and gluconeogenesis. It al- 
losterically inhibits pyruvate kinase (as noted in Chapter 18) and activates pyruvate 
carboxylase. Because it also allosterically inhibits pyruvate dehydrogenase (the en- 
zymatic link between glycolysis and the TCA cycle), the cellular fate of pyruvate is 
strongly dependent on acetyl-CoA levels. A rise in [acetyl-CoA] indicates that cellu- 
lar energy levels are high and that carbon metabolites can be directed to glucose 
synthesis and storage. When acetyl-CoA levels drop, the activities of pyruvate kinase 
and pyruvate dehydrogenase increase and flux through the TCA cycle increases, 
providing needed energy for the cell. 

Fructose-1,6-bisphosphatase is another important site of gluconeogenic regula- 
tion. This enzyme is inhibited by AMP and activated by citrate. These effects by AMP 
and citrate are the opposites of those exerted on phosphofructokinase in glycolysis, 
providing another example of reciprocal regulatory effects. When AMP levels in- 
crease, gluconeogenic activity is diminished and glycolysis is stimulated. An increase 
in citrate concentration signals that TCA cycle activity can be curtailed and that 
pyruvate should be directed to sugar synthesis instead. 
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FIGURE 22.8 The principal regulatory mechanisms in glycolysis and gluconeogenesis. Activators are indicated 
by plus signs and inhibitors by minus signs. 


bisphosphate is a potent stimulator of phosphofructokinase (see Chapter 18). Cog- 
nizant of the reciprocal nature of regulation in glycolysis and gluconeogenesis, 
Van Schaftingen and Hers also considered the possibility of an opposite effect— 
inhibition—for fructose-1,6-bisphosphatase. In 1981, they reported that fructose- 
2,6-bisphosphate was indeed a powerful inhibitor of fructose-1,6-bisphosphatase 
(Figure 22.9). Inhibition occurs in either the presence or absence of AMP, and the 
effects of AMP and fructose-2,6-bisphosphate are synergistic. 

Cellular levels of fructose-2,6-bisphosphate are controlled by phosphofructoki- 
nase-2 (PFK-2), an enzyme distinct from the phosphofructokinase of the glycolytic 
pathway, and by fructose-2,6-bisphosphatase (F-2,6-BPase). Remarkably, these two 
enzymatic activities are both found in the same protein molecule, which is an exam- 
ple of a bifunctional, or tandem, enzyme (Figure 22.10). The opposing activities 
of this bifunctional enzyme are themselves regulated in two ways. First, fructose-6- 
phosphate, the substrate of phosphofructokinase and the product of fructose-1,6- 
bisphosphatase, allosterically activates PFK-2 and inhibits F-2,6-BPase. Second, the 
phosphorylation by cAMP-dependent protein kinase of a single Ser (Ser**) residue on 
the liver enzyme exerts reciprocal control of the PFK-2 and F-2,6-BPase activities. Phos- 
phorylation inhibits PFK-2 activity (by increasing the K,, for fructose-6-phosphate) and 
stimulates F-2,6-BPase activity. 
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FIGURE 22.9 Inhibition of fructose-1,6-bisphosphatase by fructose-2,6-bisphosphate in the (a) absence and 

(b) presence of 25 mM AMP. In (a) and (b), enzyme activity is plotted against substrate (fructose-1,6-bisphosphate) 
concentration. Concentrations of fructose-2,6-bisphosphate (in mM) are indicated above each curve. (c) The effect 
of AMP (0, 10, and 25 mM) on the inhibition of fructose-1,6-bisphosphatase by fructose-2,6-bisphosphate. Activity 
was measured in the presence of 10 mM fructose-1,6-bisphosphate. (Adapted from Van Schaftingen, E., and Hers, H-G, 
1981. Inhibition of fructose-1,6-bisphosphatase by fructose-2,6-bisphosphate. Proceedings of the National Academy of Science, U.S.A. 
78:2861-2863.) 
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FIGURE 22.10 (a) Synthesis and degradation of fructose-2,6-bisphosphate are catalyzed by the same bifunctional 
enzyme. (b) The structure of PFK-2/F-2,6-BPase from rat liver. PFK-2 activity resides in the N-terminal portion of the 
protein (left), and the C-terminal domain (right) contains F-2,6-BPase activity. The PFK-2 domain has a bound ATP 
analog; the F2, 6-Pase has two phosphates bound (pdb id = 1BIF). 


Substrate Cycles Provide Metabolic Control Mechanisms 


If fructose-1,6-bisphosphatase and phosphofructokinase acted simultaneously, they 
would constitute a substrate cycle in which fructose-1,6-bisphosphate and fructose- 
6-phosphate became interconverted with net consumption of ATP: 


Fructose-1,6-BP + HO —> fructose-6-P + P; 
Fructose-6-P + ATP —> fructose-1,6-BP + ADP 


Net: ATP + H:O —> ADP + P; 





Because substrate cycles such as this appear to operate with no net benefit to the 
cell, they were once regarded as metabolic quirks and were referred to as futile 
cycles. More recently, substrate cycles have been recognized as important devices 
for controlling metabolite concentrations. 

The three steps in glycolysis and gluconeogenesis that differ constitute three such 
substrate cycles, each with its own particular metabolic raison d’étre. Consider, for 
example, the regulation of the fructose-1,6-BP—fructose-6-P cycle by fructose-2,6- 
bisphosphate. As already noted, fructose-1,6-bisphosphatase is subject to allosteric in- 
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hibition by fructose-2,6-bisphosphate, whereas phosphofructokinase is allosterically 
activated by fructose-2,6-bisphosphate. The combination of these effects should per- 
mit either phosphofructokinase or fructose-1,6-bisphosphatase (but not both) to 
operate at any one time and should thus prevent futile cycling. For instance, in 
the fasting state, when food (that is, glucose) intake is zero, phosphofructokinase 
(and therefore glycolysis) is inactive due to the low concentration of fructose-2,6- 
bisphosphate. In the liver, gluconeogenesis operates to provide glucose for the 
brain. However, in the fed state, up to 30% of fructose-1,6-bisphosphate formed 
from phosphofructokinase is recycled back to fructose-6-P (and then to glucose). 
Because the dependence of fructose-1,6-bisphosphatase activity on fructose-1,6- 
bisphosphate is sigmoidal in the presence of fructose-2,6-bisphosphate (see 
Figure 22.9), substrate cycling occurs only at relatively high levels of fructose-1,6- 
bisphosphate. Substrate cycling in this case prevents the accumulation of excessively 
high levels of fructose-1,6-bisphosphate. 


22:38 How Are Glycogen and Starch Catabolized in Animals? 


Dietary Starch Breakdown Provides Metabolic Energy 


As noted earlier, well-fed adult human beings normally metabolize about 160 grams 
of carbohydrates each day. A balanced diet easily provides this amount, mostly in the 
form of starch. If too little carbohydrate is supplied by the diet, glycogen reserves in 
liver and muscle tissue can also be mobilized. The reactions by which ingested starch 
and glycogen are digested are shown in Figure 22.11. The enzyme known as @-amy- 
lase is an important component of saliva and pancreatic juice. (6-Amylase is found 
in plants. The a- and 6-designations for these enzymes serve only to distinguish the 
two and do not refer to glycosidic linkage nomenclature.) a-Amylase is an endogly- 
cosidase that hydrolyzes a(1—4) linkages of amylopectin and glycogen at random 
positions, eventually producing a mixture of maltose, maltotriose [with three 
a(1—>4)-linked glucose residues], and other small oligosaccharides. a-Amylase can 


(a) 








FIGURE 22.11 (a) The sites of hydrolysis of starch by 
a- and B-amylase are indicated. (b) Glycogenin is a 
glycosyltransferase that initiates eukaryotic glycogen 
synthesis from glucose. Bound UDP-glucose (blue) and 
Mn?* ion (purple) are shown (pdb id = 1LL2). 
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cleave on either side of an amylopectin branch point, but activity is reduced in highly 
branched regions of the polysaccharide and stops four residues from any branch 
point. 

The highly branched polysaccharides that are left after extensive exposure to 
a-amylase are called limit dextrins. These structures can be further degraded by the 
action of a debranching enzyme, which carries out two distinct reactions. The first of 
these, known as oligo(@1,4—a@1,4) glucanotransferase activity, removes a trisaccha- 
ride unit and transfers this group to the end of another, nearby branch (Figure 
22.12). This leaves a single glucose residue in a(1—6) linkage to the main chain. The 
a(1—6) glucosidase activity of the debranching enzyme then cleaves this residue 
from the chain, leaving a polysaccharide chain with one branch fewer. Repetition of 
this sequence of events leads to complete degradation of the polysaccharide. 

B-Amylase is an exoglycosidase that cleaves maltose units from the free, nonre- 
ducing ends of amylopectin branches, as in Figure 22.11. Like a-amylase, however, 
B-amylase does not cleave either the a(1—6) bonds at the branch points or the 
a(1—>4) linkages near the branch points. 


Metabolism of Tissue Glycogen Is Regulated 


Digestion itself is a highly efficient process in which almost 100% of ingested food 
is absorbed and metabolized. Digestive breakdown of starch is an unregulated 
process. On the other hand, tissue glycogen represents an important reservoir of 
potential energy, and it should be no surprise that the reactions involved in its 
degradation and synthesis are carefully controlled and regulated. Glycogen re- 
serves in liver and muscle tissue are stored in the cytosol as granules exhibiting a 
molecular weight range from 6 X 10° to 1600 X 10°. These granular aggregates 
contain the enzymes required to synthesize and catabolize the glycogen, as well as 
all the enzymes of glycolysis. 

The principal enzyme of glycogen catabolism is glycogen phosphorylase, a highly 
regulated enzyme that was discussed extensively in Chapter 15. The glycogen phos- 
phorylase reaction (Figure 22.13) involves phosphorolysis at a nonreducing end of 
a glycogen polymer. The standard-state free energy change for this reaction is 
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+3.1 kJ/mol, but the intracellular ratio of [P;] to [glucose-1-P] approaches 100, and 
thus the actual AG in vivo is approximately —6 kJ/mol. There is an energetic ad- 
vantage to the cell in this phosphorolysis reaction. If glycogen breakdown were hy- 
drolytic and yielded glucose as a product, it would be necessary to phosphorylate 
the product glucose (with the expenditure of a molecule of ATP) to initiate its gly- 
colytic degradation. 

The glycogen phosphorylase reaction degrades glycogen to produce limit dex- 
trins, which are further degraded by debranching enzyme, as already described. 


22:45 How Is Glycogen Synthesized? 


Animals synthesize and store glycogen when glucose levels are high, but the syn- 
thetic pathway is not merely a reversal of the glycogen phosphorylase reaction. High 
levels of phosphate in the cell favor glycogen breakdown and prevent the phos- 
phorylase reaction from synthesizing glycogen in vivo, despite the fact that AG°’ for 
the phosphorylase reaction actually favors glycogen synthesis. Hence, another reac- 
tion pathway must be employed in the cell for the net synthesis of glycogen. In 
essence, this pathway must activate glucose units for transfer to glycogen chains. 


Glucose Units Are Activated for Transfer by Formation 
of Sugar Nucleotides 


We are familiar with several examples of chemical activation as a strategy for group 
transfer reactions. Acetyl-CoA is an activated form of acetate; biotin and tetrahydro- 
folate activate one-carbon groups for transfer; and ATP is an activated form of phos- 
phate. Luis Leloir, a biochemist in Argentina, showed in the 1950s that glycogen syn- 
thesis depended upon sugar nucleotides, which may be thought of as activated forms 
of sugar units. For example, formation of an ester linkage between the C-1 hydroxyl 
group and the B-phosphate of UDP activates the glucose moiety of UDP-glucose. 
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FIGURE 22.13 The glycogen phosphorylase reaction. 





Uridine diphosphate glucose (UDP-glucose). 
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Glycogen synthase from Agrobacterium tumefaciens 
consists of two Rossman folds (see Chapter 16) sepa- 
rated by a deep cleft that includes the active site 
(shown here with bound ADP, purple) (pdb id = 1RZU). 


CENGAGENOW” ANIMATED FIGURE 22.14 The UDP- 
glucose pyrophosphorylase reaction is a phospho- 
anhydride exchange, with a phosphoryl oxygen of 
glucose-1-P attacking the a-phosphorus of UTP to form 
UDP-glucose and pyrophosphate. See this figure ani- 
mated at www.cengage.com/login. 


UDP-Glucose Synthesis Is Driven by Pyrophosphate Hydrolysis 


Sugar nucleotides are formed from sugar-l-phosphates and nucleoside triphos- 
phates by specific pyrophosphorylase enzymes (Figure 22.14). For example, UDP- 
glucose pyrophosphorylase catalyzes the formation of UDP-glucose from glucose- 
1-phosphate and uridine 5’-triphosphate: 


Glucose-1-P + UTP — > UDP-glucose + pyrophosphate 


The reaction proceeds via attack by a phosphate oxygen of glucose-1-phosphate on 
the a-phosphorus of UTP, with departure of the pyrophosphate anion. The reaction 
is a reversible one, but—as is the case for many biosynthetic reactions—it is driven 
forward by subsequent hydrolysis of pyrophosphate: 


Pyrophosphate + H:O —> 2 P; 


The net reaction for sugar nucleotide formation (combining the preceding two 
equations) is thus 


Glucose-1-P + UTP + HO —> UDP-glucose + 2 P; 


Sugar nucleotides of this type act as donors of sugar units in the biosynthesis of 
oligosaccharides and polysaccharides. In animals, UDP-glucose is the donor of glu- 
cose units for glycogen synthesis, but ADP-glucose is the glucose source for starch 
synthesis in plants. 


Glycogen Synthase Catalyzes Formation of a(1— 4) Glycosidic 
Bonds in Glycogen 


The very large glycogen polymer is built around a tiny protein core. The first glu- 
cose residue is covalently joined to the protein glycogenin (see Figure 22.11b) via 
an acetal linkage to a tyrosine—OH group on the protein. Sugar units can then be 
added by the action of glycogen synthase. The reaction involves transfer of a glu- 
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Advanced Glycation End Products—A Serious Complication of Diabetes 


Covalent linkage of sugars to proteins to form glycoproteins nor- 
mally occurs through the action of enzymes that use sugar nucleo- 
tides as substrates. However, sugars may also react nonenzymati- 
cally with proteins. The C-1 carbonyl group of glucose forms Schiff 
base linkages with lysine side chains of proteins. These Schiff base 
adducts undergo Amadori rearrangements and subsequent oxida- 
tions to form irreversible “glycation” products, including carboxy- 
methyllysine and pentosidine derivatives (see accompanying fig- 
ure). These advanced glycation end products (AGEs) can alter 


the function of the protein. Such AGE-dependent changes are 
thought to contribute to circulation, joint, and vision problems in 
people with diabetes. 

Nonenzymatic glycation of hemoglobin is a better diagnostic 
yardstick for type-2 diabetes than serum glucose levels. Red blood 
cells have an average life expectancy of about 4 months. By mea- 
suring the concentration of “glycated hemoglobin” in a patient, it 
is possible to determine the average glucose concentration in the 
blood over the past several months. 
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cosyl unit from UDP-glucose to the C-4 hydroxyl group at a nonreducing end of a 
glycogen strand. The mechanism proceeds by cleavage of the C—O bond between 
the glucose moiety and the -phosphate of UDP-glucose, leaving an oxonium ion 
intermediate, which is rapidly attacked by the C-4 hydroxyl oxygen of a terminal glu- 
cose unit on glycogen (Figure 22.15). The reaction is exergonic and has a AG?” of 
—13.3 kJ/mol. 


Glycogen Branching Occurs by Transfer of Terminal Chain Segments 


Glycogen is a branched polymer of glucose units. The branches arise from 
a(1—6) linkages, which occur every 8 to 12 residues. As noted in Chapter 7, the 
branches provide multiple sites for rapid degradation or elongation of the poly- 
mer and also increase its solubility. Glycogen branches are formed by amylo- 
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FIGURE 22.16 Formation of glycogen branches by the 
branching enzyme. Six- or seven-residue segments of 
a growing glycogen chain are transferred to the C-6 
hydroxyl group of a glucose residue on the same or a 
nearby chain. 
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CENGAGENOW” ANIMATED FIGURE 22.15 The glycogen synthase reaction. Cleavage of the C—O bond of 
UDP-glucose yields an oxonium intermediate. Attack by the hydroxyl oxygen of the terminal residue of a 
glycogen molecule completes the reaction. See this figure animated at www.cengage.com/login. 


(1,4 1,6)-transglycosylase, also known as branching enzyme. The reaction involves 
the transfer of a 6- or 7-residue segment from the nonreducing end of a linear 
chain at least 11 residues in length to the C-6 hydroxyl of a glucose residue of the 
same chain or another chain (Figure 22.16). For each branching reaction, the 
resulting polymer has gained a new terminus at which growth can occur. 


22:55 How Is Glycogen Metabolism Controlled? 
Glycogen Metabolism Is Highly Regulated 


Synthesis and degradation of glycogen must be carefully controlled so that this im- 
portant energy reservoir can properly serve the metabolic needs of the organism. Glu- 
cose is the principal metabolic fuel for the brain, and the concentration of glucose in 
circulating blood must be maintained at about 5 mM for this purpose. Glucose derived 
from glycogen breakdown is also a primary energy source for muscle contraction. Con- 
trol of glycogen metabolism is effected via reciprocal regulation of glycogen phospho- 
rylase and glycogen synthase. Thus, activation of glycogen phosphorylase is tightly 
linked to inhibition of glycogen synthase, and vice versa. Regulation involves both al- 
losteric control and covalent modification, with the latter being under hormonal con- 
trol. The regulation of glycogen phosphorylase is discussed in detail in Chapter 15. 


Glycogen Synthase Is Regulated by Covalent Modification 


Glycogen synthase also exists in two distinct forms that can be interconverted by the 
action of specific enzymes: active, dephosphorylated glycogen synthase I (glucose- 
6-P-independent) and less active, phosphorylated glycogen synthase D (glucose-6- 
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P-dependent). The phosphorylated form can be allosterically activated by glucose-6- 
phosphate, but the unphosphorylated enzyme is insensitive to this allosteric effector 
(Figure 22.17). The nature of phosphorylation is complex (Figure 22.17a). At least 
nine serine residues on the enzyme appear to be subject to phosphorylation, each 
site’s phosphorylation having some effect on enzyme activity. Four protein kinases 
are involved in phosphorylation of glycogen synthase: casein kinase, AMP-dependent 
protein kinase, protein kinase A, and glycogen synthase kinase 3 (GSK3). 

Dephosphorylation of both glycogen phosphorylase and glycogen synthase is car- 
ried out by phosphoprotein phosphatase-1 (PP1). The action of PPI inactivates 
glycogen phosphorylase and activates glycogen synthase. 
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FIGURE 22.17 (a) Binding of insulin to plasma mem- 
brane receptors in the liver and muscles triggers protein 
kinase cascades that stimulate glycogen synthesis. 
Insulin’s effects include inactivation of GSK3 and stimu- 
lation of PP1, both actions activating glycogen synthase, 
as well as recruitment of GLUT4 to the plasma mem- 
brane. Glucose uptake provides substrate for glycogen 
synthesis and glucose-6-phosphate, which allosterically 
activates the otherwise inactive form of glycogen 
synthase. (b) The metabolic effects of insulin are medi- 
ated through protein phosphorylation and second mes- 
senger modulation. 
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A DEEPER LOOK 


Carbohydrate Utilization in Exercise 


Animals have a remarkable ability to “shift gears” metabolically dur- 
ing periods of strenuous exercise or activity. Metabolic adaptations 
allow the body to draw on different sources of energy (all of which 
produce ATP) for different types of activity. During periods of 
short-term, high-intensity exercise (such as a 100-m dash), most of 
the required energy is supplied directly by existing stores of 
ATP and creatine phosphate (see figure, part a). Long-term, low- 
intensity exercise (such as a 10-km run or a 42.2-km marathon) is 
fueled almost entirely by aerobic metabolism. Between these ex- 
tremes is a variety of activities (an 800-m run, for example) that rely 
on anaerobic glycolysis—conversion of glucose to lactate in the 
muscles and utilization of the Cori cycle. 

For all these activities, breakdown of muscle glycogen provides 
much of the needed glucose. The rate of glycogen consumption 





















Light exercise 


depends on the intensity of the exercise (see figure, part b). By 
contrast, glucose derived from gluconeogenesis makes only small 
contributions to total glucose consumed during exercise. During 
prolonged mild exercise, gluconeogenesis accounts for only about 
8% of the total glucose consumed. During heavy exercise, this 
percentage becomes even lower. 

Choice of diet has a dramatic effect on glycogen recovery fol- 
lowing exhaustive exercise. A diet consisting mainly of protein 
and fat results in very little recovery of muscle glycogen, even af- 
ter 5 days (see figure, part c). On the other hand, a high-carbo- 
hydrate diet provides faster restoration of muscle glycogen. 
Even in this case, however, complete recovery of glycogen stores 
takes about 2 days. 
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À (a) Contributions of the various energy sources to muscle activity during mild exercise. (b) Con- 
sumption of glycogen stores in fast-twitch muscles during light, moderate, and heavy exercise. 

(c) Rate of glycogen replenishment following exhaustive exercise. (a and c adapted from Rhodes, R., and 
Pflanzer, R. G., 1992. Human Physiology. Philadelphia: Saunders College Publishing; b adapted from Horton, E. S., and 
Terjung, R. L., 1988. Exercise, Nutrition and Energy Metabolism. New York: Macmillan.) 
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FIGURE 22.18 The portal vein system carries pancreatic 
secretions such as insulin and glucagon to the liver and 
then into the rest of the circulatory system. 





Hormones Regulate Glycogen Synthesis and Degradation 


Storage and utilization of tissue glycogen, maintenance of blood glucose concentra- 
tion, and other aspects of carbohydrate metabolism are meticulously regulated by hor- 
mones, including insulin, glucagon, epinephrine, and the glucocorticoids. 


Insulin Is a Response to Increased Blood Glucose The primary hormone responsi- 
ble for conversion of glucose to glycogen is insulin (see Figure 5.8). Insulin is secreted 
by the G-cells in the pancreas within the islets of Langerhans. Secretion of insulin is a re- 
sponse to increased glucose in the blood. When blood glucose levels rise (after a meal, for 
example), insulin is secreted from the pancreas into the pancreatic vein, which emp- 
ties into the portal vein system (Figure 22.18), so insulin traverses the liver before it 
enters the systemic blood supply. Insulin acts to rapidly lower blood glucose concen- 
tration in several ways. Insulin stimulates glycogen synthesis and inhibits glycogen 
breakdown in liver and muscle. 


Insulin Triggers Glycogen Synthesis When Blood Glucose Rises The action of 
insulin when blood glucose rises is immediate and powerful. During periods be- 
tween meals, typical human blood glucose levels are 70 to 90 mg/dL. Glucose lev- 
els normally rise to about 150 mg/dL within the first hour following a carbohydrate- 
rich meal (Figure 22.19) and then return to normal within 2 to 3 hours. (For 
diabetic subjects, whose insulin response is impaired, glucose levels rise after a meal 
to 250 mg/dL or even higher and remain high for much longer times.) 
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HUMAN BIOCHEMISTRY 


von Gierke Disease—A Glycogen-Storage Disease 


In 1929, the physician Edgar von Gierke treated a patient with a very 
enlarged abdomen. The patient’s liver and kidneys were severely 
enlarged due to massive accumulations of glycogen, and von Gierke 
appropriately called the condition “hepato-nephromegalia glyco- 
genica.” Now termed von Gierke’s disease, or Type Ia glycogen stor- 
age disease, this condition results from the absence of glucose-6- 
phosphatase activity in the affected organs. This simple genetic 
defect causes a host of difficult complications, including a striking el- 
evation of serum triglycerides, excess adipose tissue in the cheeks, 
thin extremities, short stature, excessive curvature of the lumbar 
spine, and delay of puberty. 


rupting the recycling of glucose and causing affected individuals 
to be hypoglycemic. The accumulation of glucose-6-phosphate in 
the liver leads to greatly increased glycolytic activity, with conse- 
quent elevation of lactic acid, a condition known more commonly 
as lactic acidosis. Large amounts of uric acid and lipids are pro- 
duced, and the high rates of glycolysis produce excess NADH. 

The treatment of von Gierke’s disease consists of trying to 
maintain normal levels of glucose in the patient’s serum. This of- 
ten requires oral administration of large amounts of glucose, in 
its various forms, including, for example, uncooked cornstarch, 
which acts as a slow-release form of glucose. 


The absence of glucose-6-phosphatase activity in the liver 
blocks the last steps of glycogenolysis and gluconeogenesis, inter- 








Insulin lowers blood glucose by triggering several cascades of reactions that 
result in glucose uptake and glycogen synthesis (see Figure 22.17a). An insulin- 
triggered protein kinase cascade increases glucose transport into muscle, liver, and 
adipose tissues by stimulating exocytotic processes that translocate GLUT4, a glu- 
cose transporter, from intracellular vesicles to the plasma membrane (see Figure 
22.17a). Large amounts of glucose thus transported into the cell are converted to 
glucose-6-P, which can be directed to glycogen synthesis (by conversion to glucose- 
1-P). Also, glucose-6-P is the allosteric effector that activates the otherwise inactive, 
phosphorylated form of glycogen synthase. 

Binding of insulin to the plasma membrane, in either liver or muscle cells, trig- 
gers another protein kinase cascade (see Figure 15.17 and Chapter 32) that results 
in phosphorylation and inactivation of glycogen synthase kinase 3 (GSK3). This ki- 
nase normally phosphorylates and inactivates glycogen synthase. Inhibition of GSK3 
means that more of the cell’s glycogen synthase will remain in the unphosphory- 
lated, active state (see Figure 22.17a). Insulin also stimulates PP1, which dephos- 
phorylates (and activates) glycogen synthase. 

Several other physiological effects of insulin also serve to lower blood and tissue 
glucose levels (see Figure 22.17b). Insulin increases cellular utilization of glucose by 
inducing the synthesis of several important glycolytic enzymes, namely, glucokinase, 
phosphofructokinase, and pyruvate kinase. In addition, insulin acts to inhibit sev- 
eral enzymes of gluconeogenesis. These various actions enable the organism to re- 
spond quickly to increases in blood glucose levels. 


Glucagon and Epinephrine Stimulate Glycogen Breakdown Catabolism of tissue 
glycogen is triggered by the actions of the hormones epinephrine and glucagon (Fig- 
ure 22.20). In response to decreased blood glucose, glucagon is released from the a-cells 
in pancreatic islets of Langerhans. This peptide hormone travels through the blood 
to specific receptors on liver cell membranes. (Glucagon acts on liver and adipose 
tissue but not other tissues.) Similarly, signals from the central nervous system cause 
release of epinephrine—also known as adrenaline—from the adrenal glands into the 
bloodstream. Epinephrine acts on liver and muscles. When either hormone binds to 
its receptor on the outside surface of the cell membrane, a cascade is initiated that 
activates glycogen phosphorylase and inhibits glycogen synthase (Figure 22.20). The 
result of these actions is tightly coordinated stimulation of glycogen breakdown and inhibi- 
tion of glycogen synthesis. 


The Phosphorylase Cascade Amplifies the Hormonal Signal Stimulation of glyco- 
gen breakdown involves consumption of molecules of ATP at three different steps in 
the hormone-sensitive adenylyl cyclase cascade (see Figure 15.17). Note that the cas- 
cade mechanism is a means of chemical amplification, because the binding of just a 
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FIGURE 22.19 A glucose tolerance test involves inges- 
tion of a glucose solution followed by measurements 
of blood glucose for about 3 hours. Normal subjects 
exhibit a rise in blood glucose to about 150 mg/dL, 
followed by a decline to normal values over a 3-hour 
period. In diabetic subjects, blood glucose rises to 
higher values and remains high for longer periods. 
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FIGURE 22.20 Glucagon and epinephrine each activate a cascade of reactions that stimulate glycogen break- 
down and inhibit glycogen synthesis in liver and muscles, respectively. The effects of these hormones on other 
metabolic pathways depend on the tissue. In liver, glucagon inhibits glycolysis and stimulates gluconeogenesis, 
facilitating export of glucose into the bloodstream. In muscles, epinephrine stimulates glycolysis to provide 
energy for contraction. These effects all depend on protein phosphorylations by cAMP-dependent protein kinase. 
Note that the liver and heart isoforms of PFK-2/F-2,6-BPase respond oppositely to phosphorylation by PKA. 
Glucagon is a 29-residue peptide with the sequence H;*N-HSEGTFTSDYSKYLDSRRAQDFVQWLMNT-COO-. 


few molecules of epinephrine or glucagon results in the synthesis of many molecules 
of cyclic AMP, which, through the action of cAMP-dependent protein kinase, can ac- 
tivate many more molecules of phosphorylase kinase and even more molecules of 
phosphorylase. For example, an extracellular level of 107!° to 1078 M epinephrine 
prompts the formation of 10~° M cyclic AMP, and for each protein kinase activated by 
cyclic AMP, approximately 30 phosphorylase kinase molecules are activated; these in 
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turn activate some 800 molecules of phosphorylase. Each of these catalyzes the for- 
mation of many molecules of glucose-1-P. 


The Difference Between Epinephrine and Glucagon Although both epinephrine 
and glucagon exert glycogenolytic effects, they do so for quite different reasons. Ep- 
inephrine is secreted as a response to anger or fear and may be viewed as an alarm 
or danger signal for the organism. Called the “fight or flight” hormone, it prepares 
the organism for mobilization of large amounts of energy. Among the many physi- 
ological changes elicited by epinephrine, one is the initiation of the enzyme cas- 
cade, as in Figure 15.17, which leads to rapid breakdown of glycogen, inhibition of 
glycogen synthesis, stimulation of glycolysis, and production of energy. The burst of 
energy produced is the result of a 2000-fold amplification of the rate of glycolysis. 
Because a fear or anger response must include generation of energy (in the form 
of glucose)—both immediately in localized sites (the muscles) and eventually 
throughout the organism (as supplied by the liver)—epinephrine must be able to 
activate glycogenolysis in both liver and muscles. 

Glucagon is involved in the long-term maintenance of steady-state levels of glu- 
cose in the blood. It performs this function by stimulating the liver to release glu- 
cose from glycogen stores into the bloodstream. To further elevate glucose levels, 
glucagon also stimulates liver gluconeogenesis by activating F-2,6-BPase activity (see 
Figure 22.10). It is important to note, however, that stabilization of blood glucose 
levels is managed almost entirely by the liver. Glucagon does not activate the phos- 
phorylase cascade in muscle (muscle membranes do not contain glucagon recep- 
tors). Muscle glycogen breakdown occurs only in response to epinephrine release, 
and muscle tissue does not participate in maintenance of steady-state glucose levels 
in the blood. 

Glucagon and epinephrine both trigger glycogen breakdown and inhibit glyco- 
gen synthesis (in liver and muscles, respectively), but their other effects on meta- 
bolic pathways are adapted exquisitely to the needs of the tissues involved. The liver 
must export glucose to the bloodstream to support other tissues. Thus, in the liver, 
PFK-2 is phosphorylated and inhibited by protein kinase A (PKA), lowering [fruc- 
tose-2,6-bisphosphate], inhibiting glycolysis, and activating gluconeogenesis (see 
Figure 22.20). In muscles, the glucose provided by glycogen breakdown is used im- 
mediately and locally to provide ATP energy for contraction. Therefore, glycolysis 
should be activated in concert with glycogen breakdown in muscles. Activation of 
glycolysis is accomplished in different ways, depending on the muscle. Heart mus- 
cle PFK-2 is activated upon phosphorylation at Ser*° and Ser’ by PKA in response 
to epinephrine, thus activating glycolysis (see Figure 22.20). Skeletal muscle PFK-2, 
however, is not a substrate for PKA. Instead, skeletal muscle PFK-1 is phosphorylated 
and activated by PKA (see Figure 22.20). 


Cortisol and Glucocorticoid Effects on Glycogen Metabolism Glucocorticoids 
are a class of steroid hormones that exert distinct effects on liver, skeletal muscle, 
and adipose tissue. The effects of cortisol, a typical glucocorticoid, are best de- 
scribed as catabolic because cortisol promotes protein breakdown and decreases 
protein synthesis in skeletal muscle. In the liver, however, it stimulates gluconeo- 
genesis and increases glycogen synthesis. Cortisol-induced gluconeogenesis results 
primarily from increased conversion of amino acids into glucose (Figure 22.21). 
Specific effects of cortisol in the liver include increased expression of several genes 
encoding enzymes of the gluconeogenic pathway, activation of enzymes involved in 
amino acid metabolism, and stimulation of the urea cycle, which disposes of nitro- 
gen liberated during amino acid catabolism (see Chapter 25). 


22:61 Can Glucose Provide Electrons for Biosynthesis? 


Cells require a constant supply of NADPH for reductive reactions vital to biosynthetic 
purposes. Much of this requirement is met by a glucose-based metabolic sequence 
variously called the pentose phosphate pathway, the hexose monophosphate shunt, or 
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FIGURE 22.21 The effects of cortisol on carbohydrate 
and protein metabolism in the liver. 


the phosphogluconate pathway. In addition to providing NADPH for biosynthetic 
processes, this pathway produces ribose-5-phosphate, which is essential for nucleic acid 
synthesis. Several metabolites of the pentose phosphate pathway can also be shuttled 
into glycolysis. 


The Pentose Phosphate Pathway Operates Mainly in Liver 
and Adipose Cells 


The pentose phosphate pathway begins with glucose-6-phosphate, a six-carbon sugar, 
and produces three-, four-, five-, six-, and seven-carbon sugars (Figure 22.22). As we will 
see, two successive oxidations lead to the reduction of NADP* to NADPH and the re- 
lease of COs. Five subsequent nonoxidative steps produce a variety of carbohydrates, 
some of which may enter the glycolytic pathway. The enzymes of the pentose phosphate 
pathway are particularly abundant in the cytoplasm of liver and adipose cells. These en- 
zymes are largely absent in muscle, where glucose-6-phosphate is utilized primarily for 
energy production via glycolysis and the TCA cycle. These pentose phosphate pathway 
enzymes are located in the cytosol, which is the site of fatty acid synthesis, a pathway 
heavily dependent on NADPH for reductive reactions. 


The Pentose Phosphate Pathway Begins with Two Oxidative Steps 


1.Glucose-6-Phosphate Dehydrogenase The pentose phosphate pathway begins 
with the oxidation of glucose-6-phosphate. The products of the reaction are a cyclic 
ester (the lactone of phosphogluconic acid) and NADPH (Figure 22.23). Glucose- 
6-phosphate dehydrogenase, which catalyzes this reaction, is highly specific for 
NADP+. As the first step of a major pathway, the reaction is irreversible and highly 
regulated. Glucose-6-phosphate dehydrogenase is strongly inhibited by the product 
coenzyme, NADPH, and also by fatty acid esters of coenzyme A (which are inter- 
mediates of fatty acid biosynthesis). Inhibition due to NADPH depends upon the 
cytosolic NADP*/NADPH ratio, which in the liver is about 0.015 (compared to 
about 725 for the NAD*/NADH ratio in the cytosol). 


2.Gluconolactonase The gluconolactone produced in step 1 is hydrolytically un- 
stable and readily undergoes a spontaneous ring-opening hydrolysis, although an 
enzyme, gluconolactonase, accelerates this reaction (Figure 22.24). The linear 
product, the sugar acid 6-phospho-p-gluconate, is further oxidized in step 3. 


3. 6-Phosphogluconate Dehydrogenase The oxidative decarboxylation of 6- 
CENGAGENOW” Go to CengageNOW and click : 
Cengagelnteractive to explore the reaction mecha- phosphogluconate by 6-phosphogluconate dehydrogenase yields D-ribulose-5- 
nism for 6-phosphogluconate dehydrogenase. phosphate and another equivalent of NADPH. There are two distinct steps in this 
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reaction (Figure 22.25): The initial NADP*-dependent dehydrogenation yields a 
B-keto acid, 3-keto-6-phosphogluconate, which is very susceptible to decarboxy- 
lation (the second step). The resulting product, D-ribulose-5-P, is the substrate 
for the nonoxidative reactions composing the rest of this pathway. 


There Are Four Nonoxidative Reactions in the Pentose 
Phosphate Pathway 


This portion of the pathway begins with an isomerization and an epimerization, and 
it leads to the formation of either D-ribose-5-phosphate or D-xylulose-5-phosphate. 
These intermediates can then be converted into glycolytic intermediates or directed 
to biosynthetic processes. 


4. Phosphopentose Isomerase This enzyme interconverts ribulose-5-P and ribose- 
5-P via an enediol intermediate (Figure 22.26). The reaction (and mechanism) is quite 
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The complications of diabetes include a high propensity for cataract 
formation in later life, both in type 1 and type 2 diabetics. Hyper- 
glycemia is the suspected cause, but by what mechanism? Several 
lines of evidence point to the polyol pathway, in which glucose and 
other simple sugars are reduced in NADPH-dependent reactions. 
Glucose, for example, is reduced by aldose reductase to sorbitol (see 
accompanying figure), which accumulates in lens fiber cells, in- 
creasing the intracellular osmotic pressure and eventually rupturing 


Aldose Reductase and Diabetic Cataract Formation 


the cells. The involvement of aldose reductase in this process is 
supported by the fact that animals that have high levels of this en- 
zyme in their lenses (such as rats and dogs) are prone to develop 
diabetic cataracts, whereas mice that have low levels of lens aldose 
reductase activity are not. Moreover, aldose reductase inhibitors 
such as tolrestat and epalrestat suppress cataract formation. These 
drugs or derivatives from them may represent an effective preven- 
tive therapy against cataract formation in people with diabetes. 
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similar to the phosphoglucoisomerase reaction of glycolysis, which interconverts 
glucose-6-P and fructose-6-P. The ribose-5-P produced in this reaction is utilized in the 
biosynthesis of coenzymes (including NADH, NADPH, FAD, and By»), nucleotides, 
and nucleic acids (DNA and RNA). The net reaction for the first four steps of the pen- 
tose phosphate pathway is 


Glucose-6-P + 2 NADP* + HO —> ribose-5-P + 2 NADPH + 2 H* + COs 


5. Phosphopentose Epimerase This reaction converts ribulose-5-P to another 
ketose, namely, xylulose-5-P. This reaction also proceeds by an enediol intermediate 
but involves an inversion at C-3 (Figure 22.27). In the reaction, an acidic proton 
located a- to a carbonyl carbon is removed to generate the enediolate, but the pro- 
ton is added back to the same carbon from the opposite side. Note the distinction 
in nomenclature here. Interchange of groups on a single carbon is an epimeriza- 
tion, and interchange of groups between carbons is an isomerization. 

To this point, the pathway has generated a pool of pentose phosphates. The 
AG*' for each of the last two reactions is small, and the three pentose-5-phosphates 
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FIGURE 22.28 The transketolase reaction of step 6 in the 
pentose phosphate pathway. 


FIGURE 22.29 The transketolase reaction of step 8 in the 
pentose phosphate pathway. 


FIGURE 22.27 The phosphopentose epimerase reaction interconverts ribulose-5-P and xylulose-5-phosphate. 
The mechanism involves an enediol intermediate and occurs with inversion at C-3. 


coexist at equilibrium. The pathway has also produced two molecules of NADPH for 
each glucose-6-P converted to pentose-5-phosphate. The next three steps rearrange 
the five-carbon skeletons of the pentoses to produce three-, four-, six-, and seven- 
carbon units, which can be used for various metabolic purposes. Why should the 
cell do this? Very often, the cellular need for NADPH is considerably greater than 
the need for ribose-5-phosphate. The next three steps thus return some of the five- 
carbon units to glyceraldehyde-3-phosphate and fructose-6-phosphate, which can 
enter the glycolytic pathway. The advantage of this is that the cell has met its needs 
for NADPH and ribose-5-phosphate in a single pathway, yet at the same time it can 
return the excess carbon metabolites to glycolysis. 


6and 8.Transketolase The transketolase enzyme acts at both steps 6 and 8 of the 
pentose phosphate pathway. In both cases, the enzyme catalyzes the transfer of two- 
carbon units. In these reactions (and also in step 7, the transaldolase reaction, 
which transfers three-carbon units), the donor molecule is a ketose and the 
recipient is an aldose. In step 6, xylulose-5-phosphate transfers a two-carbon unit 
to ribose-5-phosphate to form glyceraldehyde-3-phosphate and sedoheptulose-7- 
phosphate (Figure 22.28). Step 8 involves a two-carbon transfer from xylulose- 
5-phosphate to erythrose-4-phosphate to produce another glyceraldehyde-3- 
phosphate and a fructose-6-phosphate (Figure 22.29). Three of these products 





























= O | 
Gs CHO HOCH 
| 
C=O HCOH CHO HCOH 
| | Transketolase 
aae E RIN A 
HOCH + HCOH = HCOH + HCOH 
| | ‘ 
HCOH HCOH CH,OPO;7— HCOH 
| 
CH,OPO37— CH,OPO37— CH,OPO37— 
Xylulose-5-P Ribose-5-P Glyceraldehyde-3-P Sedoheptulose-7-P 
CHOH 
T l 
ma C=O 
| i Z 
QG=0 CHO CHO HOCH 
| Transketolase 
HOCH + HCOH 4> HCOH + HCOH 
| j 
HCOH HCOH CH,OPO;7— HCOH 
CH,OPO3"— CH,OPO3"— CH OPO; 
Xylulose-5-P Erythrose-4-P Glyceraldehyde-3-P Fructose-6-P 


22.6 Can Glucose Provide Electrons for Biosynthesis? 689 


enter directly into the glycolytic pathway. (The sedoheptulose-7-phosphate is taken 
care of in step 7, as we shall see.) Transketolase is a thiamine pyrophosphate- 
dependent enzyme, and the mechanism (Figure 22.30) involves abstraction of the 
acidic thiazole proton of TPP, attack by the resulting carbanion at the carbonyl 
carbon of the ketose phosphate substrate, expulsion of the glyceraldehyde-3- 
phosphate product, and transfer of the two-carbon unit. Transketolase can process 
a variety of 2-keto sugar phosphates in a similar manner. It is specific for ketose 
substrates with the configuration shown but can accept a variety of aldose phos- 
phate substrates. 


7. Transaldolase The transaldolase functions primarily to make a useful gly- 
colytic substrate from the sedoheptulose-7-phosphate produced by the first trans- 
ketolase reaction. This reaction (Figure 22.31) is quite similar to the aldolase re- 
action of glycolysis, involving formation of a Schiff base intermediate between the 
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FIGURE 22.30 The mechanism of the TPP-dependent 
transketolase reaction. Ironically, the group transferred 
in the transketolase reaction might best be described 
as an aldol, whereas the transferred group in the 
transaldolase reaction is actually a ketol. Despite the 
irony, these names persist for historical reasons. 
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FIGURE 22.31 The transaldolase reaction. 
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sedoheptulose-7-phosphate and an active-site lysine residue (Figure 22.32). Elim- 
ination of the erythrose-4-phosphate product leaves an enamine of dihydroxy- 
acetone, which remains stable at the active site (without imine hydrolysis) until 
the other substrate comes into position. Attack of the enamine carbanion at the 
carbonyl carbon of glyceraldehyde-3-phosphate is followed by hydrolysis of the 
Schiff base (imine) to yield the product fructose-6-phosphate. 
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the second product, fructose-6-P. See this figure ani- 
mated at www.cengage.com/login. Fructose-6-P 
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Utilization of Glucose-6-P Depends on the Cell’s Need for ATP, NADPH, 
and Ribose-5-P 


It is clear that glucose-6-phosphate can be used as a substrate either for glycolysis or 
for the pentose phosphate pathway. The cell makes this choice on the basis of its rel- 
ative needs for biosynthesis and for energy from metabolism. ATP can be produced 
in abundance if glucose-6-phosphate is channeled into glycolysis. On the other hand, 
if NADPH or ribose-5-phosphate is needed, glucose-6-phosphate can be directed to 
the pentose phosphate pathway. The molecular basis for this regulatory decision de- 
pends on the enzymes that metabolize glucose-6-phosphate in glycolysis and the pen- 
tose phosphate pathway. In glycolysis, phosphoglucoisomerase converts glucose- 
6-phosphate to fructose-6-phosphate, which is utilized by phosphofructokinase (a 
highly regulated enzyme) to produce fructose-1,6-bisphosphate. In the pentose 
phosphate pathway, glucose-6-phosphate dehydrogenase (also highly regulated) 
produces 6-phosphogluconolactone from glucose-6-phosphate. Thus, the fate of 
glucose-6-phosphate is determined to a large extent by the relative activities of phos- 
phofructokinase and glucose-6-P dehydrogenase. Recall from Chapter 18 that PFK is 
inhibited when the ATP/AMP ratio increases and that it is inhibited by citrate but ac- 
tivated by fructose-2,6-bisphosphate. Thus, when the energy charge is high, glycolytic 
flux decreases. Glucose-6-P dehydrogenase, on the other hand, is inhibited by high 
levels of NADPH and also by the acyl-CoA intermediates of fatty acid biosynthesis. 
Both of these are indicators that biosynthetic demands have been satisfied. If that is 
the case, glucose-6-phosphate dehydrogenase and the pentose phosphate pathway 
are inhibited. If NADPH levels drop, the pentose phosphate pathway turns on and 
NADPH and ribose-5-phosphate are made for biosynthetic purposes. 

Even when the latter choice has been made, however, the cell must still be re- 
sponsive to the relative needs for ribose-5-phosphate and NADPH (as well as ATP). 
Depending on these relative needs, the reactions of glycolysis and the pentose phos- 
phate pathway can be combined in novel ways to emphasize the synthesis of needed 
metabolites. There are four principal possibilities. 


1. Both Ribose-5-P and NADPH Are Needed by the Cell In this case, the first four reac- 
tions of the pentose phosphate pathway predominate (Figure 22.33). NADPH is pro- 
duced by the oxidative reactions of the pathway, and ribose-5-P is the principal prod- 
uct of carbon metabolism. As stated earlier, the net reaction for these processes is 


Glucose-6-P + 2 NADP+ + HO —>ribose-5-P + CO, + 2 NADPH + 2 H+ 


2. More Ribose-5-P Than NADPH Is Needed by the Cell Synthesis of ribose-5-P can 
be accomplished without production of NADPH if the oxidative steps of the pen- 
tose phosphate pathway are bypassed. The key to this route is the withdrawal of 
fructose-6-P and glyceraldehyde-3-P, but not glucose-6-P, from glycolysis. The ac- 
tion of transketolase and transaldolase on fructose-6-P and glyceraldehyde-3-P 
produces three molecules of ribose-5-P from two molecules of fructose-6-P and 
one of glyceraldehyde-3-P. In this route, as in case 1, no carbon metabolites are re- 
turned to glycolysis. The net reaction for this route is 


5 Glucose-6-P + ATP —> 6 ribose-5-P + ADP + Ht 


3. More NADPH Than Ribose-5-P Is Needed by the Cell Large amounts of NADPH 
can be supplied for biosynthesis without concomitant production of ribose-5-P if 
ribose-5-P produced in the pentose phosphate pathway is recycled to produce 





FIGURE 22.33 When biosynthetic demands dictate, the first four reactions of the pentose phosphate pathway 
predominate and the principal products are ribose-5-P and NADPH. 
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glycolytic intermediates. This alternative involves a complex interplay between the 
transketolase and transaldolase reactions to convert ribulose-5-P to fructose-6-P and 
glyceraldehyde-3-P, which can be recycled to glucose-6-P via gluconeogenesis. The 


net reaction for this process is 


6 Glucose-6-P + 12 NADP+ + 6 HO —> 
6 ribulose-5-P + 6 CO, + 12 NADPH + 12 H* 
6 Ribulose-5-P —> 5 glucose-6-P + P; 





Net: Glucose-6-P + 12 NADP* + 6 HO —> 6 CO, + 12 NADPH + 12 H* + P; 


Note that in this scheme, the six hexose sugars have been converted to six pen- 
tose sugars with release of six molecules of COs, and the six pentoses are recon- 


verted to five glucose molecules. 


4. Both NADPH and ATP Are Needed by the Cell, but Ribose-5-P Is Not Under 
some conditions, both NADPH and ATP must be provided in the cell. This can 
be accomplished in a series of reactions similar to case 3 if the fructose-6-P and 
glyceraldehyde-3-P produced in this way proceed through glycolysis to produce ATP 
and pyruvate, which itself can yield even more ATP by continuing on to the TCA 
cycle. The net reaction for this alternative is 


3 Glucose-6-P + 5 NAD* + 6 NADP* + 8 ADP + 5 P; —> 
5 pyruvate + 3 CO + 5 NADH + 6 NADPH + 8 ATP + 2 HO + 8 H* 


Note that, except for the three molecules of COs, all the other carbon from glucose- 


6-P is recovered in pyruvate. 


Xylulose-5-Phosphate Is a Metabolic Regulator 


In addition to its role in the pentose phosphate pathway, xylulose-5-phosphate 
serves as a signaling molecule. When blood [glucose] rises (for example, follow- 
ing a carbohydrate-rich meal), glycolysis and the pentose phosphate pathway are 
activated in the liver, and xylulose-5-phosphate produced in the latter pathway 
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FIGURE 22.34 Xylulose-5-phosphate is a regulator of multiple metabolic pathways. Activation of PP2A triggers 
dephosphorylation of the bifunctional PFK-2/F2,6-BPase, which raises fructose-2,6-BP levels, in turn activating 
glycolysis and inhibiting gluconeogenesis. Simultaneously, ChREBP is dephosphorylated, leading to elevated 
expression of genes encoding enzymes for lipogenesis. These effects combine to trigger lipid biosynthesis in 


response to a carbohydrate-rich meal. 
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stimulates protein phosphatase 2A (PP2A), which dephosphorylates the bifunc- 
tional enzyme PFK-2/F-2,6-BPase (Figure 22.34, and see Figure 22.10). Increases 
in [fructose-2,6-bisphosphate] stimulate glycolysis and inhibit gluconeogenesis. At 
the same time, PP2A dephosphorylates carbohydrate-responsive element-binding 
protein (ChREBP), a transcription factor that activates expression of liver genes for 
lipid synthesis. These effects are a powerful combination. Increased glycolysis pro- 
duces substantial amounts of acetyl-CoA, the principal substrate for lipid synthesis. 
The pentose phosphate pathway produces NADPH, the source of electrons for lipid 
biosynthesis. Elevated expression of the appropriate genes sets the stage for lipid 
biosynthesis in the liver, an important consequence of ingestion of carbohydrates. 


SUMMARY 


22.1 What Is Gluconeogenesis, and How Does It Operate? Gluconeo- 
genesis is the generation (genesis) of new (neo) glucose. In addition to 
pyruvate and lactate, other noncarbohydrate precursors can be used as 
substrates for gluconeogenesis in animals, including most of the amino 
acids, as well as glycerol and all the TCA cycle intermediates. On the 
other hand, fatty acids are not substrates for gluconeogenesis in ani- 
mals. Lysine and leucine are the only amino acids that are not substrates 
for gluconeogenesis. These amino acids produce only acetyl-CoA upon 
degradation. Acetyl-CoA can be a substrate for gluconeogenesis in 
plants when the glyoxylate cycle is operating. The major sites of gluco- 
neogenesis are the liver and kidneys, which account for about 90% and 
10% of the body’s gluconeogenic activity, respectively. 


22.2 How Is Gluconeogenesis Regulated? Glycolysis and gluconeo- 
genesis are under reciprocal control, so glycolysis is inhibited when glu- 
coneogenesis is active, and vice versa. When the energy status of the 
cell is low, glucose is rapidly degraded to produce needed energy. 
When the energy status is high, pyruvate and other metabolites are uti- 
lized for synthesis (and storage) of glucose. The three sites of regula- 
tion in the gluconeogenic pathway are glucose-6-phosphatase, fructose- 
1,6-bisphosphatase, and the pyruvate carboxylase-PEP carboxykinase 
pair, respectively. These are the three most appropriate sites of regula- 
tion in gluconeogenesis. Glucose-6-phosphatase is under substrate- 
level control by glucose-6-phosphate. Acetyl-CoA allosterically activates 
pyruvate carboxylase. Fructose-1,6-bisphosphatase is inhibited by AMP 
and activated by citrate. Fructose-2,6-bisphosphate is a powerful in- 
hibitor of fructose-1,6-bisphosphatase. 


22.3 How Are Glycogen and Starch Catabolized in Animals? Virtually 
100% of digestible food is absorbed and metabolized. Digestive break- 
down of starch is an unregulated process. On the other hand, tissue 
glycogen represents an important reservoir of potential energy, and the 
reactions involved in its degradation and synthesis are carefully con- 
trolled and regulated. Glycogen reserves in liver and muscle tissue are 
stored in the cytosol as granules exhibiting a molecular weight range 
from 6 X 10° to 1600 X 10°. These granular aggregates contain the en- 


zymes required to synthesize and catabolize the glycogen, as well as all 
the enzymes of glycolysis. The principal enzyme of glycogen catabolism 
is glycogen phosphorylase, a highly regulated enzyme. The glycogen 
phosphorylase reaction involves phosphorolysis at a nonreducing end 
of a glycogen polymer. 


22.4 How Is Glycogen Synthesized? Luis Leloir, a biochemist in Ar- 
gentina, showed in the 1950s that glycogen synthesis depended upon 
sugar nucleotides. The glycogen polymer is built around a tiny protein 
core. The first glucose residue is covalently joined to the protein glyco- 
genin via an acetal linkage to a tyrosine-OH group on the protein. 
Sugar units are added to the glycogen polymer by the action of glyco- 
gen synthase. The reaction involves transfer of a glucosyl unit from 
UDP-glucose to the C-4 hydroxyl group at a nonreducing end of a 
glycogen strand. The mechanism proceeds by cleavage of the C—O 
bond between the glucose moiety and the -phosphate of UDP-glucose, 
leaving an oxonium ion intermediate, which is rapidly attacked by the 
C-4 hydroxyl oxygen of a terminal glucose unit on glycogen. 


22.5 How Is Glycogen Metabolism Controlled? Activation of glycogen 
phosphorylase is tightly linked to inhibition of glycogen synthase, and 
vice versa. Regulation involves both allosteric control and covalent mod- 
ification, with the latter being under hormonal control. Glycogen syn- 
thase is also regulated by covalent modification. Storage and utilization 
of tissue glycogen are regulated by hormones, including insulin, 
glucagon, epinephrine, and the glucocorticoids. Insulin stimulates glyco- 
gen synthesis and inhibits glycogen breakdown in liver and muscle, 
whereas glucagon and epinephrine stimulate glycogen breakdown. 


22.6 Can Glucose Provide Electrons for Biosynthesis? The pentose 
phosphate pathway is a collection of eight reactions that provide 
NADPH for biosynthetic processes and ribose-5-phosphate for nucleic 
acid synthesis. Several metabolites of the pentose phosphate pathway 
can also be shuttled into glycolysis. Utilization of glucose-6-P in the pen- 
tose phosphate pathway depends on the cell’s need for ATP, NADPH, 
and ribose-5-P. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 


1. Consider the balanced equation for gluconeogenesis in Section 22.1. 
Account for each of the components of this equation and the indi- 
cated stoichiometry. 

2. (Integrates with Chapters 3 and 18.) Calculate AG°’ and AG for 
gluconeogenesis in the erythrocyte, using data in Table 18.2 (as- 
sume NAD*/NADH = 20, [GTP] = [ATP], and [GDP] = [ADP]). 
See how closely your values match those in Section 22.1. 


3. Use the data of Figure 22.9 to calculate the percent inhibition of 
fructose-1,6-bisphosphatase by 25 mM fructose-2,6-bisphosphate 
when fructose-1,6-bisphosphate is (a) 25 mM and (b) 100 mM. 

4. (Integrates with Chapter 3.) Suggest an explanation for the exergonic 
nature of the glycogen synthase reaction (AG°’ = —13.3 kJ/mol). 
Consult Chapter 3 to review the energetics of high-energy phosphate 
compounds if necessary. 

5. Using the values in Table 23.1 for body glycogen content and the 
data in part b of the illustration for A Deeper Look (page 680), 
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calculate the rate of energy consumption by muscles in heavy exer- c. Increasing the concentration of blood insulin 
cise (in J/sec). Use the data for fast-twitch muscle. d. Increasing the amount of blood glucagon 

6. Which reactions of the pentose phosphate pathway would be e. Decreasing levels of tissue ATP 
inhibited by NaBH,? Why? f. Increasing the concentration of tissue AMP 


7. (Integrates with Chapter 7.) Imagine a glycogen molecule with g- Decreasing the concentration of fructose-6-phosphate 


8000 glucose residues. If branches occur every eight residues, how 9. 
many reducing ends does the molecule have? If branches occur 

every 12 residues, how many reducing ends does it have? How many 
nonreducing ends does it have in each of these cases? 


(Integrates with Chapters 3 and 15.) The free energy change of the 
glycogen phosphorylase reaction is AG? = +3.1 kJ/mol. If [P;] = 
1 mM, what is the concentration of glucose-1-P when this reaction 
is at equilibrium? 

10. Based on the mechanism for pyruvate carboxylase (Figure 22.3), 


8. Explain the effects of each of the following on the rates of gluco- 
write reasonable mechanisms for the reactions that follow: 


neogenesis and glycogen metabolism: 
a. Increasing the concentration of tissue fructose-1,6-bisphosphate 
b. Increasing the concentration of blood glucose 
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11. The mechanistic chemistry of the acetolactate synthase and phos- 
phoketolase reactions (shown here) is similar to that of the trans- 
ketolase reaction (Figure 22.30). Write suitable mechanisms for 
these reactions. 





























O O O OH 
| Acetolactate | | 
CH;—C—COO™ + CH;—C—COO- > CH3;—C—C—COO™ + COs 
synthase | 
CH; 
C=0 T 
HOCH O HCOH 
Phosphoketolase | | 
HCOH + HOPO,?~ = = > HC C OPO; + Didi 
9 2- 
HCOH H,O CHOPO; 
CH ,OPO;7— 


Fructose-6-P Acetyl-P Erythrose-4-P 


12. Metaglip is a prescribed preparation (from Bristol-Myers Squibb) for 
treatment of type 2 diabetes. It consists of metformin (see Human 
Biochemistry, page 668) together with glipizide. The actions of met- 
formin and glipizide are said to be complementary. Suggest a mech- 
anism for the action of glipizide. 

13. Study the structures of tolrestat and epalrestat in the Human Bio- 
chemistry box on page 687 and suggest a mechanism of action for 
these inhibitors of aldose reductase. 

14. Based on the discussion on page 691, draw a diagram to show how 
several steps in the pentose phosphate pathway can be bypassed to 
produce large amounts of ribose-5-phosphate. Begin your diagram 
with fructose-6-phosphate. 

15. Consider the diagram you constructed in problem 14. Which car- 
bon atoms in ribose-5-phosphate are derived from carbon atoms in 
positions 1, 3, and 6 of fructose-6-phosphate? 

16. As described on pages 691 and 692, the pentose phosphate pathway 
may be used to produce large amounts of NADPH without signifi- 
cant net production of ribose-5-phosphate. Draw a diagram, 
beginning with glucose-6-phosphate, to show how this may be ac- 
complished. 
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17. The discussion on page 692 explains that the pentose phosphate 
pathway and the glycolytic pathway can be combined to provide 
both NADPH and ATP (as well as some NADH) without net ribose- 
5-phosphate synthesis. Draw a diagram to show how this may be ac- 
complished. 

18. Consider the pathway diagram you constructed in problem 17. 
What is the fate of carbon from positions 2 and 4 of glucose-6- 
phosphate after one pass through the pathway? 

19. Glycogenin catalyzes the first reaction in the synthesis of a glycogen 
particle, with Tyr! of glycogenin (page 676) combining with a glu- 
cose unit (provided by UDP-glucose) to produce a tyrosyl glucose. 
Write a mechanism to show how this reaction could occur. 


Preparing for the MCAT Exam 

20. Study the graphs in the Deeper Look box (page 680) and explain 
the timing of the provision of energy from different metabolic 
sources during periods of heavy exercise. 

21. (Integrates with Chapters 3 and 14.) What is the structure of crea- 
tine phosphate? Write reactions to indicate how it stores and pro- 
vides energy for exercise. 
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3 Fatty Acid Catabolism 


ESSENTIAL QUESTIONS 


Fatty acids represent the principal form of stored energy for many organisms. There 
are two important advantages to storing energy in the form of fatty acids. (1) The car- 
bon in fatty acids (mostly —CH,— groups) is almost completely reduced compared 
to the carbon in other simple biomolecules (sugars, amino acids). Therefore, oxidation 
of fatty acids will yield more energy (in the form of ATP) than any other form of 
carbon. (2) Fatty acids are not generally as hydrated as monosaccharides and polysac- 
charides are, and thus they can pack more closely in storage tissues. 


How are fatty acids catabolized, and how is their inherent energy captured by 
organisms? 


230s How Are Fats Mobilized from Dietary Intake and 
Adipose Tissue? 


Modern Diets Are Often High in Fat 


Fatty acids are acquired readily in the diet and can also be made from carbohydrates 
and the carbon skeletons of amino acids. Fatty acids provide 30% to 60% of the calo- 
ries in the diets of most Americans. For our caveman and cavewoman ancestors, the 
figure was probably closer to 20%. Dairy products were apparently not part of their 
diet, and the meat they consumed (from fast-moving animals) was low in fat. In con- 
trast, modern domesticated cows and pigs are actually bred for high fat content (and 
better taste). However, woolly mammoth burgers and saber-toothed tiger steaks are 
hard to find these days—even in the gourmet sections of grocery stores—and so, by 
default, we consume (and metabolize) large quantities of fatty acids. 


Triacylglycerols Are a Major Form of Stored Energy in Animals 


Although some of the fat in our diets is in the form of phospholipids, triacylglyc- 
erols are a major source of fatty acids. Triacylglycerols are also our principal stored 
energy reserve. As shown in Table 23.1, the energy available in stores of fat in the 
average person far exceeds the energy available from protein, glycogen, and glu- 
cose. Overall, fat accounts for approximately 83% of available energy, partly because 
more fat is stored than protein and carbohydrate and partly because of the sub- 
stantially higher energy yield per gram for fat compared with protein and carbohy- 
drate. Complete combustion of fat yields about 37 kJ/g, compared with about 16 to 
17 kJ/g for sugars, glycogen, and amino acids. In animals, fat is stored mainly as tri- 
acylglycerols in specialized cells called adipocytes or adipose cells. As shown in Fig- 
ure 23.1, triacylglycerols, aggregated to form large globules, occupy most of the vol- 
ume of adipose cells. Much smaller amounts of triacylglycerols are stored as small, 
aggregated globules in muscle tissue. 


Hormones Trigger the Release of Fatty Acids from Adipose Tissue 


The pathways for liberation of fatty acids from triacylglycerols, either from adipose 
cells or from the diet, are shown in Figures 23.2 and 23.3. Fatty acids are mobilized 
from adipocytes in response to hormone messengers such as adrenaline, glucagon, 
and adrenocorticotropic hormone (ACTH). These signal molecules bind to re- 
ceptors on the plasma membrane of adipose cells and lead to the activation of 
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Energy Dry Weight Available Energy 

Constituent (kJ/g dry weight) (g) (kJ) 

Fat (adipose tissue) 37 15,000 555,000 
Protein (muscle) 17 6,000 102,000 
Glycogen (muscle) 16 120 1,920 
Glycogen (liver) 16 70 1,120 
Glucose (extracellular fluid) 16 20 320 
Total 660,360 











Sources: Owen, O. E., and Reichard, G. A., Jr., 1971. Fuels consumed by man: The interplay between carbohydrates and fatty 
acids. Progress in Biochemistry and Pharmacology 6:177; and Newsholme, E. A., and Leech, A. R., 1983. Biochemistry for the 
Medical Sciences. New York: Wiley. 
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FIGURE 23.2 Liberation of fatty acids from triacylglyc- 
erols in adipose tissue is hormone-dependent. 
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FIGURE 23.3 (a) The pancreatic duct secretes digestive fluids into the duodenum, the first portion of the small 
intestine. (b) Hydrolysis of triacylglycerols by pancreatic and intestinal lipases. Pancreatic lipases cleave fatty 
acids at the C-1 and C-3 positions. Resulting monoacylglycerols with fatty acids at C-2 are hydrolyzed by intesti- 
nal lipases. Fatty acids and monoacylglycerols are absorbed through the intestinal wall and assembled into 
lipoprotein aggregates termed chylomicrons (discussed in Chapter 24). 
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FIGURE 23.4 In the small intestine, fatty acids combine 
with bile salts in mixed micelles, which deliver fatty 
acids to epithelial cells that cover the intestinal villi. Tri- 
acylglycerols are formed within the epithelial cells. 


adenylyl cyclase, which forms cyclic AMP from ATP. (Second messengers and hor- 
monal signaling are discussed in Chapter 32.) In adipose cells, cAMP activates pro- 
tein kinase A, which phosphorylates and activates a triacylglycerol lipase (also 
termed hormone-sensitive lipase) that hydrolyzes a fatty acid from C-1 or C-3 of 
triacylglycerols. Subsequent actions of diacylglycerol lipase and monoacylglycerol 
lipase yield fatty acids and glycerol. The cell then releases the fatty acids into 
the blood, where they are bound to serum albumin (the most abundant protein in 
blood serum). Serum albumin transports free fatty acids to sites of utilization. 


Degradation of Dietary Fatty Acids Occurs Primarily in the Duodenum 


Dietary triacylglycerols are degraded to a small extent (via fatty acid release) by 
lipases in the low-pH environment of the stomach, but mostly they pass untouched 
into the duodenum. Alkaline pancreatic juice secreted into the duodenum (Figure 
23.3a) raises the pH of the digestive mixture, allowing hydrolysis of the triacylglyc- 
erols by pancreatic lipase and by nonspecific esterases, which hydrolyze the fatty acid 
ester linkages. Pancreatic lipase cleaves fatty acids from the C-1 and C-3 positions of 
triacylglycerols, and other lipases and esterases attack the C-2 position (Figure 
23.3b). These processes depend upon the presence of bile salts, a family of car- 
boxylic acid salts with steroid backbones (see also Chapter 24). These agents act as 
detergents to emulsify the triacylglycerols and facilitate the hydrolytic activity of the 
lipases and esterases. Short-chain fatty acids (ten or fewer carbons) released in this 
way are absorbed directly into the villi of the intestinal mucosa, whereas long-chain 
fatty acids, which are less soluble, form mixed micelles with bile salts and are carried 
in this fashion to the surfaces of the epithelial cells that cover the villi (Figure 23.4). 
The fatty acids pass into the epithelial cells, where they are condensed with glycerol 
to form new triacylglycerols. These triacylglycerols aggregate with lipoproteins to 
form particles called chylomicrons, which are then transported into the lymphatic 
system and on to the bloodstream, where they circulate to the liver, lungs, heart, 
muscles, and other organs (see Chapter 24). At these sites, the triacylglycerols are 
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Each of the villi of the small intestine consists of a 
layer of epithelial cells and a core of capillaries and 
connective tissue. The outside (apical) face of each 
epithelial cell is covered by smaller projections called 
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and into the epithelial cells. There, the fatty acids 
combine with glycerol to form new triacylglycerols, 
which aggregate with lipoproteins to form 
chylomicrons. Taken up by the lymphatic system, 
chylomicrons circulate to tissues throughout the body. 


23.2 


hydrolyzed to release fatty acids, which can then be oxidized in a highly exergonic 
metabolic pathway known as B-oxidation. 


232) How Are Fatty Acids Broken Down? 


Knoop Elucidated the Essential Feature of (3-Oxidation 


The earliest clue to the secret of fatty acid oxidation and breakdown came in the 
early 1900s, when Franz Knoop carried out experiments in which he fed modified 
fatty acids to dogs. Knoop’s experiments showed that fatty acids must be degraded 
by oxidation at the B-carbon (Figure 23.5), followed by cleavage of the C,—Cg bond. 
Repetition of this process yielded two-carbon units, which Knoop assumed must be 
acetate. Much later, Albert Lehninger showed that this degradative process took 
place in the mitochondria, and F. Lynen and E. Reichart showed that the two- 
carbon unit released is acetyl-CoA, not free acetate. Because the entire process be- 
gins with oxidation of the carbon that is “6” to the carboxyl carbon, the process has 
come to be known as f-oxidation. 

In mammalian cells, -oxidation take place primarily in mitochondria, but a simi- 
lar pathway occurs in peroxisomes. In yeast and other lower eukaryotes, -oxidation 
is confined exclusively to peroxisomes. Mitochondrial -oxidation provides energy 
to the organism (Figure 23.6), whereas peroxisomal 6-oxidation is responsible for 
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FIGURE 23.6 The acyl-CoA synthetase reaction activates fatty acids for B-oxidation. The reaction is driven by 
hydrolysis of ATP to AMP and pyrophosphate and by the subsequent hydrolysis of pyrophosphate. 
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shortening long-chain fatty acids that are poor substrates for mitochondrial 
B-oxidation. Such shortened fatty acids then become substrates for mitochondrial 
B-oxidation. 


Coenzyme A Activates Fatty Acids for Degradation 


The process of 6-oxidation begins with the formation of a thiol ester bond between 
the fatty acid and the thiol group of coenzyme A. This reaction, shown in Figure 
23.6, is catalyzed by acyl-CoA synthetase, which is also called acyl-CoA ligase or fatty 
acid thiokinase. This condensation with CoA activates the fatty acid for reaction in 
the B-oxidation pathway. For long-chain fatty acids, this reaction normally occurs at 
the outer mitochondrial membrane in higher eukaryotes before entry of the fatty 
acid into the mitochondrion, but it may also occur at the surface of the endoplas- 
mic reticulum. Short- and medium-length fatty acids undergo this activating reac- 
tion in the mitochondria. In all cases, the reaction is accompanied by the hydroly- 
sis of ATP to form AMP and pyrophosphate. As shown in Figure 23.6, the overall 
reaction has a net AG°’ of about —0.8 kJ/mol, so the reaction is favorable but eas- 
ily reversible. However, there is more to the story. As we have seen in several similar 
cases, the pyrophosphate produced in this reaction is rapidly hydrolyzed by inor- 
ganic pyrophosphatase to two molecules of phosphate, with a net AG” of about 
—33.6 kJ/mol. Thus, pyrophosphate is maintained at a low concentration in the cell 
(usually less than 10 uM), and the synthetase reaction is strongly promoted. The 
mechanism of the acyl-CoA synthetase reaction is shown in Figure 23.7 and involves 
attack of the fatty acid carboxylate on ATP to form an acyladenylate intermediate, which 
is subsequently attacked by CoA, forming a fatty acyl-CoA thioester. 


Carnitine Carries Fatty Acyl Groups Across the Inner 
Mitochondrial Membrane 


All of the other enzymes of the -oxidation pathway are located in the mitochon- 
drial matrix. Short-chain fatty acids, as already mentioned, are transported into the 
matrix as free acids and form the acyl-CoA derivatives there. However, long-chain 
fatty acyl-CoA derivatives cannot be transported into the matrix directly. These long- 
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CENGAGENOW” ANIMATED FIGURE 23.7 The mechanism of the acyl-CoA synthetase reaction involves fatty 
acid carboxylate attack on ATP to form an acyl-adenylate intermediate. The fatty acyl CoA thioester product is 
formed by CoA attack on this intermediate. See this figure animated at www.cengage.com/login. 


chain derivatives must first be converted to acylcarnitine derivatives, as shown in 
Figure 23.8. Carnitine acyltransferase I, associated with the outer mitochondrial 
membrane, catalyzes the formation of the O-acylcarnitine, which is then trans- 
ported across the inner membrane by a translocase. At this point, the acylcarni- 
tine is passed to carnitine acyltransferase II on the matrix side of the inner mem- 
brane, which transfers the fatty acyl group back to CoA to re-form the fatty 
acyl-CoA, leaving free carnitine, which can return across the membrane via the 
translocase. 

Several additional points should be made. First, although oxygen esters usually 
have lower group-transfer potentials than thiol esters, the O—acyl bonds in acyl- 
carnitines have high group-transfer potentials, and the transesterification reactions 
mediated by the acyltransferases have equilibrium constants close to 1. Second, 
note that eukaryotic cells maintain separate pools of CoA in the mitochondria and 
in the cytosol. The cytosolic pool is utilized principally in fatty acid biosynthesis (see 
Chapter 24), and the mitochondrial pool is important in the oxidation of fatty acids 
and pyruvate, as well as some amino acids. 
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B-Oxidation Involves a Repeated Sequence of Four Reactions 


For saturated fatty acids, the process of B-oxidation involves a recurring cycle of 
four steps, as shown in Figure 23.9. The overall strategy in the first three steps is to 
create a carbonyl group on the 6-carbon by oxidizing the C,—Cg, bond to form an 
olefin, with subsequent hydration and oxidation. In essence, this cycle is directly 
analogous to the sequence of reactions converting succinate to oxaloacetate in the 
TCA cycle. The fourth reaction of the cycle cleaves the B-keto ester in a reverse 
Claisen condensation, producing an acetate unit and leaving a fatty acid chain that 
is two carbons shorter than it began. (Recall from Chapter 19 that Claisen con- 
densations involve attack by a nucleophilic agent on a carbonyl carbon to yield a 
-keto acid.) 


A Family of Acyl-CoA Dehydrogenases Carry Out the First Reaction of -Oxidation 
The enzymes of mitochondrial B-oxidation are organized in two functional systems: 
a membrane-bound complex that is specific for long-chain fatty acids (14 carbons 
and longer) and a family of soluble enzymes in the matrix that is specific for 
short- and medium-chain fatty acids (Figure 23.10). As a fatty acyl chain is shortened 
in successive cycles of 6-oxidation, it moves from the membrane-bound complex to 
the family of soluble matrix enzymes. The first reaction of the -oxidation cycle is 
catalyzed by one of four acyl-CoA dehydrogenases. These include the very long- 
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FIGURE 23.9 The B-oxidation of saturated fatty acids involves a cycle of four enzyme-catalyzed reactions. Each 
cycle produces single molecules of FADH2, NADH, and acetyl-CoA, consumes a water, and yields a fatty acid 
shortened by two carbons. (The delta [A] symbol connotes a double bond, and its superscript indicates the 
lower-numbered carbon involved.) 





FIGURE 23.10 Very long-chain fatty acids proceed through several cycles of B-oxidation (left) via membrane- 
bound enzymes in mitochondria. A membrane-bound multifunctional complex includes the enoyl-CoA hydra- 
tase (ECH), hydroxyacyl-CoA dehydrogenase (HAD), and ketoacyl thiolase (KT) activities. As chains shorten pro- 
gressively, they become substrates for the separate, soluble enzymes of B-oxidation (right). 


chain acyl-CoA dehydrogenase (VLCAD), as well as acyl-CoA dehydrogenases spe- 
cific for long-chain (LCAD), medium-chain (MCAD), and short-chain (SCAD) sub- 
strates. VLCAD is a membrane-bound homodimer of 67-kD subunits (Figure 23.11), 
whereas the soluble LCAD, MCAD, and SCAD are homotetramers of 40- to 45-kD 
subunits. 

All acyl-CoA dehydrogenases carry noncovalently (but tightly) bound FAD, which 
is reduced during the oxidation of the fatty acid. As shown in Figure 23.12, FADH» 
transfers its electrons to an electron transfer flavoprotein (ETF). Reduced ETF is re- 
oxidized by a specific oxidoreductase (an iron-sulfur protein), which in turn sends 
the electrons on to the electron-transport chain at the level of coenzyme Q. Recall 
from Chapter 20 that mitochondrial oxidation of FAD in this way eventually results 
in the net formation of about 1.5 ATPs. The mechanism of the acyl-CoA dehydroge- 
nase (Figure 23.13) involves deprotonation of the fatty acid chain at the a-carbon, fol- 
lowed by hydride transfer from the 6-carbon to FAD. 


(b) 





FIGURE 23.11 (a) The VLCAD of mammalian mitochondria is a 67-kD membrane-bound homodimer with 
bound FAD (red) and myristoyl-CoA (blue) (pdb id = 3B96). The tertiary structure of the N-terminal 400 
residues of VLCAD is similar to that of (b) the soluble MCAD (pdb id = 3MDE), shown with bound FAD (red) 
and octanoyl-CoA (blue). These similar structures each include an N-terminal a-helical domain (yellow), fol- 
lowed by a B-sheet domain (blue) and another a-helical domain (green). The acyl-CoA substrate lies in a long 
cleft between these three domains. The VLCAD also has a C-terminal a-helical domain (purple). 
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FIGURE 23.14 Structures of mitochondrial (a) enoyl- 
CoA hydratase trimer (pdb id = 2DUB), (b) hydroxyacyl- 
CoA dehydrogenase dimer (pdb id = 1FOY), and 

(c) ketoacyl thiolase dimer (pdb id = 1AFW). 
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FIGURE 23.12 The acyl-CoA dehydrogenase reaction. The two electrons removed in this oxidation reaction are 
delivered to the electron-transport chain in the form of reduced coenzyme Q (UQH)). 











FAD 
H H O H H O 
| | | | | | 
R—CH, ¢ ¢ C — SCoA —> R— CH, C C— C— SCoA 
H D H 
+-@ | 
R—CH, i 
C=c 
ee 
7 i 
SCoA 


FIGURE 23.13 The mechanism of acyl-CoA dehydrogenase. Removal of a proton from the a-C is followed by 
hydride transfer from the B-carbon to FAD. 


Enoyl-CoA Hydratase Adds Water Across the Double Bond The next step in 
-oxidation is the addition of the elements of HO across the new double bond in a 
stereospecific manner, yielding the corresponding hydroxyacyl-CoA. 
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The reaction is catalyzed by enoyl-CoA hydratase (Figure 23.14). A number of dif- 
ferent enoyl-CoA hydratase activities have been detected in various tissues. Also 
called crotonases, these enzymes specifically convert transenoyl-CoA derivatives to 
L-B-hydroxyacyl-CoA. Enoyl-CoA hydratases will also metabolize cis-enoyl-CoA (at 
slower rates) to give specifically D-G-hydroxyacyl-CoA. In addition, there is a novel 
enoyl-CoA hydratase that converts trans-enoyl-CoA to D-B-hydroxyacyl-CoA. 


L-Hydroxyacyl-CoA Dehydrogenase Oxidizes the B-Hydroxyl Group The third 
reaction of this cycle is the oxidation of the hydroxyl group at the -position to 
produce a B-ketoacyl-CoA derivative. This second oxidation reaction is catalyzed 
by L-hydroxyacyl-CoA dehydrogenase, an enzyme that requires NAD* as a coen- 
zyme (see Figure 23.9). 
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NADH produced in this reaction represents metabolic energy. Each NADH pro- 
duced in mitochondria by this reaction drives the synthesis of 2.5 molecules of ATP 
in the electron-transport pathway. L-Hydroxyacyl-CoA dehydrogenase shows ab- 
solute specificity for the L-hydroxyacyl isomer of the substrate. (D-Hydroxyacyl 
isomers, which arise mainly from the metabolism of unsaturated fatty acids, are 
handled differently.) 


B-Ketoacyl-CoA Intermediates Are Cleaved in the Thiolase Reaction The final 
step in the -oxidation cycle is the cleavage of the B-ketoacyl-CoA. This reaction, 
catalyzed by ketoacyl thiolase (also known as B-ketothiolase, Figure 23.9), involves 
the attack of a cysteine thiolate from the enzyme on the -carbonyl carbon, fol- 
lowed by cleavage to give the enolate of acetyl-CoA and an enzyme-thioester 
intermediate (Figure 23.15). Subsequent attack by the thiol group of a second CoA 
and departure of the cysteine thiolate yields a new (shorter) acyl-CoA. If the 
reaction in Figure 23.15 is read in reverse, it is easy to see that it is a Claisen 
condensation—an attack of the enolate anion of acetyl-CoA on a thioester. Despite 
the formation of a second thioester, this reaction has a very favorable Keq, and it 
drives the three previous reactions of 6-oxidation. 


Repetition of the -Oxidation Cycle Yields a Succession 
of Acetate Units 


In essence, this series of four reactions has yielded a fatty acid (as a CoA ester) that 
has been shortened by two carbons and one molecule of acetyl-CoA. The shortened 
fatty acyl-CoA can now go through another 6-oxidation cycle, as shown in Figure 23.9. 
Repetition of this cycle with a fatty acid with an even number of carbons eventually 
yields two molecules of acetyl-CoA in the final step. Complete -oxidation of palmitic 
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FIGURE 23.15 The mechanism of the thiolase reaction. Attack by an enzyme cysteine thiolate group at the 
ßB-carbonyl carbon produces a tetrahedral intermediate, which decomposes with departure of acetyl-CoA, 
leaving an enzyme thioester intermediate. Attack by the thiol group of a second CoA yields a new (shortened) 
acyl-CoA. 
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HUMAN BIOCHEMISTRY 


Exercise Can Reverse the Consequences of Metabolic Syndrome 


Metabolic syndrome is a combination of disorders that increase the 
risk of diabetes and cardiovascular disease. The hallmarks of meta- 
bolic syndrome include high blood pressure, elevated serum 
triglycerides, reduced serum high-density lipoprotein (HDL) cho- 
lesterol, insulin resistance, and obesity. The prevalence of these 
conditions is increasing in the United States. By most estimates, 
more than 30% of Americans are obese, and rising obesity has con- 
tributed to an epidemic of type 2 diabetes. Insights into how the 
body deals with high fat and high sugar diets are emerging from a 
variety of studies, and evidence points clearly to the benefits of ex- 
ercise and dietary restriction. Endurance training (such as distance 
running) and resistance training (with weights) are both benefi- 
cial. Endurance training increases the mass of slow-twitch muscle 
fibers, resistance training builds fast-twitch muscle fibers, and both 
types of exercise reduce body fat, but in quite different ways. 


Endurance training 


Slow-twitch 


muscle growth 





Local effects in muscle: 

Increased fatty acid 
oxidation 

Increased expression of 
insulin receptors and 
glucose transporters 


Heart: 


oxidation 





Increased fat 
uptake and 


Slow-twitch muscles depend on fatty acid oxidation and TCA 
cycle activity to support long periods of exercise and are termed 
oxidative. Effects of endurance training include increased expres- 
sion of (1) peroxisome proliferator-activated receptor 6, a tran- 
scription factor that builds slow-twitch muscle fiber, and (2) in- 
sulin receptors and glucose transporters. Fast-twitch muscles are 
adapted for short bursts of energy, which can be supplied by gly- 
colysis and thus are termed glycolytic. Effects of resistance training 
include activation of metabolic processes by the serine/threonine 
kinase Aktl. Induction of the Aktl pathway results in growth of 
fast-twitch skeletal muscle fibers and subsequent effects on several 
other organs. These include increased fat uptake and oxidation by 
the liver and heart, reduction of adipose (fat cell) mass, and re- 
duced blood glucose and insulin levels. 


Weight training 


Fast-twitch 
muscle growth 





Muscle: 
Increased glucose uptake 
and glycolysis 






Circulating 
factors 


Adipose tissue: Circulation: Liver: 
Reduced Reduced glucose, Increased fat 
cell size insulin, and uptake and 
leptin levels oxidation 





acid yields eight molecules of acetyl-CoA as well as seven molecules of FADH» and 
seven molecules of NADH (Figure 23.16 and Table 23.2). The acetyl-CoA can be 
further metabolized in the TCA cycle (as we have already seen). Alternatively, 
acetyl-CoA can also be used as a substrate in amino acid biosynthesis (see Chapter 
25). As noted in Chapter 22, however, acetyl-CoA cannot be used as a substrate for 
gluconeogenesis. 


Complete -Oxidation of One Palmitic Acid Yields 
106 Molecules of ATP 


If the acetyl-CoA is directed entirely to the TCA cycle in mitochondria, it can eventu- 
ally generate approximately ten high-energy phosphate bonds—that is, ten molecules 
of ATP synthesized from ADP (Table 23.2). Including the ATP formed from FADH 2 
and NADH, complete -oxidation of a molecule of palmitoyl-CoA in mitochondria 
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FIGURE 23.16 Reduced coenzymes produced by B-oxidation and TCA cycle activity provide electrons that 
drive the synthesis of ATP in oxidative phosphorylation. Complete oxidation of palmitoyl-CoA yields a total of 
108 ATP. Subtracting the 2 ATP equivalents consumed in forming the original CoA thioester, oxidation of palmi- 
tate produces 106 ATP. 





TABLE 23.2 J 


ATP Free Energy 











Equation Yield Yield (kJ/mol) 
CH; (CHo) ;4CO-CoA + 7 [FAD] + 7 HO + 7 NAD+ + 7 CoA —> 8 CH3CO-CoA + 7 [FADH] + 7 NADH + 7 H+ 

7 [FADHp»] + 10.5 P; + 10.5 ADP + 3.5 Og —> 7 [FAD] + 17.5 HO + 10.5 ATP 10.5 320 

7 NADH + 7 H* + 17.5 P; + 17.5 ADP + 3.5 Og —> 7 NAD* + 24.5 HO + 17.5 ATP 17.5 534 

8 Acetyl-CoA + 16 Os + 80 ADP + 80 P; —> 8 CoA + 88 HO + 16 CO, + 80 ATP 80 2440 
CH;— (CHo) ,4CO-CoA + 108 P; + 108 ADP + 23 Os —> 108 ATP + 16 CO; + 123 HO + CoA 108 3294 
Energetic “cost” of forming palmitoyl-CoA from palmitate and CoA —2 —61 
Total 106 3233 








yields 108 molecules of ATP. Subtracting the two high-energy bonds needed to form 
palmitoyl-CoA, the substrate for 6-oxidation, one concludes that B-oxidation of a mol- 
ecule of palmitic acid yields 106 molecules of ATP. The A G°’ for complete combustion 
of palmitate to COs is —9790 kJ/mol. The hydrolytic energy embodied in 106 ATPs 
is 106 X 30.5 kJ/mol = 3233 kJ/mol, so the overall efficiency of 6-oxidation under 
standard-state conditions is approximately 33%. The large energy yield from fatty acid 
oxidation is a reflection of the highly reduced state of the carbon in fatty acids. Sugars, 
in which the carbon is already partially oxidized, produce less energy, carbon for car- 
bon, than do fatty acids. The breakdown of fatty acids is regulated by a variety of 
metabolites and hormones. Details of this regulation are described in Chapter 24, fol- 
lowing a discussion of fatty acid synthesis. 


Migratory Birds Travel Long Distances on Energy from Fatty 
Acid Oxidation 


Because they represent the most highly concentrated form of stored biological 
energy, fatty acids are the metabolic fuel of choice for sustaining the incredibly 
long flights of many migratory birds. Although some birds migrate over landmasses 
and eat frequently, other species fly long distances without stopping to eat. The 
American golden plover (Figure 23.17) flies directly from Alaska to Hawaii, a 
3300-km flight requiring 35 hours (at an average speed of nearly 60 miles/hr) and 
more than 250,000 wing beats! The ruby-throated hummingbird, which winters in 
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FIGURE 23.17 Animals whose existence is strongly dependent 
on fatty acid oxidation: (a) gerbil, (b) ruby-throated humming- 
bird, (c) golden plover, (d) orca (killer whale), and (e) camels. 
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(b) Ruby-throated hummingbird 
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(d) Orca (e) Camels 


Central America and nests in southern Canada, often flies nonstop across the Gulf 
of Mexico. These and similar birds accomplish these prodigious feats by storing large 
amounts of fatty acids (as triacylglycerols) in the days before their migratory flights. 
The percentage of dry-weight body fat in these birds may be as high as 70% when mi- 
gration begins (compared with values of 30% and less for nonmigratory birds). 


Fatty Acid Oxidation Is an Important Source of Metabolic Water 
for Some Animals 


Large amounts of metabolic water are generated by -oxidation (123 HO per 
palmitoyl-CoA, see Table 23.2). For certain animals—including desert animals 
(such as gerbils) and killer whales (which do not drink seawater)—the oxidation of 
fatty acids can be a significant source of dietary water. A striking example is the 
camel (Figure 23.17), whose hump is essentially a large deposit of fat. Metabolism 
of fatty acids from this store provides needed water (as well as metabolic energy) 
during periods when drinking water is not available. It might well be said that “the 
ship of the desert” sails on its own metabolic water! 


23030 How Are Odd-Carbon Fatty Acids Oxidized? 
B-Oxidation of Odd-Carbon Fatty Acids Yields Propionyl-CoA 


Fatty acids with odd numbers of carbon atoms are rare in mammals but fairly common 
in plants and marine organisms. Humans and animals whose diets include these food 
sources metabolize odd-carbon fatty acids via the B-oxidation pathway. The final prod- 
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uct of -oxidation in this case is the three-carbon propionyl-CoA instead of acetyl- 
CoA. Three specialized enzymes then carry out the reactions that convert propionyl- 
CoA to succinyl-CoA, a TCA cycle intermediate. (Because propionyl-CoA is a degra- 
dation product of methionine, valine, and isoleucine, this sequence of reactions is 
also important in amino acid catabolism, as we shall see in Chapter 25.) The pathway 
involves an initial carboxylation at the a-carbon of propionyl-CoA to produce 
D-methylmalonyl-CoA (Figure 23.18). The reaction is catalyzed by a biotin-dependent 
enzyme, propionyl-CoA carboxylase. The mechanism involves ATP-driven carboxyla- 
tion of biotin at Nj, followed by nucleophilic attack by the a-carbanion of propionyl- 
CoA in a stereospecific manner. 

D-Methylmalonyl-CoA, the product of this reaction, is converted to the 
L-isomer by methylmalonyl-CoA epimerase (Figure 23.18). (This enzyme has often 
and incorrectly been called “methylmalonyl-CoA racemase.” It is not a racemase 
because the CoA moiety contains five other asymmetric centers.) The epimerase 
reaction involves a carbanion at the a-position formed via a reversible dissociation 
of the acidic a-proton (Figure 23.19). The L-isomer is the substrate for methyl- 
malonyl-CoA mutase. Methylmalonyl-CoA epimerase is an impressive catalyst. The 
pKa for the proton that must dissociate to initiate this reaction is approximately 21! 
If binding of a proton to the a-anion is diffusion limited, with kon = 10° M7! sec}, 
then the initial proton dissociation must be rate limiting and the rate constant 
must be 


kot = Ka * kon = (1077! M) + (10° M~! sec7!) = 107!? sec™! 


The turnover number of methylmalonyl-CoA epimerase is 100 sec™!, and thus the 
enzyme enhances the reaction rate by a factor of 10". 


AB, -Catalyzed Rearrangement Yields Succinyl-CoA 
from L-Methylmalonyl-CoA 


The third reaction, catalyzed by methylmalonyl-CoA mutase, is quite unusual be- 
cause it involves a migration of the carbonyl-CoA group from one carbon to its 
neighbor (Figure 23.18). The mutase reaction is vitamin B;.>—dependent and begins 
with homolytic cleavage of the Co**—C bond in 5'-deoxyadenosylcobalamin, re- 
ducing the cobalt to Co?* (see A Deeper Look, page 712). Transfer of a hydrogen 
atom from the substrate to the deoxyadenosyl group produces a methylmalonyl- 
CoA radical, which then can undergo a classic Bj>-catalyzed rearrangement to yield 
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Resonance-stabilized carbanion intermediate 


FIGURE 23.19 The methylmalonyl-CoA epimerase mechanism involves a resonance-stabilized carbanion at the 
a-position. 
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FIGURE 23.18 The conversion of propionyl-CoA (formed 
from -oxidation of odd-carbon fatty acids) to succinyl- 


CoA is carried out by a trio of enzymes, as shown. 
Succinyl-CoA can enter the TCA cycle. 


712 Chapter 23 Fatty Acid Catabolism 





A DEEPER LOOK 
The Activation of Vitamin B42 


Conversion of inactive vitamin By, to active 5’ -deoxyadenosylcobalamin 
involves three steps (see accompanying figure). Two flavoprotein 
reductases sequentially convert Co** in cyanocobalamin to the 
Co** state and then to the Co* state. Co* is an extremely power- 
ful nucleophile. It attacks the C-5’ carbon of ATP as shown, expel- 


Because two electrons from Co* are donated to the Co—carbon 
bond, the oxidation state of cobalt reverts to Co** in the active 
coenzyme. This is one of only two known adenosyl transfers (that 
is, nucleophilic attack on the ribose 5'-carbon of ATP) in biologi- 
cal systems. (The other is the formation of S-adenosylmethionine; 


ling the triphosphate anion to form 5’-deoxyadenosylcobalamin. see Chapter 25.) 
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A Formation of the active coenzyme 5'-deoxyadenosylcobalamin from inactive vitamin Bj, is initiated 
by the action of flavoprotein reductases. The resulting Co* species, dubbed a supernucleophile, attacks 
the 5'-carbon of ATP in an unusual adenosyl transfer. Homolytic cleavage of the Co*-C bond produces 
a reactive free radical that facilitates rearrangements such as that in the methylmalonyl-CoA mutase 
reaction. 











a succinyl-CoA radical. Hydrogen transfer from the deoxyadenosyl group yields 
succinyl-CoA and regenerates the Bı coenzyme (see problem 16 at the end of the 
chapter). 


Net Oxidation of Succinyl-CoA Requires Conversion to Acetyl-CoA 


Succinyl-CoA derived from propionyl-CoA can enter the TCA cycle. Oxidation of 
succinate to oxaloacetate provides a substrate for glucose synthesis. Thus, al- 
though the acetate units produced in -oxidation cannot be utilized in gluco- 
neogenesis by animals, the occasional propionate produced from oxidation of 
odd-carbon fatty acids can be used for sugar synthesis. Alternatively, succinate in- 
troduced to the TCA cycle from odd-carbon fatty acid oxidation may be oxidized 
to CO». However, all of the four-carbon intermediates in the TCA cycle are re- 
generated in the cycle and thus should be viewed as catalytic species. Net con- 
sumption of succinyl-CoA thus does not occur directly in the TCA cycle. Rather, 
the succinyl-CoA generated from -oxidation of odd-carbon fatty acids must be 
converted to pyruvate and then to acetyl-CoA (which is completely oxidized in the 
TCA cycle). To follow this latter route, succinyl-CoA entering the TCA cycle must 
be first converted to malate in the usual way and then transported from the mi- 
tochondrial matrix to the cytosol, where it is oxidatively decarboxylated to pyru- 
vate and CO, by malic enzyme, as shown in Figure 23.20. Pyruvate can then be 
transported back to the mitochondrial matrix, where it enters the TCA cycle via 
pyruvate dehydrogenase. Note that malic enzyme plays a role in fatty acid synthe- 
sis (see Figure 24.1). 
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An Isomerase and a Reductase Facilitate the G-Oxidation 
of Unsaturated Fatty Acids 


Unsaturated fatty acids are also catabolized by B-oxidation, but two additional mito- 
chondrial enzymes—an isomerase and a novel reductase—are required to handle 
the cis double bonds of naturally occurring fatty acids. As an example, consider 
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CENGAGENOW” Go to CengageNOW and click 
Cengagelnteractive to interact with a mechanism 
for the methylmalonyl mutase reaction. 


FIGURE 23.20 The malic enzyme reaction proceeds by 
oxidation of malate to oxaloacetate, followed by decar- 
boxylation to yield pyruvate. 





Numerous beneficial effects have been attributed in recent years to 
the polyphenolic compounds in foods such as chocolate, red wine, 
and (black or green) tea. Principal among these compounds are the 
catechins, a class of antioxidants that have demonstrated protective 
effects against certain cancers, obesity, and heart disease, as well as 
other beneficial health effects. Is it possible to demonstrate specific 
effects of catechins on cellular metabolism? Takatoshi Murase and 
his colleagues have shown that green tea extract, consisting almost 
entirely of six catechins (shown in the figure), added to the diets of 
rats increased muscle glycogen content, decreased muscle fatty acid 
synthesis, and increased fatty acid oxidation by rat skeletal muscle by 
36%. In these same rats, exercise endurance capacity for swimming 
and running increased significantly, suggesting that fatty acids can 
be an effective source of energy in exercising muscle. Measure- 
ments of the catechin content of a range of teas, wines, and other 
beverages indicate that the broadest range and the highest levels of 
these compounds are found in tea, but that red wine has significant A Tea plants 
levels of catechins. (Green tea and black tea are made from the 

same shrub, Camellia sinensis.) 
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FIGURE 23.21 68-Oxidation of unsaturated fatty acids. In 
the case of oleoyl-CoA, three B-oxidation cycles pro- 
duce three molecules of acetyl-CoA and leave cis-A?- 
dodecenoyl-CoA. Rearrangement of enoyl-CoA iso- 
merase gives the trans-A species, which then proceeds 
normally through the B-oxidation pathway. 








Structure of acyl-CoA oxidase from rat liver (pdb id = 

1IS2). The enzyme is a homodimer, each monomer consist- 
ing of an N-terminal a-helical domain (blue), a B-sheet do- 
main (orange), and a C-terminal a-helical domain (violet). 


the breakdown of oleic acid, an 18-carbon chain with a double bond at the 
9,10-position. The reactions of B-oxidation proceed normally through three cycles, 
producing three molecules of acetyl-CoA and leaving the degradation product cis- 
A®-dodecenoyl-CoA, shown in Figure 23.21. This intermediate is not a substrate for 
acyl-CoA dehydrogenase. With a double bond at the 3,4-position, it is not possible 
to form another double bond at the 2,3- (or B-) position. As shown in Figure 23.21, 
this problem is solved by enoyl-CoA isomerase, an enzyme that rearranges this 
cis-A? double bond to a trans-A? double bond. This latter species can proceed 
through the normal route of B-oxidation. 


Degradation of Polyunsaturated Fatty Acids Requires 
2,4-Dienoyl-CoA Reductase 


Polyunsaturated fatty acids pose a slightly more complicated situation for the 
cell. Consider, for example, the case of linoleic acid shown in Figure 23.22. As 
with oleic acid, G-oxidation proceeds through three cycles, and enoyl-CoA iso- 
merase converts the cis-A? double bond to a trans-A? double bond to permit one 
more round of B-oxidation. What results this time, however, is a cis-A* enoyl-CoA, 
which is converted normally by acyl-CoA dehydrogenase to a trans-A?, cis-A‘* 
species. This, however, is a poor substrate for the enoyl-CoA hydratase. This 
problem is solved by 2,4-dienoyl-CoA reductase, the product of which depends 
on the organism. The mammalian form of this enzyme produces a trans-A® enoyl 
product, as shown in Figure 23.22; this enoyl product can be converted by an 
enoyl-CoA isomerase to the trans-A* enoyl-CoA, which can then proceed nor- 
mally through the f-oxidation pathway. Escherichia coli possesses a 2,4-dienoyl- 
CoA reductase that reduces the double bond at the 4,5-position to yield the 
trans-A* enoyl-CoA product in a single step. 


2350 Are There Other Ways to Oxidize Fatty Acids? 
Peroxisomal -Oxidation Requires FAD-Dependent Acyl-CoA Oxidase 


Although -oxidation in mitochondria! is the principal pathway of fatty acid ca- 
tabolism, several other minor pathways play important roles in fat catabolism. 
For example, organelles other than mitochondria, including peroxisomes and gly- 
oxysomes, Carry out B-oxidation processes. Peroxisomes are so named because 
they carry out a variety of flavin-dependent oxidation reactions, regenerating Ox- 
idized flavins by reaction with oxygen to produce hydrogen peroxide, H2Os. Per- 
oxisomal -oxidation is similar to mitochondrial 6-oxidation, except that the ini- 
tial double-bond formation is catalyzed by an FAD-dependent acyl-CoA oxidase 
(Figure 23.23). The action of this enzyme in the peroxisomes transfers the liber- 
ated electrons directly to oxygen instead of the electron-transport chain. As a re- 
sult, each two-carbon unit oxidized in peroxisomes produces fewer ATPs. (Com- 
pare Figure 23.23 with Figure 23.12.) The enzymes responsible for fatty acid 
oxidation in peroxisomes are inactive with acyl chains of eight carbons or fewer. 
Such short-chain products must be transferred to the mitochondria for further 
breakdown. Similar 6-oxidation enzymes are also found in glyoxysomes— 
peroxisomes in plants that also carry out the reactions of the glyoxylate pathway. 


Branched-Chain Fatty Acids Are Degraded Via a-Oxidation 


Although £-oxidation is universally important, there are some instances in which 
it cannot operate effectively. For example, branched-chain fatty acids with alkyl 


18-Oxidation does not occur significantly in plant mitochondria. Most B-oxidation in plants occurs in 
peroxisomes. 
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FIGURE 23.22 The oxidation pathway for polyunsaturated fatty acids in mammals, illustrated for linoleic acid. 
Three cycles of B-oxidation on linoleoyl-CoA yield the cis-4?, cis-A® intermediate, which is converted to a 
trans-A2, cis-A® intermediate. An additional round of B-oxidation gives cis-A* enoyl-CoA, which is oxidized 

to the trans-A?, cis-A* species by acyl-CoA dehydrogenase. The subsequent action of 2,4-dienoyl-CoA reduc- 
tase yields the trans-A? product, which is converted by enoyl-CoA isomerase to the trans-A? form. Normal 
B-oxidation then produces five molecules of acetyl-CoA. 
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FIGURE 23.23 The acyl-CoA oxidase reaction in peroxi- 
somes. Electrons captured as FADH; are used to pro- 
duce the hydrogen peroxide required for degradative 
processes in peroxisomes and thus are not available for 
eventual generation of ATP. (Compare this reaction with 
the acyl-coA dehydrogenase of mitochondria shown in 
Figure 23.12.) 
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FIGURE 23.24 Branched-chain fatty acids are oxidized by a-oxidation, as shown for phytanic acid. The product of 
the phytanic acid oxidase, pristanic acid, is a suitable substrate for normal B-oxidation. lsobutyryl-CoA and 
propionyl-CoA can both be converted to succinyl-CoA, which can enter the TCA cycle. 


branches at odd-numbered carbons are not effective substrates for B-oxidation. For 
such species, a-oxidation is a useful alternative. Consider phytol, a breakdown prod- 
uct of chlorophyll that occurs in the fat of ruminant animals such as sheep and cows 
and also in dairy products. Ruminants oxidize phytol to phytanic acid, and digestion 
of phytanic acid from dairy products is thus an important dietary consideration for 
humans. The methyl group at C-3 will block 6-oxidation, but, as shown in Figure 
23.24, phytanic acid a-hydroxylase places an —OH group at the a-carbon, and 
phytanic acid a-oxidase decarboxylates it to yield pristanic acid. The CoA ester of this 
metabolite can undergo -oxidation in the normal manner. The terminal product, 
isobutyryl-CoA, can be sent into the TCA cycle by conversion to succinyl-CoA. 


o)-Oxidation of Fatty Acids Yields Small Amounts of Dicarboxylic Acids 


In the endoplasmic reticulum of eukaryotic cells, the oxidation of the terminal car- 
bon of a normal fatty acid—a process termed omega oxidation (w-oxidation)—can 
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HUMAN BIOCHEMISTRY 


Refsum’s Disease Is a Result of Defects in a-Oxidation 


The a-oxidation pathway is defective in Refsum’s disease, an in- 
herited metabolic disorder that results in defective night vision, 
tremors, and other neurologic abnormalities. These symptoms are 
caused by accumulation of phytanic acid in the body. Treatment of 


Refsum’s disease requires a diet free of chlorophyll, the precursor 
of phytanic acid. This regimen is difficult to implement because all 
green vegetables and even meat from plant-eating animals, such as 
cows, pigs, and poultry, must be excluded from the diet. 








lead to the synthesis of small amounts of dicarboxylic acids (Figure 23.25). COO 
Cytochrome P-450, a monooxygenase enzyme that requires NADPH as a coenzyme 

and uses Oz as a substrate, places a hydroxyl group at the terminal carbon. Subse- (CHy) 10 
quent oxidation to a carboxyl group produces a dicarboxylic acid. Either end can CH, 


form an ester linkage to CoA and be subjected to B-oxidation, producing a variety 
of smaller dicarboxylic acids. (Cytochrome P-450-—dependent monooxygenases 
also play an important role as agents of detoxication, the degradation and metab- 
olism of toxic hydrocarbon agents.) 
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v 
COO 


(CHo) 10 
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FIGURE 23.25 Dicarboxylic acids can be formed by oxi- 
dation of the omega carbon of fatty acids in a cyto- 
chrome P-450-dependent reaction. 


Ketone Bodies Are a Significant Source of Fuel and Energy 
for Certain Tissues 


Most of the acetyl-CoA produced by the oxidation of fatty acids in liver mito- 
chondria undergoes further oxidation in the TCA cycle, as stated earlier. How- 
ever, some of this acetyl-CoA is converted to three important metabolites: ace- 
tone, acetoacetate, and B-hydroxybutyrate. The process is known as ketogenesis, 
and these three metabolites are traditionally known as ketone bodies, despite the 
fact that B-hydroxybutyrate does not contain a ketone function. These three 
metabolites are synthesized primarily in the liver but are important sources of fuel 
and energy for many tissues, including brain, heart, and skeletal muscle. The 
brain, for example, normally uses glucose as its source of metabolic energy. How- 
ever, during periods of starvation, ketone bodies may be the major energy source 
for the brain. Acetoacetate and 3-hydroxybutyrate are normal substrates for kid- 
ney cortex and for heart muscle. 

Ketone body synthesis occurs only in the mitochondrial matrix. The reactions re- 
sponsible for the formation of ketone bodies are shown in Figure 23.26. The first 
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Large Amounts of Ketone Bodies Are Produced in Diabetes Mellitus 





Diabetes mellitus is the most common endocrine disease and the 
third leading cause of death in the United States, with approxi- 
mately 6 million diagnosed cases and an estimated 4 million more 
borderline but undiagnosed cases. Diabetes is characterized by an 
abnormally high level of glucose in the blood. In type 1 diabetes 
(representing 10% or less of all cases), elevated blood glucose re- 
sults from inadequate secretion of insulin by the islets of Langer- 
hans in the pancreas. Type 2 diabetes (at least 90% of all cases) 
results from an insensitivity to insulin. Type 2 diabetics produce 
normal or even elevated levels of insulin, but their cells are not 
responsive to insulin, often due to a shortage of insulin receptors 
(see Chapter 32). In both cases, transport of glucose into muscle, 


liver, and adipose tissue is significantly reduced, and despite 
abundant glucose in the blood, the cells are metabolically starved. 
They respond by turning to increased gluconeogenesis and ca- 
tabolism of fat and protein. In type 1 diabetes, increased gluco- 
neogenesis consumes most of the available oxaloacetate, but 
breakdown of fat (and, to a lesser extent, protein) produces large 
amounts of acetyl-CoA. This increased acetyl-CoA would nor- 
mally be directed into the TCA cycle, but with oxaloacetate in 
short supply, it is used instead for production of unusually large 
amounts of ketone bodies. Acetone can often be detected on the 
breath of type 1 diabetics, an indication of high plasma levels of 
ketone bodies. 
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FIGURE 23.27 Reconversion of ketone bodies to acetyl- 
CoA in the mitochondria of many tissues (other than 
liver) provides significant metabolic energy. 
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FIGURE 23.26 The formation of ketone bodies, synthesized primarily in liver mitochondria. 


reaction—the condensation of two molecules of acetyl-CoA to form acetoacetyl- 
CoA—is catalyzed by thiolase, which is also known as acetoacetyl-CoA thiolase or 
acetyl-CoA acetyltransferase. This is the same enzyme that carries out the thiolase 
reaction in B-oxidation, but here it runs in reverse. The second reaction adds an- 
other molecule of acetyl-CoA to give 3-hydroxy-3-methylglutaryl-CoA, commonly ab- 
breviated HMG-CoA. These two mitochondrial matrix reactions are analogous to 
the first two steps in cholesterol biosynthesis, a cytosolic process, as we shall see in 
Chapter 24. HMG-CoA is converted to acetoacetate and acetyl-CoA by the action of 
HMG-CoA lyase in a mixed aldol-Claisen ester cleavage reaction. This reaction is 
mechanistically similar to the reverse of the citrate synthase reaction in the TCA 
cycle. A membrane-bound enzyme, B-hydroxybutyrate dehydrogenase, then can re- 
duce acetoacetate to B-hydroxybutyrate. 

Acetoacetate and ß-hydroxybutyrate are transported through the blood from 
liver to target organs and tissues, where they are converted to acetyl-CoA (Figure 
23.27). Ketone bodies are easily transportable forms of fatty acids that move through the cir- 
culatory system without the need for complexation with serum albumin and other fatty acid- 
binding proteins. 
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SUMMARY 


23.1 How Are Fats Mobilized from Dietary Intake and Adipose Tissue? 
Triacylglycerols are a major source of fatty acids in the diet, and they are 
also our principal stored energy reserve in adipose tissue. Hormone 
messengers such as adrenaline, glucagon, and ACTH bind to receptors 
on the plasma membrane of adipose cells and lead to the activation of 
a triacylglycerol lipase that hydrolyzes a fatty acid from C-1 or C-3 of tri- 
acylglycerols. Subsequent actions of diacylglycerol lipase and monoacyl- 
glycerol lipase yield fatty acids and glycerol. The cell then releases the 
fatty acids into the blood, where they are carried to sites of utilization. 
Dietary triacylglycerols are degraded by lipases and esterases in the 
stomach and duodenum. Pancreatic lipase cleaves fatty acids from 
the C-1 and C3 positions of triacylglycerols, and other lipases and es- 
terases attack the C-2 position. Bile salts act as detergents to emulsify the 
triacylglycerols and facilitate the hydrolytic activity of the lipases and 
esterases. 


23.2 How Are Fatty Acids Broken Down? The process of 8-oxidation 
begins with the formation of a thiol ester bond between the fatty acid and 
the thiol group of coenzyme A, catalyzed by acyl-CoA synthetase. The en- 
zymes of the B-oxidation pathway are located in the mitochondrial ma- 
trix. Short-chain fatty acids are transported into the matrix as free acids 
and form the acyl-CoA derivatives there. However, long-chain fatty acyl- 
CoA derivatives must first be converted to acylcarnitine derivatives, which 
are transported across the inner membrane by a translocase. On the ma- 
trix side of the inner membrane, a second acyl carnitine transferase re- 
forms the fatty acyl-CoA. The process of -oxidation involves a recurring 
cycle of four steps. A double bond is formed, water is added across the 
double bond, and the resulting alcohol is oxidized to a carbonyl group. 
The fourth reaction of the cycle cleaves the resulting B-keto ester, pro- 
ducing an acetate unit and leaving a fatty acid chain that is two carbons 
shorter. 


23.3 How Are Odd-Carbon Fatty Acids Oxidized? Humans and animals 
metabolize odd-carbon fatty acids via the B-oxidation pathway, with the fi- 
nal product being propionyl-CoA. Three specialized enzymes then con- 
vert propionyl-CoA to succinyl-CoA, a TCA cycle intermediate. The path- 
way involves an initial ATP-dependent carboxylation (by propionyl-CoA 
carboxylase) at the a-carbon of propionyl-CoA to produce D-methyl- 
malonyl-CoA, which is converted to the L-isomer by methylmalonyl-CoA 
epimerase. The L-isomer is the substrate for methylmalonyl-CoA mutase, 


which catalyzes a migration of a carbonyl-CoA group from one carbon to 
its neighbor, yielding succinyl-CoA. 


23.4 How Are Unsaturated Fatty Acids Oxidized? ‘Two additional mito- 
chondrial enzymes—an isomerase and a novel reductase—are required 
to handle the cis-double bonds of naturally occurring fatty acids. Con- 
sider the breakdown of oleic acid. The reactions of B-oxidation proceed 
normally through three cycles, producing three molecules of acetyl-CoA 
and leaving the product cis-A*-dodecenoyl-CoA. This intermediate is not 
a substrate for acyl-CoA dehydrogenase. Instead, enoyl-CoA isomerase re- 
arranges the cis-A® double bond to a trans-A? double bond, which can pro- 
ceed through the normal route of B-oxidation. 


23.5 Are There Other Ways to Oxidize Fatty Acids? Organelles other 
than mitochondria, including peroxisomes and glyoxysomes, also carry 
out B-oxidation processes. Peroxisomal -oxidation is similar to mito- 
chondrial B-oxidation, except that the initial double-bond formation is 
catalyzed by an FAD-dependent acyl-CoA oxidase, which transfers the 
liberated electrons directly to oxygen instead of the electron-transport 
chain. Short-chain products must be transferred to the mitochondria 
for further breakdown. Similar -oxidation enzymes are also found in 
glyoxysomes. 

Branched-chain fatty acids with alkyl branches at odd-numbered 
carbons are not effective substrates for B-oxidation. For such species, 
a-oxidation is a useful alternative. Ruminants oxidize phytol to phy- 
tanic acid. The methyl group at C-3 will block B-oxidation, but phytanic 
acid a-hydroxylase places an —OH group at the a-carbon, and phy- 
tanic acid a-oxidase decarboxylates it to yield pristanic acid. The CoA 
ester of this metabolite can undergo -oxidation in the normal man- 
ner. The terminal product, isobutyryl-CoA, can be sent into the TCA cy- 
cle by conversion to succinyl-CoA. 


23.6 What Are Ketone Bodies, and What Role Do They Play in Metabo- 
lism? Acetone, acetoacetate, and B-hydroxybutyrate are known as ke- 
tone bodies. These three metabolites are synthesized primarily in the 
liver but are important sources of fuel and energy for many tissues, in- 
cluding brain, heart, and skeletal muscle. During periods of starvation, 
ketone bodies may be the major energy source for the brain. Aceto- 
acetate and 3-hydroxybutyrate are normal substrates for kidney cortex 
and for heart muscle. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. Calculate the volume of metabolic water available to a camel 
through fatty acid oxidation if it carries 30 pounds of triacylglycerol 
in its hump. 

2. Calculate the approximate number of ATP molecules that can be 
obtained from the oxidation of cis-11-heptadecenoic acid to CO% 
and water. 


3. Phytanic acid, the product of chlorophyll that causes problems for 
individuals with Refsum’s disease, is 3,7,11,15-tetramethyl hexadec- 
anoic acid. Suggest a route for its oxidation that is consistent with 
what you have learned in this chapter. (Hint: The methyl group at 
C-3 effectively blocks hydroxylation and normal f-oxidation. You 
may wish to initiate breakdown in some other way.) 

4. Even though acetate units, such as those obtained from fatty acid 
oxidation, cannot be used for net synthesis of carbohydrate in 
animals, labeled carbon from 'C-labeled acetate can be found in 
newly synthesized glucose (for example, in liver glycogen) in ani- 
mal tracer studies. Explain how this can be. Which carbons of glu- 
cose would you expect to be the first to be labeled by 'C-labeled 
acetate? 


5. Human serum albumin (66.4 kD) is a soluble protein present in 
blood at 0.75 mM or so. Among other functions, albumin acts as the 
major transport vehicle for fatty acids in the circulation, carrying 
fatty acids from storage sites in adipose tissue to their sites of oxi- 
dation in liver and muscle. The albumin molecule has up to 11 dis- 
tinct binding sites. Consult the biochemical literature to learn 
about the fatty acid-binding sites of albumin. Where are they lo- 
cated on the protein? What are their relative affinities? (Two suit- 
able references with which to begin your study are Bhattacharya, A. 
A., Grüne, T., and Curry, S., 2000. Crystallographic analysis reveals 
common modes of binding of medium- and long-chain fatty acids 
in human serum albumin. Journal of Molecular Biology 303:721-732; 
and Simard, J. R., Zunszain, P. A., Ha, C-E., et al., 2005. Locating 
high-affinity fatty acid-binding sites on albumin by X-ray crystallog- 
raphy and NMR spectroscopy. Proceedings of the National Academy of 
Sciences U.S.A. 102:17958-17963.) 


6. Overweight individuals who diet to lose weight often view fat in neg- 
ative ways because adipose tissue is the repository of excess caloric 
intake. However, the “weighty” consequences might be even worse 
if excess calories were stored in other forms. Consider a person who 
is 10 pounds “overweight,” and estimate how much more he or she 
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10. 


11 


12. 


13. 


14. 


15. 


16. 


would weigh if excess energy were stored in the form of carbohy- 
drate instead of fat. 


. What would be the consequences of a deficiency in vitamin By» for fatty 


acid oxidation? What metabolic intermediates might accumulate? 


. Write properly balanced chemical equations for the oxidation to 


CO» and water of (a) myristic acid, (b) stearic acid, (c) a-linolenic 
acid, and (d) arachidonic acid. 


. How many tritium atoms are incorporated into acetate if a molecule 


of palmitic acid is oxidized in 100% tritiated water? 


What would be the consequences of a carnitine deficiency for fatty 
acid oxidation? 


. The ruby-throated hummingbird flies 500 miles nonstop across the 


Gulf of Mexico. The flight takes 10 hours at 50 mph. The hum- 
mingbird weighs about 4 grams at the start of the flight and about 
2.7 grams at the end. Assuming that all the lost weight is fat burned 
for the flight, calculate the total energy required by the humming- 
bird in this prodigious flight. Does anything about the results of this 
calculation strike you as unusual? 


Energy production in animals is related to oxygen consumption. 
The ruby-throated hummingbird consumes about 250 mL of oxygen 
per hour during its migration across the Gulf of Mexico. Use this 
number and the data in problem 11 to determine a conversion fac- 
tor for energy expended per liter of oxygen consumed. If a human 
being consumes 12.7 kcal/min while running 8-minute miles, how 
long could a human run on the energy that the hummingbird con- 
sumes in its trans-Gulf flight? How many 8-minute miles would a per- 
son have to run to lose 1 pound of body fat? 

Write a reasonable mechanism for the HMG-CoA synthase reaction 
shown in Figure 23.26. 

The methylmalonyl-CoA mutase reaction (see Figure 23.18) in- 
volves vitamin By as a coenzyme. Write a reasonable mechanism for 
this reaction. (Hint: The reaction begins with abstraction of a hy- 
drogen atom—that is, a proton plus an electron—from the sub- 
strate by vitamin Bı. Consider the chemistry shown in A Deeper 
Look: The Activation of Vitamin Bj, as you write your mechanism.) 
Discuss the changes of the oxidation state of cobalt in the course of 
the methylmalonyl-CoA mutase reaction. Why do they occur as you 
suggested in your answer to problem 14? 

Based on the mechanism for the methylmalonyl-CoA mutase (see 
problem 14), write reasonable mechanisms for the following reac- 
tions shown. 


H H 


| 
-OOC—C—CH, == -00C—C— CH; 

















H CH—COO0O- CH— COO- 
R NH,* 
H H 
CH,—CH—C—OH —> CH;—C—C—H + HO 
la H H | 
H H 
CH,—CH—C—H —> CH,—C—C—H + H,O 
bx bu én du n 6 
H 
CH, l H —> CH; —C—H + NH,* 





17. 


18. 


20. 


A popular dish in the Caribbean islands consists of “akee and salt 
fish.” Unripened akee fruit (from the akee tree, native to West Africa 
but brought to the Caribbean by African slaves) is quite poisonous. 
The unripened fruit contains hypoglycin (structure shown below), a 
metabolite that serves as a substrate for acyl-CoA dehydrogenase. 
However, the product of this reaction irreversibly inhibits the acyl- 
CoA dehydrogenase by reacting covalently with FAD on the enzyme. 
Consumption of unripened akee fruit can lead to vomiting and, in 
severe cases, convulsions, coma, and death. Write a reaction scheme 
to show the product of the acyl-CoA dehydrogenase reaction that 
reacts with FAD. 
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In mammalian mitochondria, three of the enzymes in the membrane- 
bound -oxidation system are components of a multifunctional 
enzyme (MFE). Similarly, glyoxysomes (in plants), peroxisomes (in 
most species), and Gram-negative bacteria carry out several reactions 
of -oxidation via MFEs. In all these cases, the components of an MFE 
must cooperate to carry or “channel” the acyl-CoA substrate from 
one active site to the next in a cyclic fashion. The structure of the com- 
plete MFE from Pseudomonas fragi provides insights into how this 
channeling might occur. Consult the journal article that describes 
this structure (Ishikawa, M., Tsuchiya, D., et al., 2004. EMBO Journal 
23:2745-2754), and explain in your own words how this substrate 
channeling occurs. 


Preparing for the MCAT Exam 
19. 


Study Figure 23.12 and comment on why nature uses FAD/FADH» 
as a cofactor in the acyl-CoA dehydrogenase reaction rather than 
NAD*/NADH. 

Study Figure 23.9. Where else in metabolism have you seen the 
chemical strategy and logic of the -oxidation pathway? Why is it 
that these two pathways are carrying out the same chemistry? 
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4 Lipid Biosynthesis 


ESSENTIAL QUESTION 


We turn now to the biosynthesis of lipid structures. We begin with a discussion of 
the biosynthesis of fatty acids, stressing the basic pathways, additional means of 
elongation, mechanisms for the introduction of double bonds, and regulation of 
fatty acid synthesis. Sections then follow on the biosynthesis of glycerophospho- 
lipids, sphingolipids, eicosanoids, and cholesterol. The transport of lipids through the 
body in lipoprotein complexes is described, and the chapter closes with discussions 
of the biosynthesis of bile salts and steroid hormones. 
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Walruses basking on the beach. 





What are the pathways of lipid synthesis in biological systems? 
To everything there is a season, and a time for 

every purpose under heaven...A time to break 

down and a time to build up. 


Ecclesiastes 3:1-3 We have already seen several cases in which the synthesis of a class of biomolecules 
is conducted differently from degradation (glycolysis versus gluconeogenesis and 
glycogen or starch breakdown versus polysaccharide synthesis, for example). Like- 
wise, the synthesis of fatty acids and other lipid components is different from their 
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24.1 How Are Fatty Acids Synthesized? degradation. Fatty acid synthesis involves a set of reactions that follow a strategy dif- 

24.2 How Are Complex Lipids Synthesized? ferent in several ways from the corresponding degradative process: 

24.3 How Are Eicosanoids Synthesized, and What 1. Intermediates in fatty acid synthesis are linked covalently to the sulfhydryl groups 
Are Their Functions? of special proteins, the acyl carrier proteins (ACPs). In contrast, fatty acid break- 

24.4 How Is Cholesterol Synthesized? down intermediates are bound to the —SH group of coenzyme A. 

24.5 ow Are Lipids Transported Throughout 2. Fatty acid synthesis occurs in the cytosol, whereas fatty acid degradation takes 
he Body? place in mitochondria. 

24.6 How Are Bile Acids Biosynthesized? 3. In animals, the enzymes of fatty acid synthesis are components of one long 

24.7 How Are Steroid Hormones Synthesized polypeptide chain, the fatty acid synthase, whereas no similar association exists 
and Utilized? 


for the degradative enzymes. (Plants and bacteria employ separate enzymes to 
carry out the biosynthetic reactions.) 

4. The coenzyme for the oxidation—-reduction reactions of fatty acid synthesis is 
NADP*/NADPH, whereas degradation involves the NAD*/NADH couple. 
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Formation of Malonyl-CoA Activates Acetate Units 
for Fatty Acid Synthesis 


The design strategy for fatty acid synthesis is this: 


l. Fatty acid chains are constructed by the addition of two-carbon units derived 
from acetyl-CoA. 

. The acetate units are activated by formation of malonyl-CoA (at the expense of ATP). 

3. The addition of two-carbon units to the growing chain is driven by decarboxyla- 
tion of malonyl-CoA. 

4. The elongation reactions are repeated until the growing chain reaches 16 carbons 
in length (palmitic acid). 

5. Other enzymes then add double bonds and additional carbon units to the chain. 


N 


Fatty Acid Biosynthesis Depends on the Reductive Power of NADPH 


The net reaction for the formation of palmitate from acetyl-CoA is 
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(Levels of free fatty acids are very low in the typical cell. The palmitate made in this 
process is rapidly converted to CoA esters in preparation for the formation of tri- 
acylglycerols and phospholipids.) 


Cells Must Provide Cytosolic Acetyl-CoA and Reducing Power 
for Fatty Acid Synthesis 


Eukaryotic cells face a dilemma in providing suitable amounts of substrate for 
fatty acid synthesis. Sufficient quantities of acetyl-CoA, malonyl-CoA, and 
NADPH must be generated in the cytosol for fatty acid synthesis. Malonyl-CoA is 
made by carboxylation of acetyl-CoA, so the problem reduces to generating suf- 
ficient acetyl-CoA and NADPH. 

There are three principal sources of acetyl-CoA (Figure 24.1): 


1. Amino acid degradation produces cytosolic acetyl-CoA. 

2. Fatty acid oxidation produces mitochondrial acetyl-CoA. 

3. Glycolysis yields cytosolic pyruvate, which (after transport into the mitochondria) 
is converted to acetyl-CoA by pyruvate dehydrogenase. 
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FIGURE 24.1 The citrate-malate-pyruvate shuttle provides cytosolic acetate units and some reducing equiva- 
lents (electrons) for fatty acid synthesis. The shuttle collects carbon substrates, primarily from glycolysis but 
also from fatty acid oxidation and amino acid catabolism. Pathways that provide carbon for fatty acid synthe- 
sis are shown in blue; pathways that supply electrons for fatty acid synthesis are shown in red. 
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Biotin carboxylase 
domain of human acetyl-CoA 
carboxylase 2 (pdb id = 2HJW) 


The biotin carboxylase domain from human acetyl-CoA 
carboxylase 2, with the A-subdomain in blue, the 
B-subdomain in red, the A-B linker in green, and the 
C-subdomain in yellow. 


The acetyl-CoA derived from amino acid degradation is normally insufficient for 
fatty acid biosynthesis, and the acetyl-CoA produced by pyruvate dehydrogenase 
and by fatty acid oxidation cannot cross the mitochondrial membrane to participate 
directly in fatty acid synthesis. Instead, acetyl-CoA is linked with oxaloacetate to 
form citrate, which is transported from the mitochondrial matrix to the cytosol (Fig- 
ure 24.1). Here it can be converted back into acetyl-CoA and oxaloacetate by 
ATP-citrate lyase. In this manner, mitochondrial acetyl-CoA becomes the substrate 
for cytosolic fatty acid synthesis. (Oxaloacetate returns to the mitochondria in the 
form of either pyruvate or malate, which is then reconverted to acetyl-CoA and 
oxaloacetate, respectively.) 

NADPH can be produced in the pentose phosphate pathway as well as by malic 
enzyme (Figure 24.1). Reducing equivalents (electrons) derived from glycolysis in 
the form of NADH can be transformed into NADPH by the combined action of 
malate dehydrogenase and malic enzyme: 


Oxaloacetate + NADH + Ht——> malate + NAD* 
Malate + NADP*——> pyruvate + CO. + NADPH + H* 


How many of the 14 NADPH needed to form one palmitate (see equation on 
page 722) can be made in this way? The answer depends on the status of malate. Every 
citrate entering the cytosol produces one acetyl-CoA and one malate (Figure 24.1). 
Every malate oxidized by malic enzyme produces one NADPH, at the expense of a de- 
carboxylation to pyruvate. Thus, when malate is oxidized, one NADPH is pro- 
duced for every acetyl-CoA. Conversion of 8 acetyl-CoA units to one palmitate would 
then be accompanied by production of 8 NADPH. (The other 6 NADPH required, as 
shown in the equation on page 722, would be provided by the pentose phosphate 
pathway.) On the other hand, for every malate returned to the mitochondria, one 
NADPH fewer is produced. 


Acetate Units Are Committed to Fatty Acid Synthesis by Formation 
of Malonyl-CoA 


Rittenberg and Bloch showed in the late 1940s that acetate units are the building 
blocks of fatty acids. Their work, together with the discovery by Salih Wakil that bi- 
carbonate is required for fatty acid biosynthesis, eventually made clear that this 
pathway involves synthesis of malonyl-CoA. The carboxylation of acetyl-CoA to form 
malonyl-CoA is essentially irreversible and is the committed step in the synthesis of 
fatty acids (Figure 24.2). The reaction is catalyzed by acetyl-CoA carboxylase, which 
contains a biotin prosthetic group. This carboxylase is the only enzyme of fatty acid 
synthesis in animals that is not part of the multienzyme complex called fatty acid 
synthase. 


Acetyl-CoA Carboxylase Is Biotin-Dependent and Displays 

Ping-Pong Kinetics 

The biotin prosthetic group of acetyl-CoA carboxylase is covalently linked to the 
€-amino group of an active-site lysine in a manner similar to pyruvate carboxylase 
(see Figure 22.2). The reaction mechanism is also analogous to that of pyruvate car- 
boxylase (see Figure 22.3): ATP-driven carboxylation of biotin is followed by trans- 
fer of the activated CO% to acetyl-CoA to form malonyl-CoA. The enzyme from 
Escherichia coli has three subunits: (1) a biotin carboxyl carrier protein (a dimer of 
22.5-kD subunits); (2) biotin carboxylase (a dimer of 51-kD subunits), which adds 
CO» to the prosthetic group; and (3) carboxyltransferase (an a2f» tetramer with 
30- and 35-kD subunits), which transfers the activated CO», unit to acetyl-CoA. 
The long, flexible biotin—-lysine chain (biocytin) enables the activated carboxyl 
group to be carried between the biotin carboxylase and the carboxyltransferase 
(Figure 24.3). 
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Acetyl-CoA Carboxylase in Animals Is a Multifunctional Protein 


In animals, acetyl-CoA carboxylase (ACC) is a filamentous polymer (4 to 8 X 10° D) 
composed of 265-kD protomers. Each of these subunits contains the biotin carboxyl 
carrier moiety, biotin carboxylase, and carboxyltransferase activities, as well as al- 
losteric regulatory sites. Animal ACC is thus a multifunctional protein. The polymeric 
form is active, but the 265-kD protomers are inactive. The activity of ACC is thus de- 
pendent upon the position of the equilibrium between these two forms: 





Inactive protomers == active polymer 


Because this enzyme catalyzes the committed step in fatty acid biosynthesis, it is 
carefully regulated. Palmitoyl-CoA, the final product of fatty acid biosynthesis, shifts 
the equilibrium toward the inactive protomers, whereas citrate, an important allosteric 
activator of this enzyme, shifts the equilibrium toward the active polymeric form of 
the enzyme. Acetyl-CoA carboxylase shows the kinetic behavior of a Monod—Wyman— 
Changeux V-system allosteric enzyme in which allosteric effectors shift the T/R equi- 
librium between active R conformers and inactive T conformers. 
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CENGAGENOW™ ACTIVE FIGURE 24.2 (a) The acetyl- 
CoA carboxylase reaction produces malonyl-CoA for 
fatty acid synthesis. (b) A mechanism for the acetyl-CoA 
carboxylase reaction. Bicarbonate is activated for car- 
boxylation reactions by formation of N-carboxybiotin. 
ATP drives the reaction forward, with transient forma- 
tion of a carbonylphosphate intermediate (Step 1). In a 
typical biotin-dependent reaction, nucleophilic attack 
by the acetyl-CoA carbanion on the carboxyl carbon of 
N-carboxybiotin—a transcarboxylation—yields the 
carboxylated product (Step 2). Test yourself on 

the concepts in this figure at www.cengage.com/ 
login. 





Carboxyltransferase domain of 
yeast acetyl-CoA carboxylase 
(pdb id = 10D2) 


The carboxyltransferase domain dimer of acetyl-CoA 
carboxylase-1 from Saccharomyces cerevisiae. The N- 
and C-subdomains of one monomer are cyan and yel- 
low, whereas those of the other monomer are purple 
and green. CoA is shown as a ball-and-stick model in 
one subunit. 
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Biotin carboxylase 


FIGURE 24.3 In the acetyl-CoA carboxylase reaction, the 
biotin ring, on its flexible tether, acquires carboxyl 
groups from carbonylphosphate on the biotin 
carboxylase subunit and transfers them to acyl-CoA 
molecules on the carboxyltransferase subunits. Colors 
of the domains correspond to those in Figure 24.4. 


FIGURE 24.4 Schematic of the acetyl-CoA carboxylase 
polypeptide, with domains and phosphorylation sites 
indicated, along with the protein kinases responsible. 
Phosphorylation at Ser'?°° is primarily responsible for 
decreasing the affinity for citrate. 





Phosphorylation of ACC Modulates Activation by Citrate 
and Inhibition by Palmitoyl-CoA 


The regulatory effects of citrate and palmitoyl-CoA are dependent on the phosphorylation state of 
acetyl-CoA carboxylase. The animal enzyme is phosphorylated at eight to ten sites on 
each enzyme subunit (Figure 24.4). Some of these sites are regulatory, whereas oth- 
ers are “silent” and have no effect on enzyme activity. Unphosphorylated acetyl-CoA 
carboxylase binds citrate with high affinity and thus is active at very low citrate con- 
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centrations (Figure 24.5). Phosphorylation of the regulatory sites decreases the affin- 
ity of the enzyme for citrate, and in this case, high levels of citrate are required to 
activate the carboxylase. The inhibition by fatty acyl-CoAs operates in a similar but op- 
posite manner. Thus, low levels of fatty acyl-CoA inhibit the phosphorylated carboxy- 
lase, but the dephosphoenzyme is inhibited only by high levels of fatty acyl-CoA. Spe- 
cific phosphatases act to dephosphorylate ACC, thereby increasing the sensitivity to 
citrate. 


Acyl Carrier Proteins Carry the Intermediates in Fatty Acid Synthesis 


The basic building blocks of fatty acid synthesis are acetyl and malonyl groups, but 
they are not transferred directly from CoA to the growing fatty acid chain. Rather, 
they are first passed to ACP. This protein consists (in E. colt) of a single polypeptide 
chain of 77 residues to which is attached (on a serine residue) a phosphopante- 
theine group, the same group that forms the “business end” of coenzyme A. Thus, 
ACP is a somewhat larger version of coenzyme A, specialized for use in fatty acid 
biosynthesis (Figure 24.6). 


In Some Organisms, Fatty Acid Synthesis Takes Place 
in Multienzyme Complexes 


The enzymes that catalyze formation of acetyl-ACP and malonyl-ACP and the subse- 
quent reactions of fatty acid synthesis are organized quite differently in different or- 
ganisms. Fatty acid synthesis in mammals occurs on homodimeric fatty acyl synthase 
I (FAS I), each 270-kD polypeptide of which contains all reaction centers required to 
produce a fatty acid. In lower eukaryotes, such as yeast and fungi, the enzymatic 





Phosphopantetheine group of coenzyme A 





Phosphopantetheine prosthetic group of ACP 


FIGURE 24.6 Fatty acids are conjugated both to coenzyme A and to acyl carrier protein through the sulfhydryl 
of phosphopantetheine prosthetic groups. 
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FIGURE 24.5 The activity of acetyl-CoA carboxylase is 
modulated by phosphorylation and dephosphorylation. 
The dephospho form of the enzyme is activated by low 
[citrate] and inhibited only by high levels of fatty acyl- 
CoA. In contrast, the phosphorylated form of the enzyme 
is activated only by high levels of citrate but is very sensi- 
tive to inhibition by fatty acyl-CoA. 


Adenine 





A DEEPER LOOK 


Choosing the Best Organism for the Experiment 


cado, which has one of the highest fatty acid contents in the plant the lab! 
kingdom. Early animal studies centered primarily on pigeons, which 





The selection of a suitable and relevant organism is an important are easily bred and handled and which possess high levels of fats in 
part of any biochemical investigation. The studies that revealed their tissues. Other animals, richer in fatty tissues, might be even 
the secrets of fatty acid synthesis are a good case in point. more attractive but more challenging to maintain. Grizzly bears, for 

The paradigm for fatty acid synthesis in plants has been the avo- example, carry very large fat reserves but are difficult to work with in 
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activities of FAS are distributed on two multifunctional peptide chains, which form 
2.6-megadalton a8, complexes. In plants, most bacteria, and parasites, the enzymes 
of fatty acid synthesis are separated and independent, and this collection of enzymes 
is referred to as fatty acyl synthase II (FAS II). 

The individual steps in the elongation of the fatty acid chain are quite similar 
across all organisms. The mammalian pathway (Figure 24.7) is a cycle of elongation 
that involves six enzyme activities. The elongation cycle is initiated by transfer of the 
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acyl moiety of acetyl-CoA to the acyl carrier protein by the malonyl-CoA-—acetyl-CoA- 
ACP transacylase (MAT), which also transfers the malonyl group of malonyl-CoA 
to ACP. 


Decarboxylation Drives the Condensation of Acetyl-CoA 
and Malonyl-CoA 


The B-ketoacyl-ACP synthase (KS) catalyzes the decarboxylative condensation of the 
acyl group with malonyl-ACP to produce a B-ketoacyl-ACP intermediate (acetoacetyl- 
ACP in the first cycle). The mechanism (Figure 24.8) begins with acetyl group trans- 
fer to MAT, followed with attack by the ACP thiol sulfur to form an acetyl-ACP. The 
acetyl group is transferred to a cysteine sulfur on KS, freeing the ACP thiol to acquire 
the malonyl group. In the condensation reaction that follows, decarboxylation of the 
malonyl group creates a transient, highly nucleophilic carbanion that can attack the 
acetate group. 

The net reaction for each turn of this cycle (see Figure 24.7) is addition of a two- 
carbon unit to the acyl group. Why is the three-carbon malonyl group used here as a two- 
carbon donor? The answer is that this is yet another example of a decarboxylation 
driving a desired but otherwise thermodynamically unfavorable reaction. The de- 
carboxylation that accompanies the reaction with malonyl-ACP drives the synthesis 
of acetoacetyl-ACP. Note that hydrolysis of ATP drove the carboxylation of acetyl- 
CoA to form malonyl-ACP, so, indirectly, ATP is responsible for the condensation re- 
action to form acetoacetyl-ACP. Malonyl-CoA can be viewed as a form of stored en- 
ergy for driving fatty acid synthesis. 

It is also worth noting that the carbon of the carboxyl group that was added to 
drive this reaction is the one removed by the condensing enzyme. Thus, all the car- 
bons of acetoacetyl-ACP (and of the fatty acids to be made) are derived from acetate 
units of acetyl-CoA. 


Reduction of the 3-Carbonyl Group Follows a Now-Familiar Route 


The next three steps—reduction of the 6-carbonyl group by B-ketoacyl-ACP re- 
ductase (KR) to form a $-alcohol, then dehydration by B-hydroxyacyl-ACP dehy- 
dratase (DH) and reduction by 2,3-trans-enoyl-ACP reductase (ER) to saturate the 
chain (see Figure 24.7)—look very similar to the fatty acid degradation pathway in 
reverse. However, there are two crucial differences between fatty acid biosynthesis 
and fatty acid oxidation (besides the fact that different enzymes are involved): First, 
the alcohol formed in biosynthesis has the D-configuration rather than the L-form 
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FIGURE 24.8 A mechanism for mammalian ketoacy! synthase. An acetyl group 
is transferred from CoA to MAT, then to the acyl carrier protein, and then to 
ketoacyl synthase. Next,a malonyl group is transferred to MAT and then to the 
acyl carrier protein. Decarboxylation of the malonyl group creates a transient 
carbanion on the acyl group of ACP which attacks the KS acetyl group to form 
a ketoacyl-ACP. A cycle (see Figure 24.7) of keto group reduction (by KR), water 
removal (by DH), and double bond reduction (by ER; see next section) will 
finally produce an acyl group increased in length by two carbons. 
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AT: Acetyl transferase 

MPT: Malonyl/palmitoyl transferase 

MAT: Malonyl-CoA-acetyl-CoA-ACP 
transacylase 

TE: Thioesterase 

ACP: Acyl carrier protein 

PPT: Phosphopantetheinyl 
transferase 

KR: B-Ketoacyl reductase 

KS: B-Ketoacyl synthase 

ER: B-Enoyl reductase 

DH: Dehydratase 


seen in catabolism; second, the reducing coenzyme is NADPH, whereas NAD+ and 
FAD are the oxidants in the catabolic pathway. 

The net result of the first turn of the biosynthetic cycle is the synthesis of a four- 
carbon unit, a butyryl group, from two smaller building blocks. In the next cycle of 
the process, this butyryl-ACP condenses with another malonyl-ACP to make a six- 
carbon B-ketoacyl-ACP and CO». Subsequent reduction to a B-alcohol, dehydration, 
and another reduction yield a six-carbon saturated acyl-ACP. This cycle continues 
with the net addition of a two-carbon unit in each turn until the chain is 16 carbons 
long (see Figure 24.7). The KS cannot accommodate larger substrates, so the reac- 
tion cycle ends with a 16-carbon chain. Hydrolysis of the Cyg-acyl-ACP yields a 
palmitic acid and the free ACP. 

In the end, seven malonyl-CoA molecules and one acetyl-CoA yield a palmitate 
(shown here as palmitoyl-CoA): 


Acetyl-CoA + 7 malonyl-CoA~ + 14 NADPH + 14 H*—> 
palmitoyl-CoA + 7HCO,” + 14 NADP* + 7 CoASH 


The formation of seven malonyl-CoA molecules requires 


7 Acetyl-CoA + 7 HCO,” + 7 ATP —> 
7 malonyl-CoA~ + 7 ADP? + 7 PP + 7 Ht 


Thus, the overall reaction of acetyl-CoA to yield palmitic acid is 


8 Acetyl-CoA + 7 ATP} + 14 NADPH + 7 H*—> 
palmitoyl-CoA + 14 NADP* + 7 CoASH + 7 ADP?~ + 7 P? 


Note: These equations are stoichiometric and are charge balanced. See problem 1 
at the end of the chapter for practice in balancing these equations. 


Eukaryotes Build Fatty Acids on Megasynthase Complexes 


The multiple enzyme domains of eukaryotic fatty acyl synthases are arrayed on large 
protein structures termed megasynthases. The individual enzyme domains of these 
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FIGURE 24.9 Organization of enzyme functions on two eukaryotic fatty acid synthases. (left) Fungal FAS is a 
closed barrel 260 A high and 230 A wide. (right) Mammalian (pig) FAS is an asymmetric X-shape 210 A high, 
180 A wide, and 90 A deep.The arrangement of functional domains along the FAS polypeptides is shown at 
the bottom of the figure. KS domains form dimers in both structures. KR domains form dimers in the fungal 
enzyme, whereas ER and DH domains form dimers in the mammalian complex. 
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structures in all eukaryotes are homologous to the corresponding small, discrete en- 
zymes of bacterial FAS pathways. Remarkably, however, lower eukaryotes such as 
fungi and higher eukaryotes such as mammals have evolved entirely different 
megasynthase architectures for fatty acid synthesis. Mammalian homodimeric FAS 
has a flattened X-shape, whereas the fungal dodecameric FAS is a large, closed bar- 
rel, with two reaction chambers separated by equatorial stabilizing struts (Figure 
24.9). In the fungal structure, the six a-subunits form a central ring that is a “trimer 
of dimers” (Figure 24.10a,b). Each a-subunit contributes an extended a-helical seg- 
ment to the center of the structure. Pairs of these helices form three coiled-coil 
struts anchored by a six-helix bundle in the center of the barrel. Each a-subunit 


(a) 







Upper 
reaction 
chamber 


Lower 
reaction 
chamber 


(c) 


Central 
anchor 


Peripheral 
anchor 






(b) Central wheel 


: 90° 


r-— KR dimer — 





Interchamber 
opening 





L— KS dimer — 


FIGURE 24.10 (a) FAS from S. cerevisiae possesses two trimeric reaction chambers 
separated by equatorial stabilizing struts. ACP domains are located at the equa- 
torial base of each reaction chamber, close to the catalytic site of KS. (b) The 
equatorial base of this structure is an ag trimer of dimers, with alternating KS and 
KR domains. The central stabilizing struts are a-helical extensions of the function- 
al domains arranged around the outside of the ring. A central six-helix bundle 
stabilizes the structure. (c) The ACP domains (red) are tethered on flexible linkers 
(yellow) so that they can move from one active site to the next in the catalytic 
cycle. Comparison of the ACP domains of S. cerevisiae and E. coli reveals that the 
ACP domains probably extend the acyl-phosphopantetheine group to active 
sites but then retract the acyl group into a hydrophobic cleft while moving from 
one site to the next (pdb = 2UV8; images courtesy of Marc Leibundgut and 
Nenad Ban, ETH [Zurich]). 
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contains KR and KS domains. Three KR and three KS active sites are oriented to- 
ward the upper reaction chamber, and three of each face the lower chamber. The 
B-subunit trimers form rounded caps over the upper and lower reaction chambers. 
Each chamber contains three pores that allow substrates (acetyl-CoA and malonyl- 
CoA) to diffuse in and palmitoyl-CoA to exit. On each end of the structure, the ac- 
tive sites of the four -subunit enzyme domains (see Figure 24.9) are oriented to- 
ward the interior of the reaction chamber. Three ACP domains in each chamber 
shuttle growing acyl chains from site to site during the catalytic cycle. Each ACP is 
tethered by two flexible linker peptides, which facilitate its site-to-site movement 
(Figure 24.10c). The phosphopantetheine arm on each ACP can extend outward to 
reach into active sites or may retract to insert its acyl chain in a protective hy- 
drophobic cavity during intersite transport. 

The homodimeric mammalian FAS contains all six functional enzyme domains 
on each subunit (Figures 24.9 and 24.11). In the X-shaped dimer, three of the 
domains—including KS, ER, and DH—form dimeric structures, whereas the KR 
and MAT domains are separated and lie near the ends of the extended “arms.” The 
arms form reaction chambers on either side of the structure. The flexible ACP do- 
mains do not appear in this structure (probably because they are not fixed in any 
one position in the crystals used for the structural studies). However, since it follows 
the KR domain in the polypeptide sequence, the ACP domain probably lies at the 
end of each KR arm, where it can rotate to interact with the adjacent active sites. 

In both the fungal and the mammalian FAS structures, the close association of 
enzymic domains within one large complex permits efficient transfer of intermedi- 
ates from one active site to the next. In addition, the presence of all these enzyme 
domains on one or two polypeptides allows the cell to coordinate synthesis of all en- 
zymes needed for fatty acid synthesis. 





FIGURE 24.11 Structural studies reveal that mammalian FAS homodimer is X-shaped. The ACP domains are 
probably located adjacent to the KR domains at the ends of the arms (pdb id = 2VZ8; image courtesy of Marc 
Leibundgut and Nenad Ban, ETH [Zurich]). 


Cis Fatty Acids May Undergo Elongation and Unsaturation 


Additional Elongation As seen already, palmitate is the primary product of the 
fatty acid synthase. Cells synthesize many other fatty acids. Shorter chains are eas- 
ily made if the chain is released before reaching 16 carbons in length. Longer 
chains are made through special elongation reactions, which occur both in the 
mitochondria and at the surface of the endoplasmic reticulum (ER). The ER re- 
actions are actually quite similar to those we have just discussed: addition of two- 
carbon units at the carboxyl end of the chain by means of oxidative decarboxyla- 
tions involving malonyl-CoA. As was the case for the fatty acid synthase, this 
decarboxylation provides the thermodynamic driving force for the condensation 
reaction. The mitochondrial reactions involve addition (and subsequent reduc- 
tion) of acetyl units. These reactions (Figure 24.12) are essentially a reversal of 
fatty acid oxidation, with the exception that NADPH is utilized in the saturation 
of the double bond, instead of FADHo. 


Introduction of a Single cis Double Bond Both prokaryotes and eukaryotes are 
capable of introducing a single cis double bond in a newly synthesized fatty acid. 
Bacteria such as E. coli carry out this process in an Os-independent pathway, 
whereas eukaryotes have adopted an Oo-dependent pathway. There is a fundamen- 
tal chemical difference between the two. The O9-dependent reaction can occur any- 
where in the fatty acid chain, with no (additional) need to activate the desired bond 
toward dehydrogenation. However, in the absence of Ov, some other means must 
be found to activate the bond in question. Thus, in the bacterial reaction, dehy- 
drogenation occurs while the bond of interest is still near the B-carbonyl or 
B-hydroxy group and the thioester group at the end of the chain. 
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FIGURE 24.13 Double bonds are introduced into the 
growing fatty acid chain in E. coli by specific dehydrases. 
Palmitoleoyl-ACP is synthesized by a sequence of reac- 
tions involving four rounds of chain elongation, fol- 
lowed by double bond insertion by B-hydroxydecanoy! 
thioester dehydrase and three additional elongation 
steps. Another elongation cycle produces cis-vaccenic 
acid. 
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In E. coli, the biosynthesis of a monounsaturated fatty acid begins with four normal 
cycles of elongation to form a ten-carbon intermediate, 6-hydroxydecanoyl-ACP (Fig- 
ure 24.13). At this point, B-hydroxydecanoyl thioester dehydrase forms a double bond 
By to the thioester and in the cis configuration. This is followed by three rounds of the 
normal elongation reactions to form palmitoleoyl-ACP. Elongation may terminate at this 
point or may be followed by additional biosynthetic events. The principal unsaturated 
fatty acid in E. coli, cis-vaccenic acid, is formed by an additional elongation step, using 
palmitoleoyl-ACP as a substrate. 


Unsaturation Reactions Occur in Eukaryotes in the Middle 
of an Aliphatic Chain 


The addition of double bonds to fatty acids in eukaryotes does not occur until the 
fatty acyl chain has reached its full length (usually 16 to 18 carbons). Dehydro- 
genation of stearoyl-CoA occurs in the middle of the chain, despite the absence of 
any useful functional group on the chain to facilitate activation: 


CH;— (CHo) 1s CO—SCoA —> CH;— (CH) ;CH=CH (CHo) ;CO—SCoA 


This impressive reaction is catalyzed by stearoyl-CoA desaturase, a 53-kD enzyme 
containing a nonheme iron center. NADH and Osx are required, as are two other 
proteins: cytochrome b; reductase (a 43-kD flavoprotein) and cytochrome b; 
(16.7 kD). All three proteins are associated with the ER membrane. Cytochrome 
b; reductase transfers a pair of electrons from NADH through FAD to cyto- 
chrome b; (Figure 24.14). Oxidation of reduced cytochrome b; is coupled to re- 
duction of nonheme Fe** to Fe?* in the desaturase. The Fe** accepts a pair of 
electrons (one at a time in a cycle) from cytochrome b; and creates a cis double 
bond at the 9,10-position of the stearoyl-CoA substrate. Oy is the terminal elec- 
tron acceptor in this fatty acyl desaturation cycle. Note that two water molecules 
are made, which means that four electrons are transferred overall. Two of these 
come through the reaction sequence from NADH, and two come from the fatty 
acyl substrate that is being dehydrogenated. 


The Unsaturation Reaction May Be Followed by Chain Elongation 


Additional chain elongation can occur following this single desaturation reac- 
tion. The oleoyl-CoA produced can be elongated by two carbons to form a 20:1 
cis-A!! fatty acyl-CoA. If the starting fatty acid is palmitate, reactions similar to 
the preceding scheme yield palmitoleoyl-CoA (16:1 cis-A°), which subsequently 
can be elongated to yield cis-vaccenic acid (18:1 cis-Al). Similarly, Cig and Cj, 
fatty acids can be elongated to yield Cə and Co4 fatty acids, such as are often 
found in sphingolipids. 








2 HO 
+ 
tie 
2 Cytochrome b; Desaturase CH; — (CHo); — C=C— (CH») ,C— SCoA 
(Fe**) (Reduced) (Fe**) Oleoyl-CoA 
O 
l 
2 Cytochrome b; Desaturase CH; — (CHg) 4g — C—SCoA 
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FIGURE 24.14 The conversion of stearoyl-CoA to oleoyl-CoA in eukaryotes is catalyzed by stearoyl-CoA desat- 
urase in a reaction sequence that also involves cytochrome bs; and cytochrome b; reductase. Two electrons are 
passed from NADH through the chain of reactions as shown, and two electrons are derived from the fatty acyl 
substrate. 


Mammals Cannot Synthesize Most Polyunsaturated Fatty Acids 


Organisms differ with respect to formation, processing, and utilization of polyun- 
saturated fatty acids. E. coli, for example, does not have any polyunsaturated fatty 
acids. Eukaryotes do synthesize a variety of polyunsaturated fatty acids, certain or- 
ganisms more than others. For example, plants manufacture double bonds be- 
tween the A® and the methyl end of the chain, but mammals cannot. Plants read- 
ily desaturate oleic acid at the 12-position (to give linoleic acid) or at both the 
12- and 15-positions (producing linolenic acid). Mammals require polyunsatu- 
rated fatty acids but must acquire them in their diet. As such, these fatty acids are 
referred to as essential fatty acids. On the other hand, mammals can introduce 
double bonds between the double bond at the 8- or 9-position and the carboxyl 
group. Enzyme complexes in the ER desaturate the 5-position, provided a double 
bond exists at the 8-position, and form a double bond at the 6-position if one al- 
ready exists at the 9-position. Thus, oleate can be unsaturated at the 6,7-position 
to give an 18:2 cis-A®,A® fatty acid. 


Arachidonic Acid Is Synthesized from Linoleic Acid by Mammals 


Mammals can add additional double bonds to unsaturated fatty acids in their diets. 
Their ability to make arachidonic acid from linoleic acid is one example (Figure 
24.15). This fatty acid is the precursor for prostaglandins and other biologically ac- 
tive derivatives such as leukotrienes. Synthesis involves formation of a linoleoyl es- 
ter of CoA from dietary linoleic acid, followed by introduction of a double bond at 
the 6-position. The triply unsaturated product is then elongated (by malonyl-CoA 
with a decarboxylation step) to yield a 20-carbon fatty acid with double bonds at the 
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FIGURE 24.15 Arachidonic acid is synthesized from 
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linoleic acid in eukaryotes. This is the means by which 


animals synthesize fatty acids with double bonds a 
positions other than C-9. 


t 
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HUMAN BIOCHEMISTRY 


w3 and w6—Essential Fatty Acids with Many Functions 


Linoleic acid and a-linolenic acid are termed essential fatty acids be- 
cause animals cannot synthesize them and must acquire them in 
their diet. Linoleic acid is the precursor of arachidonic acid, and 
both of these are referred to as w6 fatty acids because, counting 
from the end (omega, w) carbon of the chain, the first double 
bond is at the sixth position (see figure). Similarly, a-linolenic acid 
is the precursor of eicosapentaenoic acid and docosahexaenoic 
acid (DHA), and these three are termed @3 fatty acids. Vegetable 
oils are rich in linoleic acid and thus satisfy the body’s w6 dietary 
requirements, whereas fish oils (for example, cod, herring, men- 
haden, and salmon) are rich in w3 fatty acids. 

The 6 fatty acids are precursors of prostaglandins, thrombox- 
anes, and leukotrienes (see Section 24.3). The w3 fatty acids have 


beneficial effects in a variety of organs and biological processes, in- 
cluding growth regulation, platelet activation, and lipoprotein 
metabolism. The w3 fats are generally cardioprotective, anti- 
inflammatory, and anticarcinogenic. 

Interestingly, especially high levels of DHA have been found in 
rod cell membranes in animal retina and in neural tissue. DHA is 
approximately 22% of total fatty acids in animal retina and 35% to 
40% of the fatty acids in retinal phosphatidylethanolamine. DHA 
supports neural and visual development, in part because it is a pre- 
cursor for eicosanoids that regulate numerous cell and organ func- 
tions. Infants can synthesize DHA and other polyunsaturated fatty 
acids, but the rates of synthesis are low. Strong evidence exists for 
the importance of these fatty acids in infant nutrition. 
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COOH 
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COOH 








a-Linoleic acid (18:3 w3) 


<<— 








COOH 
) Eicosapentaenoic acid (EPA; 20:5 w3) 
> The “w” numbering system, where the position of the 
first double bond relative to the methyl (w) group is indi- COOH 








cated (red box), is useful because the w double bond posi- 


tion is retained during chain elongation and desaturation. Docosahexaenoic acid (DHA; 22:6 w3) 











8-, 11-, and 14-positions. A second desaturation reaction at the 5-position, followed 
by a reverse acyl-CoA synthetase reaction (see Chapter 23), liberates the product, a 
20-carbon fatty acid with double bonds at the 5-, 8-, 11-, and 14-positions. 


Regulatory Control of Fatty Acid Metabolism Is an Interplay 
of Allosteric Modifiers and Phosphorylation-Dephosphorylation Cycles 


The control and regulation of fatty acid synthesis is intimately related to regulation 
of fatty acid breakdown, glycolysis, and the TCA cycle. Acetyl-CoA is an important 
intermediate metabolite in all these processes. In these terms, it is easy to appreci- 
ate the interlocking relationships in Figure 24.16. Malonyl-CoA can act to prevent 
fatty acyl-CoA derivatives from entering the mitochondria by inhibiting the carni- 
tine acyltransferase of the outer mitochondrial membrane that initiates this trans- 
port. In this way, when fatty acid synthesis is turned on (as signaled by higher lev- 
els of malonyl-CoA), -oxidation is inhibited. As we pointed out earlier, citrate is 
an important allosteric activator of acetyl-CoA carboxylase, and fatty acyl-CoAs are 
inhibitors. The degree of inhibition is proportional to the chain length of the fatty 
acyl-CoA; longer chains show a higher affinity for the allosteric inhibition site on 
acetyl-CoA carboxylase. Palmitoyl-CoA, stearoyl-CoA, and arachidyl-CoA are the 
most potent inhibitors of the carboxylase. 
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Hormonal Signals Regulate ACC and Fatty Acid Biosynthesis 


As described earlier, citrate activation and palmitoyl-CoA inhibition of acetyl-CoA 
carboxylase are strongly dependent on the phosphorylation state of the enzyme. 
This provides a crucial connection to hormonal regulation. Many of the enzymes 
that act to phosphorylate acetyl-CoA carboxylase (see Figure 24.4) are controlled by 
hormonal signals. Glucagon is a good example (Figure 24.17). As noted in Chapter 
22, glucagon binding to membrane receptors activates an intracellular cascade in- 
volving activation of adenylyl cyclase. Cyclic AMP produced by the cyclase activates 
a protein kinase, which then phosphorylates acetyl-CoA carboxylase. Unless citrate 
levels are high, phosphorylation causes inhibition of fatty acid biosynthesis. The car- 
boxylase (and fatty acid synthesis) can be reactivated by a specific phosphatase, 
which dephosphorylates the carboxylase. Also indicated in Figure 24.17 is the 
simultaneous activation by glucagon of triacylglycerol lipases, which hydrolyze tri- 
acylglycerols, releasing fatty acids for B-oxidation. Both the inactivation of acetyl- 
CoA carboxylase and the activation of triacylglycerol lipase are counteracted by 
insulin, whose receptor acts to stimulate a phosphodiesterase that converts cAMP 
to AMP. 


24:25 How Are Complex Lipids Synthesized? 


Complex lipids consist of backbone structures to which fatty acids are covalently 
bound. Principal classes include the glycerolipids, for which glycerol is the back- 
bone, and sphingolipids, which are built on a sphingosine backbone. The two 
major classes of glycerolipids are glycerophospholipids and triacylglycerols. The 
phospholipids, which include both glycerophospholipids and sphingomyelins, are 
crucial components of membrane structure. They are also precursors of hormones 
such as the eicosanoids (for example, prostaglandins) and signal molecules (such as 
the breakdown products of phosphatidylinositol) . 

Different organisms possess greatly different complements of lipids and therefore 
invoke somewhat different lipid biosynthetic pathways. For example, sphingolipids 
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FIGURE 24.16 Regulation of fatty acid synthesis and 
fatty acid oxidation are coupled as shown. Malonyl-CoA, 
produced during fatty acid synthesis, inhibits the uptake 
of fatty acylcarnitine (and thus fatty acid oxidation) by 
mitochondria.When fatty acyl-CoA levels rise, fatty acid 
synthesis is inhibited and fatty acid oxidation activity 
increases. Rising citrate levels (which reflect an abun- 
dance of acetyl-CoA) similarly signal the initiation of 
fatty acid synthesis. 
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FIGURE 24.17 Hormonal signals regulate fatty acid syn- 
thesis, primarily through actions on acetyl-CoA carboxy- 
lase. Availability of fatty acids also depends upon hor- 
monal activation of triacylglycerol lipase. 
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and triacylglycerols are produced only in eukaryotes. In contrast, bacteria usually 
have rather simple lipid compositions. Phosphatidylethanolamine accounts for 
at least 75% of the phospholipids in Æ. coli, with phosphatidylglycerol and cardi- 
olipin accounting for most of the rest. E. coli membranes possess no phosphatidyl- 
choline, phosphatidylinositol, sphingolipids, or cholesterol. On the other hand, 
some bacteria (such as Pseudomonas) can synthesize phosphatidylcholine, for exam- 
ple. In this section and the one following, we consider some of the pathways for the 
synthesis of glycerolipids, sphingolipids, and the eicosanoids, which are derived 
from phospholipids. 


Glycerolipids Are Synthesized by Phosphorylation and Acylation 
of Glycerol 


A common pathway operates in nearly all organisms for the synthesis of phospha- 
tidic acid, the precursor to other glycerolipids. Glycerokinase catalyzes the phos- 
phorylation of glycerol to form glycerol-3-phosphate, which is then acylated at both 
the l- and 2-positions to yield phosphatidic acid (Figure 24.18). The first acylation, 
at position 1, is catalyzed by glycerol-3-phosphate acyltransferase, an enzyme that in 
most organisms is specific for saturated fatty acyl groups. Eukaryotic systems can 
also utilize dihydroxyacetone phosphate as a starting point for synthesis of phos- 
phatidic acid (Figure 24.18). Again a specific acyltransferase adds the first acyl chain, 
followed by reduction of the backbone keto group by acyldihydroxyacetone phosphate 
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FIGURE 24.18 Synthesis of glycerolipids in eukaryotes begins with the formation of phosphatidic (0J 
acid, which may be formed from dihydroxyacetone phosphate or glycerol as shown. Phosphatidic acid 


reductase, using NADPH as the reductant. Alternatively, dihydroxyacetone phosphate 
can be reduced to glycerol-3-phosphate by glycerol-3-phosphate dehydrogenase. 


Eukaryotes Synthesize Glycerolipids from CDP-Diacylglycerol 

or Diacylglycerol 

In eukaryotes, phosphatidic acid is converted directly either to diacylglycerol or to 
cytidine diphosphodiacylglycerol (or simply CDP-diacylglycerol; Figure 24.19). From these 
two precursors, all other glycerophospholipids in eukaryotes are derived. Diacylglyc- 
erol is a precursor for synthesis of triacylglycerol, phosphatidylethanolamine, and 
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phosphatidylcholine. Triacylglycerol is synthesized mainly in adipose tissue, liver, and 
intestines and serves as the principal energy storage molecule in eukaryotes. Triacyl- 
glycerol biosynthesis in liver and adipose tissue occurs via diacylglycerol acyltrans- 
ferase, an enzyme bound to the cytoplasmic face of the ER. A different route is used, 
however, in intestines. Recall (see Figure 23.3) that triacylglycerols from the diet are 


broken down to 2-monoacylglycerols by specific lipases. Acyltransferases then acylate 
2-monoacylglycerol to produce new triacylglycerols (Figure 24.20). 


Phosphatidylethanolamine Is Synthesized from Diacylglycerol 
and CDP-Ethanolamine 


Phosphatidylethanolamine synthesis begins with phosphorylation of ethanolamine 
to form phosphoethanolamine (see Figure 24.19). The next reaction involves trans- 
fer of a cytidylyl group from CTP to form CDP-ethanolamine and pyrophosphate. As 
always, PP; hydrolysis drives this reaction forward. A specific phosphoethanolamine 
transferase then links phosphoethanolamine to the diacylglycerol backbone. Bio- 
synthesis of phosphatidylcholine is entirely analogous because animals can synthe- 
size it directly. All of the choline utilized in this pathway must be acquired from the 
diet. On the other hand, yeast, certain bacteria, and animal livers can convert 
phosphatidylethanolamine to phosphatidylcholine by methylation reactions involv- 
ing S-adenosylmethionine (see Chapter 25). 


Exchange of Ethanolamine for Serine Converts 
Phosphatidylethanolamine to Phosphatidylserine 


Mammals synthesize phosphatidylserine (PS) in a calcium ion—dependent reaction 
involving aminoalcohol exchange (Figure 24.21). The enzyme catalyzing this reaction 
is associated with the ER and will accept phosphatidylethanolamine (PE) and other 
phospholipid substrates. A mitochondrial PS decarboxylase can subsequently convert 
PS to PE. No other pathway converting serine to ethanolamine has been found. 


Eukaryotes Synthesize Other Phospholipids Via CDP-Diacylglycerol 


Eukaryotes also use CDP-diacylglycerol, derived from phosphatidic acid, as a precur- 
sor for several other important phospholipids, including phosphatidylinositol (PI), 
phosphatidylglycerol (PG), and cardiolipin (Figure 24.22). PI accounts for only about 
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FIGURE 24.21 The interconversion of phosphatidylethanolamine and phosphatidylserine in mammals. 


24.2 How Are Complex Lipids Synthesized? 741 


Lipases 
o CH,OH 
Ro — ! —o— l —H 
CH OH 
2-Monoacylglycerol 
(0) 


Monoacylglycerol acyltransferase 
CoASH 


CH, OH 
Diacylglycerol 
fe) 
Il 
R, C —SCoA 


Diacylglycerol acyltransferase 
CoASH 


Triacylglycerol 


FIGURE 24.20 Triacylglycerols are formed primarily by 
the action of acyltransferases on monoacylglycerol and 
diacylglycerol. 
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FIGURE 24.22 CDP-diacylglycerol is a precursor of phosphatidylinositol, phosphatidylglycerol, and cardiolipin in 
eukaryotes. 


2% to 8% of the lipids in most animal membranes, but breakdown products of PI, in- 
cluding inositol-1,4,5-trisphosphate and diacylglycerol, are second messengers in a 
vast array of cellular signaling processes. 


Dihydroxyacetone Phosphate Is a Precursor to the Plasmalogens 


Certain glycerophospholipids possess alkyl or alkenyl ether groups at the 1-position 
in place of an acyl ester group. These glyceroether phospholipids are synthesized 
from dihydroxyacetone phosphate (Figure 24.23). Acylation of dihydroxyacetone 
phosphate (DHAP) is followed by an exchange reaction, in which the acyl group is 
removed as a carboxylic acid and a long-chain alcohol adds to the 1-position. This 
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FIGURE 24.23 Biosynthesis of plasmalogens in animals. (1) Acylation at C-1 is followed by (2) exchange of the 
acyl group for a long-chain alcohol. (3) Reduction of the keto group at C-2 is followed by (4 and 5) transferase 
reactions, which add an acyl group at C-2 and a polar head-group moiety (as shown here for phospho- 
ethanolamine), and a (6) desaturase reaction that forms a double bond in the alkyl chain. The first two 
enzymes are of cytoplasmic origin, and the last transferase is located at the endoplasmic reticulum. 





et tial 
R,;— C=C — O — CH, 
O 
| | 
R,— C—O —CH O 
| + 
CH, —O — P — O — CH CHoNH; 
| 
o- 
Plasmalogen 
1-Alkyl-2-acylglycero- NAD? + 2 H,O 
phosphoethanolamine 
dorattas @ |—-naADH + O, + Ht 


R,CH,CH;— O — CH, 





o 
Il 


R,—C—O—C—H © 
Il + 
CH, — O —P — O — CHCHNH; 


o- 





1-Alkyl-2-acylglycero-3-phosphoethanolamine 


CDP 
CDP-ethanolamine 


transferase 
6 CDP- ethanolamine 


CoASH I | 


an a 








1-Alkylglycero-3-phosphate 


1-Alkylglycerophosphate | 


acyltransferase o- 
© 1-Alkyl-2-acylglycero-3-phosphate 


744 Chapter 24 Lipid Biosynthesis 


FIGURE 24.24 Platelet-activating factor, formed from 
1-alkyl-2-lysophosphatidylcholine by acetylation at C-2, 
is degraded by the action of acetylhydrolase. 


long-chain alcohol is derived from the corresponding acyl-CoA by means of an acyl- 
CoA reductase reaction involving oxidation of two molecules of NADH. The 2-keto 
group of the DHAP backbone is then reduced to an alcohol, followed by acylation. 
The resulting 1-alkyl-2-acylglycero-3-phosphate can react in a manner similar to 
phosphatidic acid to produce ether analogs of phosphatidylcholine, phosphatidyl- 
ethanolamine, and so forth (Figure 24.23). In addition, specific desaturase enzymes 
associated with the ER can desaturate the alkyl ether chains of these lipids as shown. 
The products, which contain @,6-unsaturated ether-linked chains at the C-1 posi- 
tion, are plasmalogens; they are abundant in cardiac tissue and in the central 
nervous system. The desaturases catalyzing these reactions are distinct from but sim- 
ilar to those that introduce unsaturations in fatty acyl-CoAs. 


Platelet-Activating Factor Is Formed by Acetylation 

of 1-Alkyl-2-Lysophosphatidylcholine 

A particularly interesting ether phospholipid with unusual physiological prop- 
erties, 1l-alkyl-2-acetylglycerophosphocholine, also known as platelet-activating 
factor, possesses an alkyl ether at C-1 and an acetyl group at C-2 (Figure 24.24). 
The very short chain at C-2 makes this molecule much more water soluble than 
typical glycerolipids. Platelet-activating factor displays a dramatic ability to dilate 
blood vessels (and thus reduce blood pressure in hypertensive animals) and to 
aggregate platelets. 


Sphingolipid Biosynthesis Begins with Condensation of Serine 
and Palmitoyl-CoA 


Sphingolipids, ubiquitous components of eukaryotic cell membranes, are present at 
high levels in neural tissues. The myelin sheath that insulates nerve axons is particu- 
larly rich in sphingomyelin and other related lipids. Prokaryotic organisms normally 
do not contain sphingolipids. Sphingolipids are built upon sphingosine backbones 
rather than glycerol. The initial reaction, which involves condensation of serine and 
palmitoyl-CoA with release of bicarbonate, is catalyzed by 3-ketosphinganine synthase, 
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a PLP-dependent enzyme (Figure 24.25). Reduction of the ketone product to form 
sphinganine is catalyzed by 3-keto-sphinganine reductase, with NADPH as a reactant. 
In the next step, sphinganine is acylated to form N-acyl sphinganine, which is then 
desaturated to form ceramide. Sphingosine itself does not appear to be an interme- 
diate in this pathway in mammals. 
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FIGURE 24.25 Biosynthesis of sphingolipids in animals 
begins with the 3-ketosphinganine synthase reaction, a 
PLP-dependent condensation of palmitoyl-CoA and 
serine. Subsequent reduction of the keto group, acyla- 
tion, and desaturation (via reduction of an electron 
acceptor, X) form ceramide, the precursor of other 
sphingolipids. 
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FIGURE 24.26 Glycosylceramides (such as galactosyl- 


ceramide), gangliosides, and sphingomyelins are synthe- 


sized from ceramide in animals. 








Ceramide Is the Precursor for Other Sphingolipids and Cerebrosides 


Ceramide is the building block for all other sphingolipids. Sphingomyelin, for 
example, is produced by transfer of phosphocholine from phosphatidylcholine 
(Figure 24.26). Glycosylation of ceramide by sugar nucleotides yields cerebrosides, 
such as galactosylceramide, which makes up about 15% of the lipids of myelin sheath 











CHs CH3 
D CH | H Phosphatidyl- | H 
2 UDP-Gal (CH) 19 — c= C—CHOH choline (CHo) 19 — = C — CHOH 
CH,OH H | H 
R,;—C—N—CH R,;—C—N—C 
A 0. 9 < i | > j 
oO CHOH oO CHo 
OH UDP : 1,2-Diacylglycerol 
UDP- galact es UDP- GI a 
- galacto- - Glu , R 
Om syltransferase Sphingomyelin 
UDP- Gal Eom P = 
B-D-Galactosylceramide UDP 
UDP O 
CHo 
CHo 
+ 
HO HC — N— CH; 
H CH; 














B-D-Galactosyl-(1—>4)-6-D-glucosylceramide 


N-Acetylgalactosaminyltransferase 


UDP 


CH,OH 
ail HA oN oak Ọ 
H a 0 OH H H a =N= i —R; 
HNL H OH He —OH 
H OH i 


B-D-N-Acetylgalactosamine-(1—> 4)- 
B-D-galactosyl-(1— 4)-B-D-glucosylceramide 


cup EE 
Sialytransferase 
CMP 


CHOH 
CHOH O 
le ok R 
Oo OH H 
H HC—N—C—R, 
| H 
H QH HC —OH 
Ro 


CH; —C =O Ganglioside GM, 


24.3 How Are Eicosanoids Synthesized, and What Are Their Functions? 747 
structures. Cerebrosides that contain one or more sialic acid (N-acetylneuraminic 

acid) moieties are called gangliosides. Several dozen gangliosides have been charac- 

terized, and the general form of the biosynthetic pathway is illustrated for the 

case of ganglioside GM» (Figure 24.26). Sugar units are added to the developing 

ganglioside from nucleotide derivatives, including UDP-—N-acetylglucosamine, 

UDP-galactose, and UDP-glucose. 


24:30 How Are Eicosanoids Synthesized, and What 
Are Their Functions? 


Eicosanoids, so named because they are all derived from 20-carbon fatty acids, 
are ubiquitous breakdown products of phospholipids. In response to appropriate 
stimuli, cells activate the breakdown of selected phospholipids (Figure 24.27). 
Phospholipase Ay (see Chapter 8) selectively cleaves fatty acids from the C-2 po- 
sition of phospholipids. Often these are unsaturated fatty acids, among which is 
arachidonic acid. Arachidonic acid may also be released from phospholipids by 
the combined actions of phospholipase C (which yields diacylglycerols) and dia- 
cylglycerol lipase (which releases fatty acids). 


Eicosanoids Are Local Hormones 


Animal cells can modify arachidonic acid and other polyunsaturated fatty acids, in 
processes often involving cyclization and oxygenation, to produce so-called local hor- 
mones that (1) exert their effects at very low concentrations and (2) usually act near 
their sites of synthesis. These substances include the prostaglandins (PG) (Figure 
24.27) as well as thromboxanes (Tx), leukotrienes, and other hydroxyeicosanoic acids. 
Thromboxanes, discovered in blood platelets (thrombocytes), are cyclic ethers (TxB» is 
actually a hemiacetal; see Figure 24.27) with a hydroxyl group at C-15. 


Prostaglandins Are Formed from Arachidonate by Oxidation 
and Cyclization 


All prostaglandins are cyclopentanoic acids derived from arachidonic acid. The 
biosynthesis of prostaglandins is initiated by an enzyme associated with the ER, called 
prostaglandin endoperoxide H synthase (PGHS), also known as cyclooxygenase 
(COX). The enzyme catalyzes simultaneous oxidation and cyclization of arachidonic 
acid. The enzyme is viewed as having two distinct activities, COX and peroxidase 
(POX), as shown in Figure 24.28. 


A DEEPER LOOK 





The Discovery of Prostaglandins 





The name prostaglandin was given to this class of compounds by Ulf 
von Euler, their discoverer, in Sweden in the 1930s. He extracted 
fluids containing these components from human semen. Because 
he thought they originated in the prostate gland, he named them 
prostaglandins. Actually, they were synthesized in the seminal vesi- 
cles, and it is now known that similar substances are synthesized in 
most animal tissues (both male and female). Von Euler observed 
that injection of these substances into animals caused smooth mus- 
cle contraction and dramatic lowering of blood pressure. 

Von Euler (and others) soon found that it is difficult to analyze 
and characterize these obviously interesting compounds because 
they are present at extremely low levels. Prostaglandin E a, or 
PGEgq, is present in human serum at a level of less than 10° '4M! 
In addition, they often have half-lives of only 30 seconds to a few 


minutes, not lasting long enough to be easily identified. Moreover, 
most animal tissues upon dissection and homogenization rapidly 
synthesize and degrade a variety of these substances, so the 
amounts obtained in isolation procedures are extremely sensitive 
to the methods used and highly variable even when procedures 
are carefully controlled. Sune Bergström, Bengt Samuelsson, and 
their colleagues described the first structural determinations of 
prostaglandins in the late 1950s. In the early 1960s, dramatic ad- 
vances in laboratory techniques, such as NMR spectroscopy and 
mass spectrometry, made further characterization possible. Von 
Euler received the Nobel Prize for Physiology or Medicine in 1970, 
and Bergström, Samuelsson, and John Vane shared the Nobel for 
Physiology or Medicine in 1982. 
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FIGURE 24.27 Arachidonic acid, derived from breakdown of phospholipids (PL), is the precursor of prostaglandins, 
thromboxanes, and leukotrienes. The letters used to name the prostaglandins are assigned on the basis of similar- 
ities in structure and physical properties. The class denoted PGE, for example, consists of B-hydroxyketones that 
are soluble in ether, whereas PGF denotes 1,3-diols that are soluble in phosphate buffer. PGA denotes prosta- 
glandins possessing @,B-unsaturated ketones. The number following the letters refers to the number of carbon- 
carbon double bonds. Thus, PGE contains two double bonds. 


A Variety of Stimuli Trigger Arachidonate Release 
and Eicosanoid Synthesis 


The release of arachidonate and the synthesis or interconversion of eicosanoids can 
be initiated by a variety of stimuli, including histamine, hormones such as epineph- 
rine and bradykinin, proteases such as thrombin, and even serum albumin. An im- 
portant mechanism of arachidonate release and eicosanoid synthesis involves tissue 
injury and inflammation. When tissue damage or injury occurs, special inflammatory 
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FIGURE 24.28 Prostaglandin endoperoxide H synthase 
(PGHS), the enzyme that converts arachidonic acid to 
prostaglandin PGH, possesses two distinct activities: 
cyclooxygenase (COX) and a glutathione-dependent 
hydroperoxidase (POX). The mechanism of the reaction 
begins with hydrogen atom abstraction by a tyrosine 
radical on the enzyme, followed by rearrangement to 
cyclize and incorporate two oxygen molecules. Reduc- 
tion of the peroxide at C15 completes the reaction. COX 
PGH, is the site of action of aspirin and other analgesic agents. 








N 


cells, monocytes and neutrophils, invade the injured tissue and interact with the resi- 
dent cells (such as smooth muscle cells and fibroblasts). This interaction typically leads 
to arachidonate release and eicosanoid synthesis. Examples of tissue injury in which 
eicosanoid synthesis has been characterized include heart attack (myocardial infarc- 
tion), rheumatoid arthritis, and ulcerative colitis. 


“Take Two Aspirin and...” Inhibit Your Prostaglandin Synthesis 


In 1971, biochemist John Vane was the first to show that aspirin (acetylsalicylate; Fig- 
ure 24.29) exerts most of its effects by inhibiting the biosynthesis of prostaglandins. 
Its site of action is PGHS. COX activity is destroyed when aspirin O-acetylates Ser”? 


(a) (b) 
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FIGURE 24.29 (a) The structures of several common analgesic agents. Acetaminophen is marketed under the trade 
name Tylenol. Ibuprofen is sold as Motrin, Nuprin, and Advil. (b) Acetylsalicylate (aspirin) inhibits the COX activity of 
endoperoxide synthase via acetylation (covalent modification) of Ser”. 
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A DEEPER LOOK 





The Molecular Basis for the Action of Nonsteroidal Anti-inflammatory Drugs 


Prostaglandins are potent mediators of inflammation. The first and 
committed step in the production of prostaglandins from arachi- 
donic acid is the bis-oxygenation of arachidonate to prostaglandin 
PGG». This is followed by reduction to PGHg in a peroxidase reac- 
tion. Both these reactions are catalyzed by PGHS or COX. This en- 
zyme is inhibited by the family of drugs known as nonsteroidal anti- 
inflammatory drugs, or NSAIDs. Aspirin, ibuprofen, flurbiprofen, 
and acetaminophen (trade name Tylenol) are all NSAIDs. 


There are two isoforms of COX in animals: COX-1, which car- 
ries out normal, physiological production of prostaglandins, and 
COX-2, which is induced by cytokines, mitogens, and endotoxins 
in inflammatory cells and is responsible for the production of 
prostaglandins in inflammation. 

The enzyme structure shown in panel a is that of residues 33 to 
583 of COX-1 from sheep, inactivated by ibuprofen (cyan). These 
551 residues comprise three distinct domains. The first of these, 








(a) pdb id = 1EQG 


(b) Superposition of pdb id = 1EQG and 1CX2 








on the enzyme. From this you may begin to infer something about how prosta- 
glandins (and aspirin) function. Prostaglandins are known to enhance inflammation 
in animal tissues. Aspirin exerts its powerful anti-inflammatory effect by inhibiting 
this first step in their synthesis. Aspirin does not have any measurable effect on the 
peroxidase activity of the synthase. Other nonsteroidal anti-inflammatory agents, 
such as ibuprofen (Figure 24.29) and phenylbutazone, inhibit COX by competing at 
the active site with arachidonate or with the peroxyacid intermediate (PGGg, as in 
Figure 24.28). See A Deeper Look above. 
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The most prevalent steroid in animal cells is cholesterol (Figure 24.30). Plants con- 
tain no cholesterol, but they do contain other steroids very similar to cholesterol in 
structure (see page 236). Cholesterol serves as a crucial component of cell mem- 
branes and as a precursor to bile acids (such as cholate, glycocholate, taurocholate) 
and steroid hormones (such as testosterone, estradiol, progesterone). Also, vitamin 
D; is derived from 7-dehydrocholesterol, the immediate precursor of cholesterol. Liver 
is the primary site of cholesterol biosynthesis. 


residues 33 to 72, is a small, compact module that is similar to epi- 
dermal growth factor. The second domain, composed of residues 
73 to 116, forms a right-handed spiral of four a-helical segments. 
These a-helical segments form a membrane-binding motif. The 
helical segments are amphipathic, with most of the hydrophobic 
residues facing away from the protein, where they can interact 
with a lipid bilayer. The third domain of the COX enzyme, the cat- 
alytic domain, is a globular structure that contains both the COX 
and the peroxidase active sites. 

The COX active site lies at the end of a long, narrow, hydro- 
phobic tunnel or channel. Three of the a-helices of the membrane- 
binding domain lie at the entrance to this tunnel. The walls of 
the tunnel are defined by four a-helices, formed by residues 
106 to 123, 325 to 353, 379 to 384, and 520 to 535. 

The COX-1 structure shown in panel a has a molecule of ibupro- 
fen bound in the tunnel. Deep in the tunnel, at the far end, lies 
Tyr?%, a catalytically important residue. Heme-dependent peroxi- 
dase activity is implicated in the formation of a proposed Tyr**? rad- 
ical, which is required for COX activity. Aspirin and other NSAIDs 
block the synthesis of prostaglandins by filling and blocking the tun- 
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nel, preventing the migration of arachidonic acid to Tyr% in the ac- 


tive site at the back of the tunnel. 

Why do the new “COX-2 inhibitors” bind to (and inhibit) COX-2 
but not COX-1? A single amino acid substitution makes all the dif- 
ference. Panel b shows an overlay of COX-1 (1EQG) and COX-2 
(1CX2) structures. COX-2 has a valine (blue) at position 523, which 
leaves room for binding of a Celebrex-like inhibitor (orange). On 
the other hand, COX-1 has bulkier isoleucine (red) at position 523, 
which prevents binding of the inhibitor. 

COX-2 inhibitors were introduced as pain medications in 1997, 
and by 2004 nearly half of the 100 million prescriptions written an- 
nually for NSAIDs in the United States were COX-2 inhibitors. 
However, several COX-2 inhibitors were taken off the U.S. market 
in late 2004 and early 2005, when their use was linked to heart at- 
tacks and strokes in a small percentage of users. Since that time, 
prescriptions for COX-2 inhibitors have dropped by 65%. Inter- 
estingly, although COX-2 inhibitors were originally intended to al- 
leviate pain without the risk of adverse gastrointestinal effects, less 
than 5% of patients that used COX-2 prescriptions at the peak of 
their popularity were at high risk for these adverse effects. 
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FIGURE 24.30 The structure of cholesterol, drawn (a) in 
the traditional planar motif and (b) in a form that more 


Mevalonate Is Synthesized from Acetyl-CoA Via HMG-CoA Synthase a rena stene 
The cholesterol biosynthetic pathway begins in the cytosol with the synthesis of 
mevalonate from acetyl-CoA (Figure 24.31). The first step is the B-ketothiolase— 
catalyzed Claisen condensation of two molecules of acetyl-CoA to form ace toacetyl- 
CoA. In the next reaction, acetyl-CoA and acetoacetyl-CoA join to form 3-hydroxy- 
3-methylglutaryl-CoA, which is abbreviated HMG-CoA. The reaction, a second Claisen 
condensation, is catalyzed by HMG-CoA synthase. The third step in the pathway is 
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O the rate-limiting step in cholesterol biosynthesis. Here, HMG-CoA undergoes two 
NADPH-dependent reductions to produce 3R-mevalonate (Figure 24.32). The reac- 


eee SCoA tion is catalyzed by HMG-CoA reductase, a 97-kD glycoprotein that spans the ER 











Acetyl-CoA membrane with its active site facing the cytosol. As the rate-limiting step, HMG-CoA 
a reductase is the principal site of regulation in cholesterol synthesis. 
G C — SCoA Three different regulatory mechanisms are involved: 
Acetyl-CoA , 1. Phosphorylation by cAMP-dependent protein kinases inactivates the reductase. 
Coast Thiolase This inactivation can be reversed by two specific phosphatases (Figure 24.33). 
i 2. Degradation of HMG-CoA reductase. This enzyme has a half-life of only 
3 hours, and the half-life itself depends on cholesterol levels: High [cholesterol] 
means a short half-life for HMG-CoA reductase. 
O O 3. Gene expression. Cholesterol levels control the amount of mRNA. If [choles- 
Il Il terol] is high, levels of mRNA coding for the reductase are reduced. If [choles- 
CH;— C O C —SCoA terol] is low, more mRNA is made. (Regulation of gene expression is discussed 
a Acetoacetyl-CoA in Chapter 29.) 
| 
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The Long Search for the Route of Cholesterol Biosynthesis 


Heilbron, Kamm, and Owens suggested as early as 1926 that squa- 
lene is a precursor of cholesterol. That same year, H. J. Channon 
demonstrated that animals fed squalene from shark oil produced 
more cholesterol in their tissues. Bloch and Rittenberg showed in 
the 1940s that a significant amount of the carbon in the tetracyclic 
moiety and in the aliphatic side chain of cholesterol was derived 
from acetate. In 1934, Sir Robert Robinson suggested a scheme for 


colleagues at Merck Sharpe & Dohme isolated mevalonic acid 
and also showed that mevalonate was the precursor of isoprene 
units. The search for the remaining details (described in the text) 
made the biosynthesis of cholesterol one of the most enduring 
and challenging bioorganic problems of the 1940s, 1950s, and 
1960s. Even today, several of the enzyme mechanisms remain 
poorly understood. 
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the cyclization of squalene to form cholesterol before the biosyn- 
thetic link between acetate and squalene was understood. Squalene 
is actually a polymer of isoprene units, and Bonner and Arreguin (b) 
suggested in 1949 that three acetate units could join to form five- 
carbon isoprene units (see figure a). 
In 1952, Konrad Bloch and Robert Langdon showed conclu- 
sively that labeled squalene is synthesized rapidly from labeled ace- 
tate and also that cholesterol is derived from squalene. Langdon, a 
graduate student of Bloch’s, performed the critical experiments in 
Bloch’s laboratory at the University of Chicago while Bloch spent 
the summer in Bermuda attempting to demonstrate that radio- 
actively labeled squalene would be converted to cholesterol in Squalene 
shark livers. As Bloch himself admitted, “All I was able to learn was | 









that sharks of manageable length are very difficult to catch and 
their oily livers impossible to slice” (Bloch, 1987). 

In 1953, Bloch, together with the eminent organic chemist 
R. B. Woodward, proposed a new scheme (see figure b) for the cy- 
clization of squalene. (Together with Fyodor Lynen, Bloch re- 
ceived the Nobel Prize in Medicine or Physiology in 1964 for his 
work.) The picture was nearly complete, but one crucial question 
remained: How could isoprene be the intermediate in the trans- 
formation of acetate into squalene? In 1956, Karl Folkers and his 
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à (a) An isoprene unit and a scheme for head-to-tail linking of isoprene units. (b) The cycliza- 
tion of squalene to form lanosterol, as proposed by Bloch and Woodward. 








that the thiolase reaction is more or less reversible but slightly favors the cleavage re- 
action. In the cholesterol synthesis pathway, subsequent reactions, including HMG- 
CoA reductase and the following kinase reactions, pull the thiolase-catalyzed con- 
densation forward. However, in the case of fatty acid synthesis, a succession of eight 
thiolase condensations would be distinctly unfavorable from an energetic perspec- 
tive. Given the necessity of repeated reactions in fatty acid synthesis, it makes better 
energetic sense to use a reaction that is favorable in the desired direction. 


Squalene Is Synthesized from Mevalonate 


The biosynthesis of squalene involves conversion of mevalonate to two key 5-carbon 
intermediates, isopentenyl pyrophosphate and dimethylallyl pyrophosphate, which join to 
yield farnesyl pyrophosphate and then squalene. A series of four reactions converts 
mevalonate to isopentenyl pyrophosphate and then to dimethylallyl pyrophosphate 
(Figure 24.34). The first three steps each consume an ATP, two for the purpose of 
forming a pyrophosphate at the 5-position and the third to drive the decarboxylation 
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FIGURE 24.33 HMG-CoA reductase activity is modulated 
by a cycle of phosphorylation and dephosphorylation. 





HC OH 
-OOC Cc Pg iis 
Nf, CHo Mevalonate 
Mevalonate bd 
kinase 
ADP 
v 
Phosphomevalonate =- 
kinase 
ADP 
Vv 
‘coe OH 
-00C c| (CH0—®® 
Si / 
CH, CHo 5-Pyrophosphomevalonate 
Pyrophosphomevalonate ATP + HO 
decarboxylase 
i ADP + ® + CO, 
HC P20 — OB 
J“ — CHo 
HC A Isopentenyl pyrophosphate 
Isopentenyl 


pyrophosphate isomerase 

















HC CH,O— B® 
` Z 
C=C 
Dimethylallyl pyrophosphate 
HC H yially! pyropnosp. 
C Isopentenyl pyrophosphate 
DD 
v 
A Isopentenyl pyrophosphate 
PP 
Vv H;C p CH,O—®®) 
HC CH ‚C=C 
ZNZ \ 
HC CHo C=C CHo H 
N Z NZ 
C=C Ho H 
HC H Farnesyl pyrophosphate 
NADPH + H* 







NADP} + 2 ®® 


S 
Squalene 


FIGURE 24.34 The conversion of mevalonate to squalene. In the last step, two farnesyl-PP condense to form 


squalene. 
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Statins Lower Serum Cholesterol Levels 


Chemists and biochemists have long sought a means of reducing 
serum cholesterol levels to reduce the risk of heart attack and car- 
diovascular disease. Because HMG-CoA reductase is the rate- 
limiting step in cholesterol biosynthesis, this enzyme is a likely drug 
target. Mevinolin, also known as lovastatin (see accompanying fig- 
ure), was isolated from a strain of Aspergillus terreus and developed 
at Merck Sharpe & Dohme for this purpose. It is now a widely pre- 
scribed cholesterol-lowering drug. Dramatic reductions of serum 
cholesterol are observed at dosages of 20 to 80 mg per day. 


inhibitor of the reductase with a K; of 0.6 nM. Mevinolinic acid is 
thought to behave as a transition-state analog (see Chapter 14) of 
the tetrahedral intermediate formed in the HMG-CoA reductase re- 
action (see figure). 

Derivatives of lovastatin have been found to be even more potent 
in cholesterol-lowering trials. Synvinolin lowers serum cholesterol 
levels at much lower dosages than lovastatin. Lipitor, shown bound 
at the active site of HMG-CoA reductase, is the most-prescribed drug 
in the United States, with annual sales of $9 billion. 
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Lovastatin is administered as an inactive lactone. After oral in- 
gestion, it is hydrolyzed to the active mevinolinic acid, a competitive 
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The structures of (a) (inactive) lovastatin, (active) mevinolinic acid, mevalonate, and (b) Lipitor (atorvastatin). 
(c) HMG-CoA reductase with NADP*, HMG, and CoA. (d) Lipitor bound at the HMG-CoA reductase active site. 





SCoA 


Tetrahedral intermediate 
in HMG-CoA reductase 
mechanism 





Lipitor® (red) bound at the 
active site of HMG-CoA 
reductase (pdb id = 1HWK) 





and double bond formation in the third step. Pyrophosphomevalonate decarboxylase 
phosphorylates the 3-hydroxyl group, and this is followed by trans elimination of 
the phosphate and carboxyl groups to form the double bond in isopentenyl pyro- 
phosphate. Isomerization of the double bond yields the dimethylallyl pyrophosphate. 
Condensation of these two 5-carbon intermediates produces geranyl pyrophosphate; ad- 
dition of another 5-carbon isopentenyl group gives farnesyl pyrophosphate. Both steps 
in the production of farnesyl pyrophosphate occur with release of pyrophosphate, hy- 
drolysis of which drives these reactions forward. Note too that the linkage of isoprene 
units to form farnesyl pyrophosphate occurs in a head-to-tail fashion. This is the gen- 
eral rule in biosynthesis of molecules involving isoprene linkages. The next step—the 
joining of two farnesyl pyrophosphates to produce squalene—is a “tail-to-tail” con- 
densation and represents an important exception to the general rule. 
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FIGURE 24.35 Cholesterol is synthesized from squalene 
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Squalene monooxygenase, an enzyme bound to the ER, converts squalene to 
squalene-2,3-epoxide (Figure 24.35). This reaction employs FAD and NADPH as 
coenzymes and requires Og as well as a cytosolic protein called soluble protein 
activator. A second ER membrane enzyme, 2,3-oxidosqualene lanosterol cyclase, 
catalyzes the second reaction, which involves a succession of 1,2 shifts of hydride 
ions and methyl groups. 


Conversion of Lanosterol to Cholesterol Requires 20 Additional Steps 


Although lanosterol may appear similar to cholesterol in structure, another 20 steps 
are required to convert lanosterol to cholesterol (Figure 24.35). The enzymes 
responsible for this are all associated with the ER. The primary pathway involves 
7-dehydrocholesterol as the penultimate intermediate. An alternative pathway, also 
composed of many steps, produces the intermediate desmosterol. Reduction of the 
double bond at C-24 yields cholesterol. Cholesterol esters—a principal form of cir- 
culating cholesterol—are synthesized by acyl-CoA:cholesterol acyltransferases 
(ACAT) on the cytoplasmic face of the ER. 


24:5 How Are Lipids Transported Throughout the Body? 


When most lipids circulate in the body, they do so in the form of lipoprotein 
complexes. Simple, unesterified fatty acids are merely bound to serum albumin and 
other proteins in blood plasma, but phospholipids, triacylglycerols, cholesterol, and 
cholesterol esters are all transported in the form of lipoproteins. At various sites in 
the body, lipoproteins interact with specific receptors and enzymes that transfer or 
modify their lipid cargoes. It is now customary to classify lipoproteins according to 
their densities (Table 24.1). The densities are related to the relative amounts of lipid 
and protein in the complexes. Because most proteins have densities of about 1.3 to 
1.4¢/mL, and lipid aggregates usually possess densities of about 0.8 g/mL, the more 
protein and the less lipid in a complex, the denser the lipoprotein. Thus, there are 
high-density lipoproteins (HDLs), low-density lipoproteins (LDLs), intermediate- 
density lipoproteins (IDLs), very-low-density lipoproteins (VLDLs), and also 
chylomicrons. Chylomicrons have the lowest protein-to-lipid ratio and thus are 
the lowest-density lipoproteins. They are also the largest. 


Lipoprotein Complexes Transport Triacylglycerols and Cholesterol Esters 


HDL and VLDL are assembled primarily in the ER of the liver (with smaller amounts 
produced in the intestine), whereas chylomicrons form in the intestine. LDL is not 
synthesized directly but rather is made from VLDL. LDL appears to be the major cir- 
culatory complex for cholesterol and cholesterol esters. The primary task of chy- 
lomicrons is to transport triacylglycerols. Despite all this, it is extremely important to 
note that each of these lipoprotein classes contains some of each type of lipid. The FIGURE 24.36 Ph otograph of an arterial plaque. The 
relative amounts of HDL and LDL are important in the disposition of cholesterolin view is into the artery (orange), with the plaque shown 
the body and in the development of arterial plaques (Figure 24.36). The structures of in yellow at the back. 
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ELA Composition a 

Composition (% dry weight) 
Lipoprotein Density Diameter 
Class (g/mL) (nm) Protein Cholesterol Phospholipid Triacylglycerol 
HDL 1.063-1.21 5-15 33 30 29 8 
LDL 1.019-1.063 18-28 25 50 21 4 
IDL 1.006-1.019 25-50 18 29 22 31 
VLDL 0.95-1.006 30-80 10 22 18 50 
Chylomicrons <0.95 100-500 1-2 8 7 84 








Adapted from Brown, M, and Goldstein, J., 1987. In Braunwald, E., et al., eds., Harrison’s Principles of Internal Medicine, 11th ed. New York: McGraw-Hill; and Vance, D., and Vance, J., eds., 1985. 
Biochemistry of Lipids and Membranes. Menlo Park, CA: Benjamin/Cummings. 
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(a) 





(b) 





FIGURE 24.37 A model for the structure of a typical 
lipoprotein. (a) A core of cholesterol and cholesteryl 
esters is surrounded by a phospholipid (monolayer) 
membrane. Apolipoprotein A-I is modeled here as a long 
amphipathic a-helix, with the nonpolar face of the helix 
embedded in the hydrophobic core of the lipid particle 
and the polar face of the helix exposed to solvent. (b) A 
ribbon diagram of apolipoprotein A-l. (Adapted from 
Borhani, D.W., Rogers, D. P, Engler, J. A., and Brouillette, C. G., 1997. 
Crystal structure of truncated human apolipoprotein A-I sug- 
gests a lipid-bound conformation. Proceedings of the National 
Academy of Sciences 94:12291-12296.) 





the various lipoproteins are approximately similar, and they consist of a core of mo- 
bile triacylglycerols or cholesterol esters surrounded by a single layer of phospholipid, 
into which is inserted a mixture of cholesterol and proteins (Figure 24.37). Note that 
the phospholipids are oriented with their polar head groups facing outward to inter- 
act with solvent water and that the phospholipids thus shield the hydrophobic lipids 
inside from the solvent water outside. The proteins also function as recognition sites 
for the various lipoprotein receptors throughout the body. A number of different 
apoproteins have been identified in lipoproteins (Table 24.2), and others may exist 
as well. The apoproteins have an abundance of hydrophobic amino acid residues, as 
is appropriate for interactions with lipids. A cholesterol ester transfer protein also as- 
sociates with lipoproteins. 


Lipoproteins in Circulation Are Progressively Degraded 
by Lipoprotein Lipase 


The livers and intestines of animals are the primary sources of circulating lipids. Chy- 
lomicrons carry triacylglycerol and cholesterol esters from the intestines to other tis- 
sues, and VLDLs carry lipid from liver, as shown in Figure 24.38. At various target 
sites, particularly in the capillaries of muscle and adipose cells, these particles are de- 
graded by lipoprotein lipase, which hydrolyzes triacylglycerols. Lipase action causes 
progressive loss of triacylglycerol (and apoprotein) and makes the lipoproteins 
smaller. This process gradually converts VLDL particles to IDL and then LDL parti- 
cles, which are either returned to the liver for reprocessing or redirected to adipose 
tissues and adrenal glands. Every 24 hours, nearly half of all circulating LDL is re- 
moved from circulation in this way. The LDL binds to specific LDL receptors, which 
cluster in domains of the plasma membrane known as coated pits (discussed in sub- 
sequent paragraphs). These domains eventually invaginate to form coated vesicles 
(Figure 24.39), which pinch off from the plasma membrane and form endosomes 
(literally “bodies inside” the cell). In the low pH environment of the endosome, the 
LDL particles dissociate from their receptors. The endosomes then fuse with lyso- 
somes, and the LDLs are degraded by lysosomal acid lipases. 

HDLs have much longer life spans in the body (5 to 6 days) than other lipopro- 
teins. Newly formed HDL contains virtually no cholesterol ester. However, over 
time, cholesterol esters are accumulated through the action of lecithin: cholesterol 
acyltransferase (LCAT), a 59-kD glycoprotein associated with HDLs. Another asso- 
ciated protein, cholesterol ester transfer protein, transfers some of these esters to 
VLDL and LDL. Alternatively, HDLs function to return cholesterol and cholesterol 
esters to the liver. This latter process apparently explains the correlation between 
high HDL levels and reduced risk of cardiovascular disease. (High LDL levels, on 








TABLE 24.2 | 

Concentration 

in Plasma 

Apoprotein M, (mg/100 mL) Distribution 
A-1 28,300 90-120 Principal protein in HDL 
A-2 8,700 30-50 Occurs as dimer mainly in HDL 
B-48 240,000 <5 Found only in chylomicrons 
B100 500,000 80-100 Principal protein in LDL 
C-1 7,000 4-7 Found in chylomicrons, VLDL, HDL 
C-2 8,800 3-8 Found in chylomicrons, VLDL, HDL 
C-3 8,800 8-15 Found in chylomicrons, VLDL, IDL, HDL 
D 32,500 8-10 Found in HDL 
E 34,100 3-6 Found in chylomicrons, VLDL, IDL, HDL 








Adapted from Brown, M, and Goldstein, J., 1987. In Braunwald, E., et al., eds., Harrison’s Principles of Internal Medicine, 11th 
ed. New York: McGraw-Hill; and Vance, D., and Vance, J., eds., 1985. Biochemistry of Lipids and Membranes, Menlo Park, CA: 
Benjamin/Cummings. 





24.5 How Are Lipids Transported Throughout the Body? 


@ Synthesis of apoproteins, phosphatidylcholine, 
triacylglycerol, cholesterol, cholesterol esters 
occurs in the endoplasmic reticulum 


(+) @ Assembly of components into prelipoprotein 
particles in the ER, then transfer to Golgi 


© Golgi processes the particles with additional 
phospholipids and perhaps also cholesterol 
and cholesterol esters added 


4 So Ie @ Formation of secretory vesicle 
containing lipoprotein particles 
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VEDE 
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FIGURE 24.38 Lipoprotein components are synthesized predominantly in the ER of liver cells. Following assem- 
bly of lipoprotein particles (red dots) in the ER and processing in the Golgi, lipoproteins are packaged in secre- 
ory vesicles for export from the cell (via exocytosis) and released into the circulatory system. 
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FIGURE 24.39 Endocytosis and degradation of lipoprotein particles. (ACAT is acyl-CoA cholesterol acyltransferase.) 
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amino-terminal binding domain is responsible for 
recognition and binding of LDL apoprotein. The B-100 
apolipoprotein of the LDL particle is presumed to bind 
to the fourth and fifth cysteine-rich repeats (R4 and R5). 
The O-linked oligosaccharide-rich domain may act as a 
molecular spacer, raising the binding domain above the 
glycocalyx. The cytosolic domain is required for aggre- 
gation of LDL receptors during endocytosis. 





the other hand, are correlated with an increased risk of coronary artery and cardio- 
vascular disease.) 


The Structure of the LDL Receptor Involves Five Domains 


The LDL receptor in plasma membranes (Figure 24.40) consists of 839 amino acid 
residues and is composed of five domains, two of which contain multiple subdo- 
mains. The N-terminal LDL-binding domain (292 residues) contains seven cysteine- 
rich repeats, denoted R1 to R7. The next segment (417 residues) contains three 
epidermal growth factor repeats, as well as a B-propellor module. This is followed in 
the sequence by a 58-residue segment of O-linked oligosaccharides, a 22-residue 
membrane-spanning segment, and a 50-residue segment extending into the cytosol. 
The clustering of receptors prior to the formation of coated vesicles requires the 
presence of this cytosolic segment. Note that the LDL particle binds specifically to 
the receptor at the fourth and fifth cysteine-rich repeats (R4 and R5). 


The LDL Receptor G-Propellor Displaces LDL Particles in Endosomes 


Figure 24.39 shows the release of LDL particles in endosomes that pinch off from 
the plasma membrane when cells take up LDLs. What molecular events trigger the 
release of LDL particles? A collaboration by three Nobel laureates has provided an 
answer. Johann Deisenhofer, Michael Brown, and Joseph Goldstein have deter- 
mined the structure of the extracellular domain of the LDL receptor at pH 5.3, the 
typical pH inside endosomes. At this low pH, the receptor polypeptide is folded 
back on itself, with the 6-propellor domain associated with R4 and R5, the two re- 
peats that normally bind the LDL particle (Figure 24.41). The implication is that 
the B-propellor displaces the LDL particle in the lower pH environment of the en- 
dosome. What residues at the interface between the propellor and the R4 and R5 
repeats act as the pH sensors? Three histidines at the propellor-R4/R5 interface— 
His!, His°®, and His”%®—are the likely pH-sensing residues. His!™ lies at the tip of 
a loop on R5, whereas His°® and His®*® are on the surface of the propellor domain 
(Figure 24.41). These three His residues form a cluster at the three-way junction be- 
tween R4, R5, and the 6-propellor. 


Defects in Lipoprotein Metabolism Can Lead to Elevated 
Serum Cholesterol 


The mechanism of LDL metabolism and the various defects that can occur therein 
have been studied extensively by Michael Brown and Joseph Goldstein, who received 
the Nobel Prize in Physiology or Medicine in 1985, Familial hypercholesterolemia is 
the term given to a variety of inherited metabolic defects that lead to greatly elevated 
levels of serum cholesterol, much of it in the form of LDL particles. The general 


(b) 





B-propellor 


FIGURE 24.41 (a) The cysteine-rich repeats R4 and R5 are the site of LDL particle binding. Each repeat con- 
tains two loops connected by three disulfide bonds. The second loop in each repeat carries acidic residues 
(Asp and Glu) and forms a Ca?*-binding site. (b) The B-propellor domain of the LDL receptor. (c) The struc- 
ture of the LDL receptor extracellular domain at pH 5.3 (similar to that found in endosomes). The B-propellor 
domain is associated with cysteine-rich domains R4 and R5. A cluster of histidines (His!%°, His?®?, and His°®°) 
and a variety of hydrophobic and charged interactions mediate the interaction (pdb id = 1N7D). 


24.6 How Are Bile Acids Biosynthesized? 


genetic defect responsible for familial hypercholesterolemia is the absence or dys- 
function of LDL receptors in the body. Only about half the normal level of LDL re- 
ceptors is found in heterozygous individuals (persons carrying one normal gene and 
one defective gene). Homozygotes (with two copies of the defective gene) have few, 
if any, functional LDL receptors. In such cases, LDLs (and cholesterol) cannot be 
absorbed, and plasma levels of LDL (and cholesterol) are very high. Typical hetero- 
zygotes display serum cholesterol levels of 300 to 400 mg/dL, but homozygotes carry 
serum cholesterol levels of 600 to 800 mg/dL or even higher. There are two possible 
causes of an absence of LDL receptors—either receptor synthesis does not occur at 
all, or the newly synthesized protein does not successfully reach the plasma mem- 
brane due to faulty processing in the Golgi or faulty transport to the plasma mem- 
brane. Even when LDL receptors are made and reach the plasma membrane, they 
may fail to function for two reasons. They may be unable to form clusters competent 
in coated pit formation because of folding or sequence anomalies in the carboxy- 
terminal domain, or they may be unable to bind LDL because of sequence or fold- 
ing anomalies in the LDL-binding domain. 


24:6) How Are Bile Acids Biosynthesized? 


Bile acids, which exist mainly as bile salts, are polar carboxylic acid derivatives of 
cholesterol that are important in the digestion of food, especially the solubilization 
of ingested fats. The Na* and K* salts of glycocholic acid and taurocholic acid are the 
principal bile salts (Figure 24.42). Glycocholate and taurocholate are conjugates of 
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FIGURE 24.42 Cholic acid, a bile salt, is synthesized from 
cholesterol via 7@-hydroxycholesterol. Conjugation 
with taurine or glycine produces taurocholic acid and 
glycocholic acid, respectively. Taurocholate and glyco- 
cholate are freely water soluble and are highly effective 
detergents. 
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HUMAN BIOCHEMISTRY 


Steroid 5a-Reductase—A Factor in Male Baldness, Prostatic Hyperplasia, 


and Prostate Cancer 


An enzyme that metabolizes testosterone may be involved in the 
benign conditions of male-pattern baldness (also known as andro- 
genic alopecia) and benign prostatic hyperplasia (prostate gland en- 
largement), as well as potentially fatal prostate cancers. Steroid 
5a-reductases are membrane-bound enzymes that catalyze the 
NADPH-dependent reduction of testosterone to dihydrotestos- 


dominates in the sebaceous glands of skin and liver, whereas type 
II is most abundant in the prostate, seminal vesicles, liver, and epi- 
didymis. DHT is a contributory factor in male baldness and pro- 
static hyperplasia, and it has also been shown to act as a mitogen 
(a stimulator of cell division). For these reasons, 5a-reductase in- 
hibitors are potential candidates for treatment of these human 


conditions. 

Finasteride (see figure) is a specific inhibitor of type II 5a- 
reductase. It has been used clinically for treatment of benign pro- 
static hyperplasia, and it is also marketed under the trade name 
Propecia by Merck as a treatment for male baldness. Type II 5a- 
reductase inhibitors may also be potential therapeutic agents for 
treatment of prostate cancer. Somatic mutations occur in the gene 
for type II 5a-reductase during prostate cancer progression. 

Because type I 5a-reductase is the predominant form of the en- 
zyme in human scalp, the mechanism of finasteride’s promotion 
of hair growth in men with androgenic alopecia has been 
uncertain. However, scientists at Merck have shown that whereas 
type I 5a-reductase predominates in sebaceous ducts of the skin, 
type II 5a-reductase is the only form of the enzyme present in hair 
follicles. Thus, finasteride’s therapeutic effects may arise from in- 
hibition of the type II enzyme in the hair follicle itself. 


terone (DHT) (see accompanying figure). Two isoforms of 5a- 
reductase have been identified. In humans; the type I enzyme pre- 
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cholic acid with glycine and taurine, respectively. Because they contain both nonpo- 
lar and polar domains, these bile salt conjugates are highly effective as detergents. 
These substances are made in the liver, stored in the gallbladder, and secreted as 
needed into the intestines. 

The formation of bile salts represents the major pathway for cholesterol degra- 
dation. The first step involves hydroxylation at C-7 (Figure 24.42). 7a-Hydroxylase, 
which catalyzes the reaction, is a mixed-function oxidase involving cytochrome P-450. 
Mixed-function oxidases use Oy» as substrate. One oxygen atom goes to hydroxylate 
the substrate while the other is reduced to water (Figure 24.43). The function of 
cytochrome P-450 is to activate Où for the hydroxylation reaction. Such hydroxy- 
lations are quite common in the synthetic routes for cholesterol, bile acids, and 
steroid hormones and also in detoxification pathways for aromatic compounds. 
Several of these are considered in the next section. 7a-Hydroxycholesterol is the 
precursor for cholic acid. 


NADPH + H+ NADP+ 


Cytochrome P-450 
reductase (Flavin) 


Cytochrome P-450 
reductase (Flavin-Hg) 


24:75 How Are Steroid Hormones Synthesized and Utilized? 


Steroid hormones are crucial signal molecules in mammals. (The details of their 
physiological effects are described in Chapter 32.) Their biosynthesis begins with 





7a-Hydroxylase 7a-Hydroxylase . . ; 
(Cytochrome P 450) (Cytochrome P-450) the desmolase reaction, which converts cholesterol to pregnenolone (Figure 
Fe?+ Fet 24.44). Desmolase is found in the mitochondria of tissues that synthesize steroids 
(mainly the adrenal glands and gonads). Desmolase activity includes two hydroxy- 
lases and utilizes cytochrome P-450. 
2H* + O, > H,O 
Cholesterol 7a-Hydroxycholesterol FIGURE 24.43 The mixed-function oxidase activity of 7a-hydroxylase. 
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Pregnenolone and Progesterone Are the Precursors of All Other 
Steroid Hormones 


Pregnenolone is transported from the mitochondria to the ER, where a hydroxyl 
oxidation and migration of the double bond yield progesterone. Pregnenolone syn- 
thesis in the adrenal cortex is activated by adrenocorticotropic hormone (ACTH), 
a peptide of 39 amino acid residues secreted by the anterior pituitary gland. 

Progesterone is secreted from the corpus luteum during the latter half of the 
menstrual cycle and prepares the lining of the uterus for attachment of a fertilized 
ovum. If an ovum attaches, progesterone secretion continues to ensure the suc- 
cessful maintenance of a pregnancy. Progesterone is also the precursor for synthe- 
sis of the other sex hormone steroids and the corticosteroids. Male sex hormone 
steroids are called androgens, and female hormones, estrogens. Testosterone is an 
androgen synthesized in males primarily in the testes (and in much smaller 
amounts in the adrenal cortex). Androgens are necessary for sperm maturation. 
Even nongonadal tissue (liver, brain, and skeletal muscle) is susceptible to the ef- 
fects of androgens. 

Testosterone is also produced primarily in the ovaries (and in much smaller 
amounts in the adrenal glands) of females as a precursor for the estrogens. -Estradiol 
is the most prominent estrogen (Figure 24.44). 


(Endoplasmic reticulum) 







Progesterone 


Aldosterone 


FIGURE 24.44 The steroid hormones are synthesized 
from cholesterol, with intermediate formation of preg- 
nenolone and progesterone. Testosterone, the principal 
male sex hormone steroid, is a precursor to B-estradiol. 
Cortisol, a glucocorticoid, and aldosterone, a mineralo- 
corticoid, are also derived from progesterone. 
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Stanozolol 


FIGURE 24.45 The structure of stanozolol, an anabolic 
steroid. 


Steroid Hormones Modulate Transcription in the Nucleus 


Steroid hormones act in a different manner from most hormones we have consid- 
ered. In many cases, they do not bind to plasma membrane receptors but rather 
pass easily across the plasma membrane. Steroids may bind directly to receptors in 
the nucleus or may bind to cytosolic steroid hormone receptors, which then enter 
the nucleus. In the nucleus, the hormone-receptor complex binds directly to spe- 
cific nucleotide sequences in DNA, increasing transcription of DNA to RNA (see 
Chapters 29 and 32). 


Cortisol and Other Corticosteroids Regulate a Variety of Body Processes 


Corticosteroids, including the glucocorticoids and mineralocorticoids, are made by the 
cortex of the adrenal glands on top of the kidneys. Cortisol (Figure 24.44) is rep- 
resentative of the glucocorticoids, a class of compounds that (1) stimulate gluco- 
neogenesis and glycogen synthesis in liver (by promoting the synthesis of PEP 
carboxykinase, fructose-1,6-bisphosphatase, glucose-6-phosphatase, and glycogen 
synthase); (2) inhibit protein synthesis and stimulate protein degradation in pe- 
ripheral tissues such as muscle; (3) inhibit allergic and inflammatory responses; 
(4) exert an immunosuppressive effect, inhibiting DNA replication and mitosis and 
repressing the formation of antibodies and lymphocytes; and (5) inhibit formation 
of fibroblasts involved in healing wounds and slow the healing of broken bones. 

Aldosterone, the most potent of the mineralocorticoids (Figure 24.44), is in- 
volved in the regulation of sodium and potassium balances in tissues. Aldosterone 
increases the kidney’s capacity to absorb Na*, C17, and HgO from the glomerular fil- 
trate in the kidney tubules. 


Anabolic Steroids Have Been Used Illegally to Enhance 
Athletic Performance 


The dramatic effects of androgens on protein biosynthesis have led many athletes to 
the use of synthetic androgens, which go by the blanket term anabolic steroids. Despite 
numerous warnings from the medical community about side effects, which include 
kidney and liver disorders, sterility, and heart disease, abuse of such substances is epi- 
demic. Stanozolol (Figure 24.45) was one of the agents found in the blood and urine 
of Ben Johnson following his record-setting performance in the 100-meter dash in 
the 1988 Olympic Games. Because use of such substances is disallowed, Johnson lost 
his gold medal and Carl Lewis was declared the official winner. 


SUMMARY 


24.1 How Are Fatty Acids Synthesized? The synthesis of fatty acids 
and other lipid components is different from their degradation. Fatty 
acid synthesis involves a set of reactions that follow a strategy different 
in several ways from the corresponding degradative process: 


1. Intermediates in fatty acid synthesis are linked covalently to the sulf 
hydryl groups of the acyl carrier proteins. In contrast, fatty acid break- 
down intermediates are bound to the —SH group of coenzyme A. 

2. Fatty acid synthesis occurs in the cytosol, whereas fatty acid degrada- 
tion takes place in mitochondria. 

3. In animals, the enzymes of fatty acid synthesis are components of 
one long polypeptide chain, the fatty acid synthase, whereas no sim- 
ilar association exists for the degradative enzymes. 

4. The coenzyme for the oxidation-reduction reactions of fatty acid syn- 
thesis is NADP*/NADPH, whereas degradation involves the NAD*/ 
NADH couple. 


24.2 How Are Complex Lipids Synthesized? A common pathway oper- 
ates in nearly all organisms for the synthesis of phosphatidic acid, the pre- 


cursor to other glycerolipids. Glycerokinase catalyzes the phosphorylation 
of glycerol to form glycerol-3-phosphate, which is then acylated at both the 
l- and 2-positions to yield phosphatidic acid. In eukaryotes, phosphatidic 
acid is converted directly either to diacylglycerol or to cytidine diphospho- 
diacylglycerol (or simply CDP-diacylglycerol). From these two precursors, 
all other glycerophospholipids in eukaryotes are derived. Phosphatidyl- 
ethanolamine synthesis begins with phosphorylation of ethanolamine to 
form phosphoethanolamine. The next reaction involves transfer of a 
cytidylyl group from CTP to form CDP-ethanolamine and pyrophosphate. 
Diacylglycerol then displaces CMP to form phosphatidylethanolamine. 
Eukaryotes also use CDP-diacylglycerol, derived from phosphatidic acid, as 
a precursor for several other important phospholipids, including phos- 
phatidylinositol (PI), phosphatidylglycerol (PG), and cardiolipin. 


24.3 How Are Eicosanoids Synthesized, and What Are Their Functions? 
Eicosanoids are ubiquitous breakdown products of phospholipids. In re- 
sponse to appropriate stimuli, cells activate the breakdown of selected 
phospholipids. Phospholipase Ag selectively cleaves fatty acids from the 
C-2 position of phospholipids. Often these are unsaturated fatty acids, 


among which is arachidonic acid. Arachidonic acid may also be released 
from phospholipids by the combined actions of phospholipase C 
(which yields diacylglycerols) and diacylglycerol lipase (which releases 
fatty acids). Animal cells can modify arachidonic acid and other poly- 
unsaturated fatty acids to produce so-called local hormones. These 
substances include the prostaglandins, as well as thromboxanes, leuko- 
trienes, and other hydroxyeicosanoic acids. 


24.4 How Is Cholesterol Synthesized? The cholesterol biosynthetic 
pathway begins in the cytosol with the synthesis of mevalonate from 
acetyl-CoA. The first step is the condensation of two molecules of acetyl- 
CoA to form acetoacetyl-CoA. In the next reaction, acetyl-CoA and 
acetoacetyl-CoA join to form 3-hydroxy-3-methylglutaryl-CoA, which is 
abbreviated HMG-CoA, in a reaction catalyzed by HMG-CoA synthase. 
The third step in the pathway is the rate-limiting step in cholesterol 
biosynthesis; HMG-CoA undergoes two NADPH-dependent reductions 
to produce 3R-mevalonate. Biosynthesis of squalene involves conversion 
of mevalonate to isopentenyl pyrophosphate and dimethylallyl pyro- 
phosphate. Condensation of these two 5-carbon intermediates produces 
geranyl pyrophosphate; addition of another 5-carbon isopentenyl group 
gives farnesyl pyrophosphate. Both steps in the production of farnesyl 
pyrophosphate occur with release of pyrophosphate, hydrolysis of which 
drives these reactions forward. Two farnesyl pyrophosphates join to pro- 
duce squalene. Squalene monooxygenase converts squalene to squalene- 
2,3-epoxide. A second ER membrane enzyme produces lanosterol, and 
another 20 steps are required to convert lanosterol to cholesterol. 


24.5 How Are Lipids Transported Throughout the Body? Most lipids 
circulate in the body in the form of lipoprotein complexes. Simple, un- 
esterified fatty acids are merely bound to serum albumin and other pro- 
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teins in blood plasma, but phospholipids, triacylglycerols, cholesterol, 
and cholesterol esters are all transported in the form of lipoproteins. At 
various sites in the body, lipoproteins interact with specific receptors 
and enzymes that transfer or modify their lipid cargoes. 


24.6 How Are Bile Acids Biosynthesized? The formation of bile salts 
represents the major pathway for cholesterol degradation. The first step 
involves hydroxylation at C-7. 7a-Hydroxylase is a mixed-function oxi- 
dase involving cytochrome P-450. Mixed-function oxidases use Oz as 
substrate. One oxygen atom goes to hydroxylate the substrate while the 
other is reduced to water. The function of cytochrome P-450 is to acti- 
vate Os for the hydroxylation reaction. Such hydroxylations are quite 
common in the synthetic routes for cholesterol, bile acids, and steroid 
hormones and also in detoxification pathways for aromatic compounds. 


24.7 How Are Steroid Hormones Synthesized and Utilized? Biosyn- 
thesis of steroid hormones begins with the desmolase reaction, which 
converts cholesterol to pregnenolone. Desmolase activity includes two 
hydroxylases and utilizes cytochrome P-450. Pregnenolone is trans- 
ported from the mitochondria to the ER, where a hydroxyl oxidation 
and migration of the double bond yield progesterone. Progesterone is 
also the precursor for synthesis of the sex hormone steroids and the cor- 
ticosteroids. Testosterone is an androgen synthesized in males primarily 
in the testes. B-Estradiol is the most prominent estrogen. Aldosterone, 
the most potent of the mineralocorticoids, is involved in the regulation 
of sodium and potassium balances in tissues. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. Carefully count and account for each of the atoms and charges in 
the equations for the synthesis of palmitoyl-CoA, the synthesis of 
malonyl-CoA, and the overall reaction for the synthesis of palmitoyl- 
CoA from acetyl-CoA. 

2. (Integrates with Chapters 18 and 19.) Use the relationships shown 
in Figure 24.1 to determine which carbons of glucose will be in- 
corporated into palmitic acid. Consider the cases of both citrate 
that is immediately exported to the cytosol following its synthesis 
and citrate that enters the TCA cycle. 

3. Based on the information presented in the text and in Figures 24.4 
and 24.5, suggest a model for the regulation of acetyl-CoA carboxy- 
lase. Consider the possible roles of subunit interactions, phospho- 
rylation, and conformation changes in your model. 

4. Consider the role of the pantothenic acid groups in animal 
FAS and the size of the pantothenic acid group itself, and estimate 
a maximal separation between the malonyl transferase and the 
B-ketoacyl-ACP synthase active sites. 

5. Carefully study the reaction mechanism for the stearoyl-CoA desat- 
urase in Figure 24.14, and account for all of the electrons flowing 
through the reactions shown. Also account for all of the hydrogen 
and oxygen atoms involved in this reaction, and convince yourself 
that the stoichiometry is correct as shown. 

6. Write a balanced, stoichiometric reaction for the synthesis of phos- 
phatidylethanolamine from glycerol, fatty acyl-CoA, and ethanol- 
amine. Make an estimate of the AG’ for the overall process. 

7. Write a balanced, stoichiometric reaction for the synthesis of cho- 
lesterol from acetyl-CoA. 

8. Trace each of the carbon atoms of mevalonate through the synthe- 
sis of cholesterol, and determine the source (that is, the position in 
the mevalonate structure) of each carbon in the final structure. 

9. Suggest a structural or functional role for the O-linked saccharide 
domain in the LDL receptor (Figure 24.40). 


10. Identify the lipid synthetic pathways that would be affected by ab- 
normally low levels of CTP. 


11. Determine the number of ATP equivalents needed to form palmitic 
acid from acetyl-CoA. (Assume for this calculation that each 
NADPH is worth 3.5 ATPs.) 


12. Write a reasonable mechanism for the 3-ketosphinganine synthase 
reaction, remembering that it is a pyridoxal phosphate—dependent 
reaction. 


13. Why is the involvement of FAD important in the conversion of 
stearic acid to oleic acid? 


14. Write a suitable mechanism for the HMG-CoA synthase reaction. 
What is the chemistry that drives this condensation reaction? 


15. Write a suitable reaction mechanism for the B-ketoacyl-ACP syn- 
thase, showing how the involvement of malonyl-CoA drives this 
reaction forward. 


16. In the FAS megasynthase structures, the multiple functional sites 
must lie within reach of the ACP and its bound acyl group sub- 
strates. Examine the mammalian FAS structure (see Figure 24.11) 
and determine the distances between the various functional sites. 
You could approach this problem either by using a molecular mod- 
eling program (such as PyMol at www.pymol.org) or by consulting 
appropriate references (the following end-of-chapter reference is a 
good place to start: Maier, T., Leibundgut, M., and Ban, N., 2008. 
Science 321:1315-1322). You should convince yourself, with quanti- 
tative arguments, that the intersite distances can be traversed ap- 
propriately by the ACP group. 

17. In the LDL receptor structure shown in Figure 24.41c, the 
8-propellor interaction with domains R4 and R5 is partly stabilized 
by salt bridges between acidic residues on R4 and R5 that also 
coordinate Ca?* ions. Use a molecular modeling program or con- 
sult the literature to identify at least two such interactions. Two suit- 
able references are Beglova, N., and Blacklow, S. C., 2005. Trends in 
Biochemical Sciences 30:309-316; and Rudenko, G., Henry, L., et al., 
2002. Science 298:2353-2358. 
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18. Insights into the function of LDL receptors in displacing LDL par- 
ticles in endosomes have come from an unlikely source: a study of 
LDL receptor binding by a human rhinovirus HRV2 (a common 
cold virus). Consult suitable references to learn how this study pro- 
vided support for the model of LDL particle displacement pre- 
sented in this chapter. Good references are Blacklow, S. C., 2004. 
Nature Structural and Molecular Biology 11:388-390; Verdaguer, N., 
Fita, I., et al., 2004. Nature Structural and Molecular Biology 11: 
429-434; and Beglova, N., and Blacklow, S. C., 2005. Trends in Bio- 
chemical Sciences 30:309-316. 


Preparing for the MCAT Exam 

19. Consider the synthesis of linoleic acid from palmitic acid and iden- 
tify a series of three consecutive reactions that embody chemistry 
similar to three reactions in the tricarboxylic acid cycle. 

20. Rewrite the equation in Section 24.1 to describe the synthesis of be- 
henic acid (see Table 8.1). 
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Nitrogen Acquisition and 
Amino Acid Metabolism 





ESSENTIAL QUESTIONS 


Nitrogen is an essential nutrient for all cells. Amino acids provide nitrogen for the 


























a synthesis of other nitrogen-containing biomolecules. Excess amino acids in the diet 
8 can be converted into a-keto acids and used for energy production. 
o 
3 What are the biochemical pathways that form ammonium from inorganic nitro- 
: E gen compounds prevalent in the inanimate environment? How is ammonium 
PoR- EL Sra Mas HP incorporated into organic compounds? How are amino acids synthesized and 
Soybeans. Only plants and certain microorganisms are degraded? 
able to transform the oxidized, inorganic forms of 
nitrogen available in the inanimate environment into 
reduced, biologically useful forms. Soybean plants can 
meet their nitrogen requirements both by assimilat- . g . g . . . 
ing nitrate A Cee itn ee ne N. Amino acids and nucleotides, as well as their polymeric forms (proteins and nucleic 
acids), are nitrogen-containing molecules upon which cell structure and function 
I was determined to know beans. rely. How do these various organic forms of nitrogen arise? As we look at these com- 
Henry David Thoreau (1817-1862) pounds, an obvious feature is that nitrogen atoms are typically bound to carbon 
The Writings of Henry David Thoreau, vol. 2, and/or hydrogen atoms. That is, the nitrogen atom is in a reduced state. On the 
p- 178, Houghton Mifflin (1906) other hand, the prevalent forms of nitrogen in the environment are inorganic and 
oxidized; No (dinitrogen gas) and NO; (nitrate ions) being the principal species. 
The two principal routes for nitrogen acquisition from the inanimate environment, 
KEY QUESTIONS Pops l TR . a 
nitrate assimilation and nitrogen fixation, lead to formation of ammonium ions 
25.1 Which Metabolic Pathways Allow (NH,*). Reactions that incorporate NH," into organic linkage (the reactions of am- 
Organisms to Live on Inorganic Forms monium assimilation) follow. Among these, glutamine synthetase merits particular 
of Nitrogen? attention because it conveys several important lessons in metabolic regulation. This 
25.2 What Is the Metabolic Fate of Ammonium? chapter presents the pathways of amino acid biosynthesis and degradation; those in- 
25.3 What Regulatory Mechanisms Act on volving the sulfur-containing amino acids provide an opportunity to introduce as- 
Escherichia coli Glutamine Synthetase? pects of sulfur metabolism. 
25.4 ow Do Organisms Synthesize Amino 
Acids? 
z 2 z M4 e e e 
25.5 How Does Amino Acid Catabolism Lead 25vle Which Metabolic Pathways Allow Organisms to Live 
into Pathways of Energy Production? 


on Inorganic Forms of Nitrogen? 


Nitrogen Is Cycled Between Organisms and the Inanimate Environment 


Nitrogen acquisition by biological systems is accompanied by its reduction to am- 
monium ion (NH,*) and the incorporation of NH,* into organic linkage as amino 
or amido groups (Figure 25.1). The reduction of NO,” to NH,* occurs in green 
plants, various fungi, and certain bacteria in a two-step metabolic pathway known 
as nitrate assimilation. The formation of NH4* from No gas is termed nitrogen fix- 
ation. No fixation is an exclusively prokaryotic process, although bacteria in symbi- 
otic association with certain green plants also carry out nitrogen fixation. No ani- 
mals are capable of either nitrogen fixation or nitrate assimilation, so they are 
totally dependent on plants and microorganisms for the synthesis of organic ni- 
trogenous compounds, such as amino acids and proteins, to satisfy their require- 
ments for this essential element. 

Animals release excess nitrogen in a reduced form, either as NH,* or as organic 
nitrogenous compounds such as urea. The release of N occurs both during life and 
as a consequence of microbial decomposition following death. Various bacteria re- 
turn the reduced forms of nitrogen back to the environment by oxidizing them. 
CENGAGENOW” Create your own study path for The oxidation of NH,* to NO; by nitrifying bacteria, a group of chemoau- 
Unie crapa wiin Cantal sine aU ons eniin, totrophs, provides the sole source of chemical energy for the life of these microbes. 
and Active Figures at www.cengage.com/login. ‘ i . 

Nitrate nitrogen also returns to the atmosphere as Ng as a result of the metabolic 
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FIGURE 25.1 The nitrogen cycle. Organic nitrogenous compounds are formed by the incorporation of NH4* 
into carbon skeletons. Note that denitrification and nitrogen fixation are anaerobic processes. 


activity of denitrifying bacteria. These bacteria are capable of using NO; and sim- 
ilar oxidized inorganic forms of nitrogen as electron acceptors in place of Og in 
energy-producing pathways. The NO; is reduced ultimately to dinitrogen (No). 
These bacteria thus deplete the levels of combined nitrogen, that is, N joined with 
other elements in chemical compounds. Combined nitrogen is important as nat- 
ural fertilizer. However, the denitrifying activity of bacteria is exploited in water 
treatment plants to reduce the load of combined nitrogen that might otherwise 
enter lakes, streams, and bays. 


Nitrate Assimilation Is the Principal Pathway for Ammonium 
Biosynthesis 


Nitrate assimilation occurs in two steps: the two-electron reduction of nitrate to ni- 
trite, catalyzed by nitrate reductase (Equation 25.1), followed by the six-electron 
reduction of nitrite to ammonium, catalyzed by nitrite reductase (Equation 25.2). 


(1) NO; + 2 Ht + 2¢@—>NO,- + H,O (25.1) 
(2) NO.” +8H* +6 e —> NH,* +2 HO (25.2) 





Nitrate assimilation is the predominant means by which green plants, algae, and 
many microorganisms acquire nitrogen. The pathway of nitrate assimilation ac- 
counts for more than 99% of the inorganic nitrogen (nitrate or Nə) assimilated 
into organisms. 


Nitrate Reductase Contains Cytochrome b;;7 and Molybdenum Cofactor 
NADH NO; 
[-SH—> FAD >cytochrome b5; > MoCo] (25.3) 
NADH* NO.” 


A pair of electrons is transferred from NADH via enzyme-associated sulfhydryl 
groups, FAD, cytochrome 0557, and MoCo (an essential molybdenum-containing 
cofactor) to nitrate, reducing it to nitrite. The brackets [ ] denote the protein- 
bound prosthetic groups that constitute an e transport chain between NADH 
and nitrate. Nitrate reductases typically are cytosolic 220-kD dimeric proteins. 
The structure of the molybdenum cofactor (MoCo) is shown in Figure 25.2a. 
Molybdenum cofactor is necessary for both nitrate reductase activity and the as- 
sembly of nitrate reductase subunits into the active dimeric holoenzyme form. 
Molybdenum cofactor is also an essential cofactor for a variety of enzymes that 
catalyze hydroxylase-type reactions, including xanthine dehydrogenase, aldehyde 
oxidase, and sulfite oxidase. 





Corn nitrate reductase cytochrome 
domain (FAD shown in yellow) 
(pdb id = 1CNE) 





Fungal nitrate reductase molybdenum 
cofactor domain (Mo cofactor in gold) 
(pdb id = 2BIH) 


Nitrate reductase has two structural domains, the cyto- 
chrome b domain (top) and the molybdenum cofactor 
domain (bottom). 
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FIGURE 25.2 The novel prosthetic groups of nitrate reductase and nitrite reductase. (a) The molybdenum 
cofactor of nitrate reductase. (b) Siroheme, an essential prosthetic group of nitrite reductase. Siroheme is 
novel among hemes in having eight carboxylate-containing side chains. These carboxylate groups may act 
as H+ donors during the reduction of NO.~ to NH4*. COOH COOH 





COOH 


Nitrite Reductase Contains Siroheme Six electrons are required to reduce NO.” 
to NH,*. Nitrite reductases in photosynthetic organisms obtain these electrons 
from six molecules of photosynthetically reduced ferredoxin (Fdyea). 


Light>6 Fdyeq NO,” 
[(4Fe-4S) — siroheme] (25.4) 
6 Fd,, NH,* 


Photosynthetic nitrite reductases are 63-kD monomeric proteins having a 
tetranuclear iron-sulfur cluster and a novel heme, termed siroheme, as pros- 
thetic groups. The [4Fe-4S] cluster and the siroheme act as a coupled e transfer 
center. Nitrite binds directly to siroheme, providing the sixth ligand, much as O2 
binds to the heme of hemoglobin. Nitrite is reduced to ammonium while lig- 
anded to siroheme. The structure of siroheme is shown in Figure 25.2b. 

In higher plants, nitrite reductase is found in chloroplasts, where it has ready ac- 
cess to its primary reductant, photosynthetically reduced ferredoxin. Microbial ni- 
trite reductases closely resemble nitrate reductases in having essential —SH groups 
and FAD prosthetic groups to couple enzyme-mediated NADPH oxidation to nitrite 





Spinach nitrite reductase (iron-sulfur 
cluster in gold, siroheme in red) . : 
(pdb id = 2AKJ) reduction (Figure 25.3). 


Sequence Organization of the Nitrate Assimilation Enzymes 


Plant and Fungal Nitrate Reductases 
(~200-kD homodimers) 


1 112 482 542 620 656 787 917 


Plant Nitrite Reductases 
(63-kD monomers) 






473 518 566 


Fungal Nitrite Reductases 
(~250-kD homodimers) 


FAD NAD(P)H Cys-rich FeS-siroheme/NO.— 
26 60 183 215 496 600 715 763 1176 


FIGURE 25.3 Domain organization within the enzymes of nitrate assimilation. The numbers denote residue 
number along the amino acid sequence of the proteins. 
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Organisms Gain Access to Atmospheric N, Via the Pathway 
of Nitrogen Fixation 


Nitrogen fixation involves the reduction of nitrogen gas (No) via an enzyme system 
found only in prokaryotic cells. The heart of the nitrogen fixation process is the en- 
zyme known as nitrogenase, which catalyzes the reaction 


No +10H* + 8eé—>2 NH,* E Ho (25.5) 


Note that an obligatory reduction of two protons to hydrogen gas accompanies the 
biological reduction of No to ammonia. Less than 1% of the inorganic N incorpo- 
rated into organic compounds by organisms can be attributed to nitrogen fixation; 
however, this process is the only way that organisms can tap into the enormous 
reservoir of Ns in the atmosphere. 

Although nitrogen fixation is exclusively prokaryotic, No-fixing bacteria may be 
either free-living or living as symbionts with higher plants. For example, Rhizobia are 
bacteria that fix nitrogen in symbiotic association with soybeans and other legumi- 
nous plants. Because nitrogen in a metabolically useful form is often the limiting 
nutrient for plant growth, such symbiotic associations can be an important factor in 
plant growth and agriculture. 

Despite the wide diversity of bacteria in which nitrogen fixation takes place, all 
Nofixing systems are nearly identical and all have four fundamental requirements: 
(1) the enzyme nitrogenase; (2) a strong reductant, such as reduced ferredoxin; 
(3) ATP; and (4) O>-free conditions. In addition, several modes of regulation act to 
control nitrogen fixation. 


The Nitrogenase Complex Is Composed of Two Metalloproteins Two metallopro- 
teins constitute the nitrogenase complex: the Fe-protein or nitrogenase reductase 
and the MoFe-protein, which is another name for nitrogenase. Nitrogenase reduc- 
tase is a 60-kD homodimer possessing a single [4Fe-4S] cluster as a prosthetic group. 
Nitrogenase reductase is extremely Og sensitive. Nitrogenase reductase binds MgATP 
and hydrolyzes two ATPs per electron transferred during nitrogen fixation. Because 
reduction of N, to 2 NH,* + Hg requires 8 electrons, 16 ATPs are consumed per No 
reduced. 

This ATP requirement seems paradoxical because the reaction is thermody- 
namically favorable: The A,’ for the reaction (No + 8 © + 10 H* > 2 NHy* + 
Ho) is -0.314 V. Ferredoxin, the most common e` donor for nitrogen fixation, has 
an @,’ that is more negative (see Table 20.1). The solution to the paradox is found 
in the very strong bonding between the two N atoms in No (Figure 25.4). Sub- 
stantial energy input is needed to overcome this large activation energy and break 
the N=N triple bond. In this biological system, the energy is provided by ATP. 


8@ + 10Ht + :N= N: —>» 2NHi+ Ho 


Very high energy 
of activation 


Energy —> 


> -AG 








Reaction coordinate ——> 


FIGURE 25.4 The triple bond in Nz must be broken dur- 
ing nitrogen fixation. 
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(a) E Cys” 
CysB ba Ke 


Cysa®? 


Cysa!54 





Cysa®® 


(b) 





Homocitrate 


FIGURE 25.5 Structures of the two types of metal clus- 
ters found in nitrogenase. (a) The P-cluster consists of 
two Fe,S; clusters that share an S atom. (b) The FeMo- 
cofactor contains 1 Mo, 7 Fe, and 9 S atoms. Homocitrate 
provides two oxo ligands to the Mo atom. (Adapted from 


Leigh, G.J., 1995.The mechanism of dinitrogen reduction by @ @ @ 
molybdenum nitrogenases. European Journal of Biochemistry © ® 
229:14-20.) Mo Fe S N Q 


Nitrogenase, the MoFe-protein, is a 240-kD a fstype heterotetramer. An 
aB-dimer serves as the functional unit, and each af-dimer contains two types of 
metal centers: an unusual 8Fe-7S center known as the P-cluster (Figure 25.5a) and 
the novel 7Fe-1Mo-9S cluster known as the FeMo-cofactor (Figure 25.5b). Nitroge- 
nase under unusual circumstances may contain an iron: vanadium cofactor instead 
of the molybdenum-containing one. Like nitrogenase reductase, nitrogenase is very 
oxygen labile. 


The Nitrogenase Reaction In the nitrogenase reaction (Figure 25.6), electrons 
from reduced ferredoxin pass to nitrogenase reductase, which serves as electron 
donor to nitrogenase, the enzyme that actually catalyzes No fixation. Electron trans- 
fer from nitrogenase reductase to nitrogenase takes place through docking of ni- 
trogenase reductase with an af-subunit pair of nitrogenase (Figure 25.7). Nitroge- 
nase reductase transfers e to nitrogenase one electron at a time. No is bound within 
the FeMo-cofactor metal cluster until all electrons and protons are added; no free 
intermediates, such as HN—=NH or HN—NH ; are released. Electron transfer takes 
place in the following sequence: Fe-protein — P-cluster + FeMo-cofactor > No. 
ATP hydrolysis is coupled to the transfer of an electron from the Fe-protein to the 
P-cluster. ATP hydrolysis leads to conformational change in the nitrogenase reduc- 


No + 10H* 
ae Nitrogenase 
FIGURE 25.6 The nitrogenase reaction. Depending on NADH ( Reduced > reductase ` Nitrogenase 8@ 
the bacterium, electrons for N; reduction may come Ho ferredoxin (Fe-S) (Fe-S, FeMoCo) 
from light, NADH, hydrogen gas, or pyruvate. The pri- Pyruvate 
mary e~ donor for the nitrogenase system is reduced 16 ATP 16 ADP 2 NH} + Ho 


ferredoxin. + 16P, 


25.1 





tase, so it no longer binds to nitrogenase. The ADP: oxidized nitrogenase reductase 
complex dissociates, making way for another ATP:reduced nitrogenase reductase 
complex to bind to nitrogenase. Interestingly, nitrogenase reductase is a member of 
the G-protein family; G proteins are molecular switches whose operation is driven 
by NTP hydrolysis. 

Nitrogenase is a rather slow enzyme: Its optimal rate of e transfer is about 12 e 
pairs per second per enzyme molecule; that is, it reduces only three molecules of 
nitrogen gas per second. Because its activity is so weak, nitrogen-fixing cells main- 
tain large amounts of nitrogenase so that their requirements for reduced N can be 
met. As much as 5% of the cellular protein may be nitrogenase. 


The Regulation of Nitrogen Fixation To a first approximation, two regulatory 
controls are paramount (Figure 25.8): (1) ADP inhibits the activity of nitrogenase; 
thus, as the ATP/ADP ratio drops, nitrogen fixation is blocked. (2) NH4* represses 


© S (b) 


nif gene expression \, 


$ 
Nitrogenase 


1 Y 

a > 2 NH,* + H, (a) 
/ 

16 ATP  16P; + 16ADP -77 





No + 8 @ + 10 H+ 


2 


i 
| “S— C— NHg 
(c) eZ 
N 
NAD* Nicotinamide 





Active 





Inactive 





Arg?! 








ADP-ribosyl 
group 


Which Metabolic Pathways Allow Organisms to Live on Inorganic Forms of Nitrogen? 


773 


FIGURE 25.7 Ribbon diagram of nitrogenase reductase 
(the Fe-protein, blue) : nitrogenase (FeMo protein, green) 
complex. The Fe-protein iron-sulfur cluster is shown in 
yellow, bound ADP in orange. The nitrogenase FeMo 
cofactor is shown in cyan, the P-cluster in red (pdb id = 
1N20). 


FIGURE 25.8 Regulation of nitrogen fixation. (a) ADP 
inhibits nitrogenase activity. (b) NH4* represses nif gene 
expression. (c) In some organisms, the nitrogenase 
complex is regulated by covalent modification. ADP- 


ribosylation of nitrogenase reductase leads to its 
inactivation. 
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FIGURE 25.9 The glutamate dehydrogenase reaction. 


the expression of the nif genes, the genes that encode the proteins of the nitrogen- 
fixing system. To date, some 20 nif genes have been identified with the nitrogen fix- 
ation process. Repression of nif gene expression by ammonium, the primary prod- 
uct of nitrogen fixation, is an efficient and effective way of shutting down Ns fixation 
when its end product is not needed. In addition, in some systems, covalent modifi- 
cation of nitrogenase reductase leads to its inactivation. Inactivation occurs when 
Arg’! of nitrogenase reductase receives an ADP-ribosyl group donated by NAD+. 


25:2" What Is the Metabolic Fate of Ammonium? 


Given the prevalence of N atoms in cellular components, it is surprising that only 
three enzymatic reactions introduce ammonium into organic molecules. Of these 
three, glutamate dehydrogenase and glutamine synthetase are responsible for most of the 
ammonium assimilated into carbon compounds. The third, carbamoyl-phosphate syn- 
thetase I, is a mitochondrial enzyme that participates in the urea cycle. 

Glutamate dehydrogenase (GDH) catalyzes the reductive amination of a-keto- 
glutarate to yield glutamate. Reduced pyridine nucleotides (NADH or NADPH) pro- 
vide the reducing power: 


NH,* + a-ketoglutarate + NADPH + H* —> 
glutamate + NADP* + HO (25.6) 


This reaction provides an important interface between nitrogen metabolism and cel- 
lular pathways of carbon and energy metabolism because a-ketoglutarate is a citric acid 
cycle intermediate. In vertebrates, GDH is an a¢-type multimeric enzyme localized in 
the mitochondrial matrix that uses NADPH as electron donor when operating in the 
biosynthetic direction (the direction of glutamate synthesis) (Figure 25.9). In contrast, 
when GDH acts in the catabolic direction to generate a-ketoglutarate from glutamate, 
NAD+, not NADP‘, is usually the electron acceptor. The catabolic activity is allosteri- 
cally activated by ADP and inhibited by GTP. 

Glutamine synthetase (GS) catalyzes the ATP-dependent amidation of the 
y-carboxyl group of glutamate to form glutamine (Figure 25.10). The reaction pro- 
ceeds via a y-glutamyl-phosphate intermediate, and GS activity depends on the 
presence of divalent cations such as Mg?*. Glutamine is a major N donor in the 
biosynthesis of many organic N compounds such as purines, pyrimidines, and 
other amino acids, and GS activity is tightly regulated, as we shall soon see. The 
amide-N of glutamine provides the nitrogen atom in these biosyntheses. 

Carbamoyl-phosphate synthetase I, the third enzyme capable of using ammo- 
nium to form an N-containing organic compound, catalyzes an early step in the 
urea cycle. Two ATPs are consumed, one in the activation of HCO; for reaction 
with ammonium and the other in the phosphorylation of the carbamate formed 
(see also Figure 25.22): 





| 
NH,* + HCO,” + 2 ATP — HNN—C—O—PO,”" + 2ADP + P;+2H* (25.7) 


N-acetylglutamate is an essential allosteric activator for this enzyme. 
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(a) Z O FIGURE 25.10 (a) The enzymatic reaction catalyzed by 
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The Major Pathways of Ammonium Assimilation Lead 
to Glutamine Synthesis 


In organisms that enjoy environments rich in nitrogen, GDH and GS acting in se- 
quence furnish the principal route of NH,* incorporation (Figure 25.11). However, 
GDH has a significantly higher K„ for NH,* than does GS. Consequently, in organ- 
isms such as green plants that grow under conditions where little NH;4* is available, 
GDH is not effective and GS is the only NH,*-assimilative reaction. Such a situation 
creates the need for an alternative mode of glutamate synthesis to replenish the glu- 
tamate consumed by the GS reaction. This need is filled by glutamate synthase (also 
known as GOGAT, the acronym for the other name of this enzyme—glutamate: 
oxo-glutarate amino-transferase). Glutamate synthase catalyzes the reductive amina- 
tion of a-ketoglutarate using the amide-N of glutamine as the N donor: 


Reductant + a-KG + Gln —>2 Glu + oxidized reductant (25.8) 


Two glutamates are formed—one from amination of a-ketoglutarate and the other 
from deamidation of Gln (Figure 25.12). These glutamates can now serve as am- 
monium acceptors for glutamine synthesis by GS. Organisms variously use NADH, 
NADPH, or reduced ferredoxin as reductant. Glutamate synthases are typically 
large, complex proteins; in Escherichia colt, GOGAT is an 800-kD flavoprotein con- 
taining both FMN and FAD, as well as [4Fe-4S] clusters. 





SUM: 2 NH,* + a-ketoglutarate + NADPH + ATP——— glutamine + NADP? + ADP + P; + H,O 


FIGURE 25.11 The GDH/GS pathway of ammo- 
nium assimilation. The sum of these reactions 
is the conversion of 1 a-ketoglutarate to 1 glu- 
tamine at the expense of 2 NH,*, 1 ATP and 

1 NADPH. 





776 Chapter 25 Nitrogen Acquisition and Amino Acid Metabolism 
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FIGURE 25.12 The glutamate synthase reaction (left), showing the reductants exploited by different organisms 
in this reductive amination reaction. Structure of glutamate synthase (right) (odb id = 1LM1) FAD is shown in 
blue, the Fe-S cluster in yellow. 





(a) 2 NHyt + 2 ATP + 2 glutamate aes 2 glutamine + 2 ADP + 2 P, 





FIGURE 25.13 The GS/GOGAT pathway of (b) NADPH + a-ketoglutarate + glutamine cote glutamate + NADP* 
ammonium assimilation. The sum of these 
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a-ketoglutarate to 1 glutamine at the expense 
of 2 ATP and 1 NADPH. 











Together, GS and GOGAT constitute a second pathway of ammonium assimilation, 
in which GS is the only NH,*-fixing step; the role of GOGAT is to regenerate gluta- 
mate (Figure 25.13). Note that this pathway consumes 2 equivalents of ATP and 
1 NADPH (or similar reductant) per pair of N atoms introduced into Gln, in contrast 
to the GDH/GS pathway, in which only 1 ATP and 1 NADPH are consumed per pair 
of NH,° fixed. Clearly, coping with a nitrogen-limited environment has its cost. 


25:30 What Regulatory Mechanisms Act on Escherichia coli 
Glutamine Synthetase? 


As indicated earlier, glutamine plays a pivotal role in nitrogen metabolism by 
donating its amide nitrogen to the biosynthesis of many important organic N com- 
pounds. Consistent with its metabolic importance, in prokaryotic cells such as 
E. coli, GS is regulated at three different levels: 


1. Its activity is regulated allosterically by feedback inhibition. 

2. GS is interconverted between active and inactive forms by covalent modification. 

3. Cellular amounts of GS are carefully controlled at the level of gene expression and 
protein synthesis. 


Eukaryotic versions of glutamine synthetase show none of these regulatory 
features. 

E. coli GS is a 600-kD dodecamer (@js-type subunit organization) of identical 
52-kD monomers (each monomer contains 468 amino acid residues). These 
monomers are arranged as a stack of two hexagons (Figure 25.14). The active 
sites are located at subunit interfaces within the hexagons; these active sites are 
recognizable in the X-ray crystallographic structure by the pair of divalent 
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(b) 








cations that occupy them. Adjacent subunits contribute to each active site, thus 
accounting for the fact that GS monomers are catalytically inactive. 


Glutamine Synthetase Is Allosterically Regulated 


Nine distinct feedback inhibitors (Gly, Ala, Ser, His, Trp, CTP, AMP, carbamoyl-P, and 
glucosamine-6-P) act on GS. Gly, Ala, and Ser are key indicators of amino acid me- 
tabolism in the cell; each of the other six compounds represents an end product of a 
biosynthetic pathway dependent on Gln (Figure 25.15). AMP competes with ATP for 
binding at the ATP substrate site. Gly, Ala, and Ser compete with Glu for binding at 
the active site. Carbamoyl-P binds at a site that overlaps both the Glu site and the site 
occupied by the y-PO, of ATP. 


Glutamine Synthetase Is Regulated by Covalent Modification 


Each GS subunit can be adenylylated at a specific tyrosine residue (Tyr®®’) in an 
ATP-dependent reaction (Figure 25.16). Adenylylation inactivates GS. If we de- 
fine n as the average number of adenylyl groups per GS molecule, GS activity is 
inversely proportional to n. The number n varies from 0 (no adenylyl groups) to 
12 (every subunit in each GS molecule is adenylylated). Adenylylation of GS is 
catalyzed by the converter enzyme ATP: GS: adenylyl transferase, or simply adenylyl 
transferase (AT). However, whether or not this covalent modification occurs is de- 
termined by a highly regulated cycle (Figure 26.17). AT not only catalyzes adeny- 
lylation of GS, it also catalyzes deadenylylation—the phosphorolytic removal of 
the Tyr-linked adenylyl groups as ADP. The direction in which AT operates de- 
pends on the nature of a regulatory protein, Py, associated with it. Py is a 44-kD 
protein (tetramer of 11-kD subunits): The state of Py controls the direction in 


FIGURE 25.14 The subunit organization of bacterial 
glutamine synthetase. (a) Schematic; (b) molecular 
structure (note the pairs of metal ions [dark blue] that 
define the active sites) (pdb id = 1FPY). 
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which AT acts. If Py is in its so-called Pra form, the AT:Py, complex acts to 
adenylylate GS. When Py is in its so-called Pim form, the AT: Pyp complex cat- 
alyzes the deadenylylation of GS. The active sites of AT: Pya and AT: Pp are dif- 
ferent, consistent with the difference in their catalytic roles. In addition, the 
AT:Pya and AT:Pyp complexes are allosterically regulated in a reciprocal fash- 
ion by the effectors a-KG and Gln. Gln activates AT:Py, activity and inhibits 
AT: Pup activity; the effect of a-KG on the activities of these two complexes is di- 
ametrically opposite (Figure 25.17). Further, Gln favors conversion of Pyp to Pra, 
whereas a-ketoglutarate favors the Pim over the Py, form. 

Clearly, the determining factor regarding the degree of adenylylation, n, and 
hence the relative activity of GS, is the [Gln] /[a@-KG] ratio. A high [Gln] level sig- 
nals cellular nitrogen sufficiency, and GS becomes adenylylated and inactivated. In 
contrast, a high [@-KG] level is an indication of nitrogen limitation and a need for 
ammonium fixation by GS. 


Glutamine Synthetase Is Regulated Through Gene Expression 


The gene that encodes the GS subunit in EF. coli is designated GlnA. The GinA gene 
is actively transcribed to yield GS mRNA for translation and synthesis of GS protein 
only if a specific transcriptional enhancer, NR, is in its phosphorylated form, NRrP. In 
turn, NR; is phosphorylated in an ATP-dependent reaction catalyzed by NRy, a pro- 
tein kinase (Figure 25.18). However, if NRy is complexed with Pr, it acts not as a 
kinase but as a phosphatase, and the transcriptionally active form of NR, namely 
NR--P, is converted back to NR; with the result that GinA transcription halts. Recall 
from the foregoing discussion that a high [Gln]/[a-KG] ratio favors Pya at the ex- 
pense of Prp. Under such conditions, GS gene expression is not necessary. 


25:41 How Do Organisms Synthesize Amino Acids? 


Organisms show substantial differences in their capacity to synthesize the 
20 amino acids common to proteins. Typically, plants and microorganisms can 
form all of their nitrogenous metabolites, including all of the amino acids, from 


are ar 
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FIGURE 25.18 Transcriptional regulation of GinA expres- 
sion through the reversible phosphorylation of NR. 
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Test yourself on the concepts in this figure at 
www.cengage.com/login. Glutamate Oxaloacetate a-KG Aspartate 


inorganic forms of N such as NH,* and NOs". In these organisms, the a-amino 
group for all amino acids is derived from glutamate, usually via transamination of 
the corresponding a-keto acid analog of the amino acid (Figure 25.19). In many 
cases, amino acid biosynthesis is thus a matter of synthesizing the appropriate 
a-keto acid carbon skeleton, followed by transamination with Glu. The amino 
acids can be classified according to the source of intermediates for the a-keto acid 
biosynthesis (Table 25.1). For example, the amino acids Glu, Gln, Pro, and Arg 


TABLE 25.1 I 





a-Ketoglutarate Family Aspartate Family 











Glutamate Aspartate 
Glutamine Asparagine 
Proline Methionine 
Arginine Threonine 
Lysine* Isoleucine 
Lysine* 
Pyruvate Family 3-Phosphoglycerate Family 
Alanine Serine 
Valine Glycine 
Leucine Cysteine 





Phosphoenolpyruvate and Erythrose-4-P Family 





The aromatic amino acids 
Phenylalanine 
Tyrosine 
Tryptophan 
The remaining amino acid, histidine, is derived from PRPP 
(5-phosphoribosyl-1-pyrophosphate) and ATP. 











*Different organisms use different precursors to synthesize lysine. 


HUMAN BIOCHEMISTRY 
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Humans can synthesize only 10 of the 20 common amino acids 
(see table below); the others must be obtained in the diet. Those 
that can be synthesized are classified as nonessential, meaning it 
is not essential that these amino acids be part of the diet. In ef- 
fect, humans can synthesize the a-keto acid analogs of nonessen- 
tial amino acids and form the amino acids by transamination. In 
contrast, humans are incapable of constructing the carbon skele- 
tons of essential amino acids, so they must rely on dietary sources 
for these essential metabolites. Excess dietary amino acids cannot 
be stored for future use, nor are they excreted unused. Instead, 
they are converted to common metabolic intermediates that can 
be either oxidized by the citric acid cycle to generate metabolic 
energy or used to form glucose (see Section 25.5). 

Since autotrophic cells (and many prokaryotic cells) synthesize 
all 20 amino acids, several questions arise regarding human di- 
etary requirements for amino acids. First, why is it that humans 
lack the ability to do what other organisms can do? The answer is 
that, over evolutionary time, human diets provided adequate 
amounts of those amino acids classified now as “essential.” Thus, 
the loss of the metabolic pathways for synthesis of “essential” 
amino acids did not impair the fitness of humans. That is, synthe- 
sizing “essential” amino acids became superfluous, and no evolu- 
tionary pressure operated on humans to retain the genes for these 
pathways. A second question now emerges: Are there significant 
differences between synthesis of amino acids human can make 
(the so-called “nonessential” amino acids) and those they can’t? 









Essential Nonessential 
Arginine* Alanine 
Histidine* Asparagine 
Isoleucine Aspartate 
Leucine Cysteine 
Lysine Glutamate 
Methionine Glutamine 
Phenylalanine Glycine 
Threonine Proline 
Tryptophan Serine 
Valine Tyrosine 





Human Dietary Requirements for Amino Acids 


The big table summarizes amino acid biosynthesis from citric acid 
cycle intermediates in terms of the number of reactions needed to 
make an amino acid from a TCA cycle intermediate.* Nonessen- 
tial amino acids are shown in blue; essential amino acids in red. 
Two conclusions stand out: Nonessential amino acids require 
fewer reaction steps for synthesis than essential amino acids, and 
nonessential amino acids tend to be more abundantly represented 
in proteins than essential amino acids. Thus, evolutionary loss was 
not random: The biosynthetic pathways lost were those for amino 
acids requiring the most reaction steps. 


*From Srinivasan, V., Morowitz, H., and Smith, E., 2007. Essential amino 
acids, from LUCA to LUCY. Complexity 13:8-9. 








Amino Acid Reaction Steps Mole % in Proteinst 
1 Alanine 1 7.9 
2 Aspartic acid 1 5.3 
3 Glutamic acid 1 6.7 
4 Asparagine 2 4.1 
5 Glutamine 2 4.0 
6 Serine 5 6.9 
7 Glycine 6 6.9 
8 Proline 6 4.8 
9 Cysteine 7 1.5 
10 Threonine 6 5.4 
11 Valine 9 6.7 
12 Isoleucine 13 5.9 
13 Leucine 14 10.0 
14 Lysine 14 5.9 
15 Methionine 17 2.4 
16 Arginine 24 5.4 
17 Histidine 27 2:3 
18 Phenylalanine 29 4.0 
19 Tyrosinet 30 3.0 
20 Tryptophan 33 Ti 











*Arginine and histidine are essential in the diets of juveniles, not adults. 
tTyrosine is classified as nonessential only because it is readily formed from essential 
phenylalanine. 





tMole percentages are taken from amino acid representations among proteins in the 
Swiss-Prot protein knowledgebase: ca.expasy.org/sprot. 
+Note that “nonessential” tyrosine can only be made from “essential” phenylanine. 





(and, in some instances, Lys) are all members of the a-ketoglutarate family be- 
cause they are all derived from the citric acid cycle intermediate a-ketoglutarate. 
We return to this classification scheme later when we discuss the individual 


biosynthetic pathways. 


Amino Acids Are Formed from a@-Keto Acids by Transamination 


Transamination involves transfer of an a-amino group from an amino acid to the 
a-keto position of an a-keto acid (Figure 25.19). In the process, the amino donor be- 
comes an a-keto acid while the a-keto acid acceptor becomes an a-amino acid: 


Amino acid, + a-keto acidg—— a-keto acid; + amino acids 


(25.9) 
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A DEEPER LOOK 
The Mechanism of the Aminotransferase (Transamination) Reaction 


The aminotransferase (transamination) reaction is a workhorse in amino acid substrate. This is followed by hydrolysis of the ketimine 
biological systems. It provides a general means for exchange of intermediate to yield the product a-keto acid. Left in the active 
nitrogen between amino acids and a-keto acids. This vital reaction site is a pyridoxamine phosphate intermediate, which combines 
is catalyzed by pyridoxal phosphate (PLP). The mechanism in- with another (substrate) a-keto acid to form a second ketimine, 
volves loss of the C, proton, followed by an aldimine-ketimine which rearranges to form an aldamine, followed by release as an 
tautomerization—literally a “flip-flop” of the Schiff base double amino acid. Transaldiminization with a lysine at the active site 
bond from the pyridoxal aldehyde carbon to the a-carbon of the completes the reaction. 





NH» 
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o- 
a | 
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A The mechanism of PLP-catalyzed transamination reactions. 











The predominant amino acid/a-keto acid pair in these reactions is glutamate/ 
a-ketoglutarate, with the net effect that glutamate is the primary amino donor 
for the synthesis of amino acids. Transamination reactions are catalyzed by 
aminotransferases (the preferred name for enzymes formerly termed transami- 
nases). Aminotransferases are named according to their amino acid substrates, as in 
glutamate—aspartate aminotransferase. Aminotransferases are prime examples of 
enzymes that catalyze double displacement (ping-pong)-type bisubstrate reactions 
(see Figure 13.23). 


The Pathways of Amino Acid Biosynthesis Can Be Organized 
into Families 


As indicated in Table 25.1, the amino acids can be grouped into families on the 
basis of the metabolic intermediates that serve as their precursors. 


25.4 


The a-Ketoglutarate Family of Amino Acids Includes Glu, Gln, 
Pro, Arg, and Lys 


Amino acids derived from a-ketoglutarate include glutamate (Glu), glutamine 
(Gln), proline (Pro), arginine (Arg), and in fungi and protoctists such as Euglena, 
lysine (Lys). The routes for Glu and Gln synthesis were described when we consid- 
ered pathways of ammonium assimilation. 

Proline is derived from glutamate via a series of four reactions involving 
activation, then reduction, of the y-carboxyl group to an aldehyde (glutamate- 
5-semialdehyde), which spontaneously cyclizes to yield the internal Schiff base, 
A! pyrroline-5-carboxylate (Figure 25.20). NADPH-dependent reduction of the pyrro- 
line double bond gives proline. 

Arginine biosynthesis involves enzymatic steps that are also part of the urea 
cycle, a metabolic pathway that allows certain animals (including humans) to ex- 
crete any excess nitrogen arising from overconsumption of protein. Net synthesis 
of arginine depends on the formation of ornithine. Interestingly, ornithine is de- 
rived from glutamate via a reaction pathway reminiscent of the proline biosyn- 
thetic pathway (Figure 25.21). Glutamate is first N-acetylated in an acetyl-CoA- 
dependent reaction to yield N-acetylglutamate (Figure 25.21). An ATP-dependent 
phosphorylation of N-acetylglutamate to give N-acetylglutamate-5-phosphate primes 
this substrate for a reduced pyridine nucleotide-dependent reduction to the semi- 
aldehyde. N-acetylglutamate-5-semialdehyde then is aminated by a glutamate- 
dependent aminotransferase, giving N-acelylornithine, which is deacylated to orni- 
thine. In mammals, ornithine is made from glutamate via a pathway that does not 
involve an N-acetyl block. 

Ornithine has three metabolic roles: (1) to serve as a precursor to arginine, (2) to 
function as an intermediate in the urea cycle, and (3) to act as an intermediate in 
Arg degradation. In any case, the 6-NH3* of ornithine is carbamoylated in a reac- 
tion catalyzed by ornithine transcarbamoylase. The carbamoyl group is derived 
from carbamoyl-P synthesized by carbamoyl-phosphate synthetase I (CPS-I). CPS-I 
is the mitochondrial CPS isozyme; it uses two ATPs in catalyzing the formation of 
carbamoyl-P from NH; and HCOs_, one to activate the bicarbonate ion and the 
other to phosphorylate the carbamate arising from carboxyphosphate through its 





How Do Organisms Synthesize Amino Acids? 





o H o o H o 
\ | 4 Q Y | 
C — CHo — CHo — C —C C — CHy — CH — C — C 
=” | Mae / | Na 
+ _O O + 
NH? a Ne NHi 
Glutamate ATP N Glutamate-5-P 
O © NAD(P)H 
+ Ht 
8 NAD(P)* 
P; 
o Che oO H (0) 
SZ AVZ © ZN © \ | 4 
3 < C — CH; — CH, — C—C 
Aa ff * Mo SZ Z y TaN 
C N > H,O H wat O 
| Noe NAD(P)+ NAD(P)H Nie ° i 
H + Ht Al-Pyrroline-5-carboxylate Glutamate-5-semialdehyde 
Proline 


FIGURE 25.20 The pathway of proline biosynthesis from glutamate. The enzymes are (1) y-glutamyl kinase, 
(2) glutamate-5-semialdehyde dehydrogenase, and (4) A'-pyrroline-5-carboxylate reductase; reaction (3) occurs 
nonenzymatically. 
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FIGURE 25.21 The bacterial pathway of ornithine biosynthesis from glutamate. The enzymes are 
(1) N-acetylglutamate synthase, (2) N-acetylglutamate kinase, (3) N-acetylglutamate-5-semialdehyde 
dehydrogenase, (4) N-acetylornithine 6-aminotransferase, and (5) N-acetylornithine deacetylase. 


reaction with the ammonium ion (Figure 25.22). CPS-I represents the committed 
step in the urea cycle, and CPS-I is allosterically activated by N-acetylglutamate. Be- 
cause N-acetylglutamate is both a precursor to ornithine synthesis and essential to 
the operation of the urea cycle, it serves to coordinate these related pathways. 

The product of the ornithine transcarbamoylase reaction is citrulline (Figure 
25.23). Ornithine and citrulline are two a-amino acids of metabolic importance 
that nevertheless are not among the 20 a-amino acids commonly found in pro- 
teins. Like CPS-I, ornithine transcarbamoylase is a mitochondrial enzyme. The re- 
actions of ornithine synthesis and the rest of the urea cycle enzymes occur in the 
cytosol. 

The pertinent feature of the citrulline side chain is the ureido group. In a com- 
plex reaction catalyzed by argininosuccinate synthetase, this ureido group is first ac- 
tivated by ATP to yield a citrullyl-AMP derivative, followed by displacement of AMP 
by aspartate to give argininosuccinate (Figure 25.23). The formation of arginine is 
then accomplished by argininosuccinase, which catalyzes the nonhydrolytic elimi- 
nation of fumarate from argininosuccinate. This reaction completes the biosynthe- 
sis of Arg. 
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The Urea Cycle Acts to Excrete Excess N Through Arg Breakdown 


The carbon skeleton of arginine is derived principally from a-ketoglutarate, but the 
N and C atoms composing the guanidino group (Figure 25.23) of the Arg side chain 
come from NH,4*, HCO;~ (as carbamoyl-P), and the a-NHp» groups of glutamate and 
aspartate. The circle of the urea cycle is closed when ornithine is regenerated from 
Arg by the arginase-catalyzed hydrolysis of arginine. Urea is the other product of 
this reaction and lends its name to the cycle. Note that the N atoms in urea arose 
from NH,‘ and the Asp amino group. In terrestrial vertebrates, urea synthesis is re- 
quired to excrete excess nitrogen generated by increased amino acid catabolism— 
for example, following dietary consumption of more than adequate amounts of pro- 
tein. Urea formation is basically confined to the liver. 

A healthy adult human male eating a typical American diet will consume about 
100 g of protein per day. Because such an individual will remain in nitrogen balance 
(neither increasing or decreasing his net protein levels), his body must dispose of 
about 1 mole of excess N derived from the amino acids in this dietary protein. Glu- 
tamate is key to this process because of the position of glutamate dehydrogenase at 
the interface of amino acid and carbohydrate metabolism and the importance of glu- 
tamate to the urea cycle. 

Increases in amino acid catabolism lead to elevated glutamate levels and a rise in 
Nacetylglutamate, the allosteric activator of CPS-I. Stimulation of CPS-I raises over- 
all urea cycle activity because activities of the remaining enzymes of the cycle simply 
respond to increased substrate availability. Removal of potentially toxic NH4* by 
CPS-I is another important aspect of this regulation. The urea cycle is linked to the 
citric acid cycle through fumarate, a by-product of the action of argininosuccinase 
(Figure 25.23, reaction 3). 

Lysine biosynthesis in some fungi and in the protoctist Euglena also stems from 
a-ketoglutarate, making lysine a member of the a-ketoglutarate family of amino acids 
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FIGURE 25.22 The mechanism of action of CPS-I. 
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CENGAGENOW” ACTIVE FIGURE 25.23 The urea cycle series of reactions: The enzymes are (1) ornithine trans- 
carbamoylase (OTCase), (2a and 2b) argininosuccinate synthetase, (3) argininosuccinase, and (4) arginase. Test 
yourself on the concepts in this figure at www.cengage.com/login. 
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The Urea Cycle as Both an Ammonium and a Bicarbonate Disposal Mechanism 


Excretion of excess NH,* in the innocuous form of urea has tra- 
ditionally been viewed as the physiological role of the urea cycle. 
However, the urea cycle also provides a mechanism for the excre- 
tion of excess HCO 3~ arising principally from a-carboxyl groups 
generated during the catabolism of a-amino acids. The following 


1), and the second is simply protonated and dehydrated to form 
CO» (reaction 2), which is easily excreted. One interpretation of the pre- 
ceding is that these coupled reactions allow a weak acid (NH4 ) to protonate 
the conjugate base of a stronger acid (HCO; ). At first glance, this pro- 
tonation would appear thermodynamically unfavorable, but recall 
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equations illustrate this property: 


(1) HCO,” + 2 NH;* —> HNCONH: + 2 HO + Ht 
(2) HCO,” + H*—> HO + COs 


that in the urea cycle, 4 equivalents of ATP are consumed per equiv- 
alent of urea synthesized: 2 ATPs in the synthesis of carbamoyl-P, 
and 2 more as 1 ATP is converted to AMP + PP; in the synthesis of 
argininosuccinate from citrulline (Figure 25.23). If this interpreta- 
tion is correct, the urea cycle may be considered an ATP-driven proton 
pump that transfers H* ions from NH to HCO; against a thermody- 
namic barrier. In the process, the potentially toxic waste products, ammo- 
nium and bicarbonate, are rendered innocuous and excreted. 


Sum: 
2 HCO, ae 2 NH,t—> HeNCONHo» Oe CO» ag 3 HO 








That is, 2 moles of HCO;~ are eliminated in the synthesis of each 
mole of urea: One is incorporated into the product, urea (reaction 








in these organisms. (As we shall see, the other organisms capable of lysine synthesis— 
namely, bacteria, other fungi, algae, and green plants—use aspartate as a precursor.) 
To make lysine from a-ketoglutarate requires a lengthening of the carbon skeleton by 
one CHe unit to yield a-ketoadipate (Figure 25.24). This addition is accomplished by a 
series of reactions reminiscent of the initial stages of the citric acid cycle. First, a two- 
carbon acetyl-CoA unit is added to the a-carbon of a-ketoglutarate to form homo- 
citrate. Then, in a reaction sequence like that catalyzed by aconitase, homoisocitrate 
is formed from homocitrate. Oxidative decarboxylation (as in isocitrate dehydroge- 
nase) removes one carbon (the original a-carboxyl group of a-ketoglutarate), leaving 
a-ketoadipate. A glutamate-dependent aminotransferase enzyme then aminates 
a-ketoadipate to give a-aminoadipate. Next, the 6-COO™ group is activated in an ATP- 
dependent adenylylation reaction, priming this 6-COO™ group for reduction to an 
aldehyde by NADPH. a-Aminoadipic-6-semialdehyde is then reductively aminated by ad- 
dition of glutamate to its aldehydic carbon in an NADPH-dependent reaction leading 
to the formation of saccharopine. Oxidative cleavage of saccharopine by way of an 
NAD*-dependent dehydrogenase activity yields a-ketoglutarate and lysine. This path- 
way is known as the a@-aminoadipic acid pathway of lysine biosynthesis. Interestingly, ly- 
sine degradation in animals leads to formation of a-aminoadipate by a reverse series of 
reactions identical to those occurring along the last steps of this biosynthetic pathway. 


The Aspartate Family of Amino Acids Includes Asp, Asn, Lys, 
Met, Thr, and Ile 


The members of the aspartate family of amino acids include aspartate (Asp), as- 
paragine (Asn), lysine (via the diaminopimelic acid pathway), methionine (Met), 
threonine (Thr), and isoleucine (Ile). 

Aspartate is formed from the citric acid cycle intermediate oxaloacetate by trans- 
fer of an amino group from glutamate via a PLP-dependent aminotransferase reac- 
tion (Figure 25.25). Like glutamate synthesis from a-ketoglutarate, aspartate syn- 
thesis is a drain on the citric acid cycle. As we already saw, the Asp amino group 
serves as the N donor in the conversion of citrulline to arginine. In Chapter 26, we 
shall see that this —NHy is also the source of one of the N atoms of the purine ring 
system during nucleotide biosynthesis, as well as the C-6-amino-group of the major 
purine adenine. In addition, the entire aspartate molecule is used in the biosyn- 
thesis of pyrimidine nucleotides. 

Asparagine is formed by amidation of the -carboxyl group of aspartate. In bac- 
teria, in analogy with glutamine synthesis, the nitrogen added in this amidation 
comes directly from NH,"*. In other organisms, asparagine synthetase catalyzes the 
ATP-dependent transfer of the amido-N of glutamine to aspartate to yield gluta- 
mate, AMP, PP;, and asparagine (Figure 25.26). 
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FIGURE 25.24 Lysine biosynthesis in certain fungi and Euglena: the 
a-aminoadipic acid pathway. 
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HUMAN BIOCHEMISTRY 





Asparagine and Leukemia 


Leukemia is a cancer of the bone marrow that affects the stance limits its ability to cross cell membranes and thus its use 
production of normal lymphocytes (white blood cells). Acute in chemotherapy. Hopefully, “second-generation” compounds 
lymphoblastic leukemia (ALL) and acute myeloblastic leukemia based on this structure’s affinity for asparagine synthetase will 
(AML) are caused by overproduction of immature lymphocytes. lead to the development of useful drugs to treat these leukemias. 
Both normal and malignant lymphocytes are highly dependent 

on the uptake of asparagine from the blood for growth. Admin- 

istration of E. coli asparaginase, an enzyme that converts Asn to 

Asp and NH,* (page 806), is one chemotherapeutical approach 

to treat childhood ALL and some forms of AML, but patients of- 

ten develop resistance to treatment with this “foreign” protein. H O O 4 
Inhibition of asparagine synthetase presents an alternative way HN} = ll 

to deprive malignant lymphocytes of essential Asn, and as- bia S=N—P—O O 
paragine synthetase inhibitors might offer a clinical strategy for "OC 2 | 

treating asparaginase-resistant leukemias. The adenylated sul- CH5 o 

foximine shown in the figure is an analog of the aspartyl-AMP 

intermediate formed in the asparagine synthetase reaction (Fig- HO OH 
ure 25.26). In vitro, this compound inhibits asparagine syn- 

thetase at nanomolar concentrations. The polarity of this sub- An adenylated sulfoximine 
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Threonine, methionine, and lysine biosynthesis in bacteria proceeds from the 
common precursor aspartate, which is converted first to aspartyl-B-phosphate and 
then to B-aspartyl-semialdehyde. The first reaction is an ATP-dependent phosphoryla- 
tion catalyzed by aspartokinase (Figure 25.27, reaction 1). In E. coli, there are three 
isozymes of aspartokinase, designated aspartokinases I, II, and III. Each of these 
isozymes is uniquely controlled by one of the three end-product amino acids. Form 
I is feedback-inhibited by threonine and form III, by lysine. Form II is not feedback- 
inhibited, but its synthesis is repressed by methionine. 

B-Aspartyl-semialdehyde is formed via NADPH-dependent reduction of aspartyl- 
B-phosphate in a reaction catalyzed by f-aspartyl-semialdehyde dehydrogenase 
(Figure 25.27, reaction 2). From here, the pathway of lysine synthesis diverges. The 
methyl carbon of pyruvate is condensed with B-aspartyl-semialdehyde, and H,O is 
eliminated to yield the cyclic compound 2, 3-dihydropicolinate (Figure 25.27, reaction 
10). Thus, lysine synthesized by this pathway must be considered a member of both 
the aspartate and the pyruvate families of amino acids. Lysine is a feedback in- 
hibitor of this branch-point enzyme. Dihydropicolinate is then reduced in an NADPH- 
dependent reaction to A!-piperidine-2,6-dicarboxylate (Figure 25.27, reaction 11). A 
series of reactions, including a hydrolytic opening of the piperidine ring, a succiny- 
lation, a glutamate-dependent amination, and the hydrolytic removal of succinate, 
results in the formation of the symmetric L,L-a,e-diaminopimelate (Figure 25.27, 
reactions 12 through 14). Epimerization of this intermediate to the meso form, 
followed by decarboxylation, yields the end product lysine (Figure 25.27, reac- 
tions 15 and 16). Because this pathway proceeds through the symmetric L,L-a,€- 
diaminopimelate, one-half of the COs evolved in the terminal decarboxylase step 
is derived from the carboxyl group of pyruvate and one-half from the a-carboxyl 
of Asp. 

The other metabolic branch diverging from f-aspartyl-semialdehyde leads to 
threonine and methionine via homoserine, an analog of serine that is formed by the 
NADPH-dependent reduction of B-aspartyl-semialdehyde (Figure 25.27, reaction 
3) catalyzed by homoserine dehydrogenase. From homoserine, the biosynthetic 
pathways leading to methionine and threonine separate. To form methionine, the 
—OH group of homoserine is first succinylated by homoserine acyltransferase 
(Figure 25.27, reaction 6). Methionine is a feedback inhibitor of this enzyme. The 
succinyl group of O-succinylhomoserine is then displaced by cysteine to yield cys- 
lathionine (Figure 25.27, reaction 7). The sulfur atom in methionine is contributed 
by a cysteine sulfhydryl. Cystathionine is then split to give pyruvate, NH,*, and ho- 
mocysteine, a nonprotein amino acid whose side chain is one —CH»— group longer 
than that of Cys (Figure 25.27, reaction 8). Methylation of the homocysteine —SH 
via methyl transfer from the methyl donor, N°-methyl-THF (see Chapter 26) gives 
methionine (Figure 25.27, reaction 9). 

In passing, it is important to note the role of methionine itself in methylation re- 
actions. The enzyme S-adenosylmethionine synthase catalyzes the reaction of methio- 
nine with ATP to form S-adenosylmethionine, or SAM (Figure 25.28). SAM is a substrate 
of methyltransferases in a variety of methyl-donor reactions, such as the formation 
of phosphatidylcholine from phosphatidylethanolamine (see Figure 8.6). Following 
decarboxylation, SAM is a source of propylamine for the synthesis of polyamines 
(Figure 25.28). 

The remaining amino acids of the aspartate family are threonine and isoleucine. 
Threonine, like methionine, is synthesized from homoserine. Indeed, homoserine 
is the primary alcohol analog of the secondary alcohol Thr. To move this —OH 
from C-4 to C-3 requires activation of the hydroxyl through ATP-dependent phos- 
phorylation by homoserine kinase (see Figure 25.27, reaction 4). As the first reac- 
tion unique to Thr biosynthesis, homoserine kinase is feedback inhibited by threo- 
nine. The last step is catalyzed by threonine synthase, a PLP-dependent enzyme (see 
Figure 25.27, reaction 5). 

Isoleucine is included in the aspartate family of amino acids because four of 
its six carbons derive from Asp (via threonine) and only two come from pyruvate. 
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FIGURE 25.27 Biosynthesis of threonine, methionine, and lysine, members of the aspartate family of amino acids. 
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FIGURE 25.28 The synthesis of S-adenosylmethionine (SAM). 
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Nevertheless, four of the five enzymes necessary for isoleucine synthesis are common 
to the pathway for biosynthesis of valine, so discussion of isoleucine synthesis is pre- 
sented under the biosynthesis of the pyruvate family of amino acids. 


The Pyruvate Family of Amino Acids Includes Ala, Val, and Leu 


The pyruvate family of amino acids includes alanine (Ala), valine (Val), and leucine 
(Leu). Transamination of pyruvate, with glutamate as amino donor, gives alanine. 
Because these transamination reactions are readily reversible, alanine degradation 
occurs via the reverse route, with a-ketoglutarate serving as amino acceptor. 

Transamination of pyruvate to alanine is a reaction found in virtually all organ- 
isms, but valine, leucine, and isoleucine are essential amino acids, and as such, they 
are not synthesized in mammals. The pathways of valine and isoleucine synthesis 
can be considered together because one set of four enzymes is common to the last 
four steps of both pathways (Figure 25.29). Both pathways begin with an a-keto 
acid. Isoleucine can be considered a structural analog of valine that has one extra 
—CH,— unit, and its a-keto acid precursor, namely, a-ketobutyrate, is one carbon 
longer than the valine precursor, pyruvate. Interestingly, a-ketobutyrate is formed 
from threonine by threonine deaminase (Figure 25.29, reaction 1). This PLP- 
dependent enzyme (also known as threonine dehydratase or serine dehydratase) is 
feedback-inhibited by isoleucine, the end product. Note that part of the carbon 
skeleton for Ile comes from Asp by way of Thr. From here on, the Val and Ie path- 
ways employ the same set of enzymes. The first reaction involves the generation of 
hydroxyethyl-thiamine pyrophosphate from pyruvate in a reaction analogous to 
those catalyzed by transketolase and the pyruvate dehydrogenase complex. The 
two-carbon hydroxyethyl group is transferred from TPP to the respective keto acid 
acceptor by acetohydroxy acid synthase (acetolactate synthase) to give a-acetolactate 
or a-aceto-a-hydroxybutyrate (Figure 25.29, reaction 2). NAD (P)H-dependent reduc- 
tion of these a-keto hydroxy acids yields the dihydroxy acids a,6-dihydroxyisovalerate 
and a,6-dihydroxy-B-methylvalerate (Figure 25.29, reaction 3). Dehydration of each of 
these dihydroxy acids by dihydroxy acid dehydratase gives the appropriate a-keto 
acid carbon skeletons a-ketoisovalerate and a-keto-B-methylvalerate (Figure 25.29, 
reaction 4). Transamination by the branched-chain amino acid aminotransferase 
yields Val or Ile, respectively (Figure 25.29, reaction 5). 

Leucine synthesis depends on these reactions as well, because a-ketoisovalerate is 
a precursor common to both Val and Leu (Figure 25.30). Although Val and Leu dif- 
fer by only a single —CHp»— in their respective side chains, the carboxyl group of 
a-ketoisovalerate first picks up two carbons from acetyl-CoA to give a~sopropylmalate 
in a reaction catalyzed by isopropylmalate synthase; the enzyme is sensitive to feed- 
back inhibition by Leu (Figure 25.30, reaction 1). Isopropylmalate dehydratase 
(Figure 29.30, reaction 2) converts the @-isomer to the 6-form, which undergoes an 
NAD*-dependent oxidative decarboxylation by isopropylmalate dehydrogenase 
(Figure 29.30, reaction 3), so the carboxyl group of a-ketoisovalerate is lost as COs. 
Amination of a-ketoisocaproate by leucine aminotransferase (Figure 29.30, reaction 
4) gives Leu. 


The 3-Phosphoglycerate Family of Amino Acids Includes 
Ser, Gly, and Cys 


Serine, glycine, and cysteine are derived from the glycolytic intermediate 
3-phosphoglycerate. The diversion of 3-PG from glycolysis is achieved via 
3-phosphoglycerate dehydrogenase (Figure 25.31, reaction 1). This NAD*- 
dependent oxidation of 3-PG yields 3-phosphohydroxypyruvate—which, as an a-keto 
acid, is a substrate for transamination by glutamate to give 3-phosphoserine (Figure 
25.31, reaction 2). Serine phosphatase then generates serine (Figure 25.31, reaction 
3). Serine inhibits the first enzyme, 3-PG dehydrogenase, and thereby feedback- 
regulates its own synthesis. 
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FIGURE 25.29 Biosynthesis of valine and isoleucine. 
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Isoleucine 


Glycine is made from serine via two related enzymatic processes. In the first, 
serine hydroxymethyltransferase, a PLP-dependent enzyme, catalyzes the transfer 
of the serine 6-carbon to tetrahydrofolate (THF), the principal agent of one-carbon 
metabolism (Figure 25.32a). Glycine and N°, N'°-methylene-THF are the products. 
In addition, glycine can be synthesized by a reversal of the glycine oxidase reaction 
(Figure 25.32b). Here, glycine is formed when N°, N'°-methylene-THF condenses 
with NH," and COs. Via this route, the -carbon of serine becomes part of glycine. 
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FIGURE 25.30 Biosynthesis of leucine. 
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FIGURE 25.31 Biosynthesis of serine from 
3-phosphoglycerate. 


The conversion of serine to glycine is a prominent means of generating one-carbon 
derivatives of THF, which are so important for the biosynthesis of purines and the 
C-5 methyl group of thymine (a pyrimidine, see Chapter 26), as well as the amino 
acid methionine. Glycine itself contributes to both purine and heme synthesis. 
Cysteine synthesis is accomplished by sulfhydryl transfer to serine (Figure 25.33). In 
some bacteria, HS condenses directly with serine via a PLP-dependent enzyme- 
catalyzed reaction (Figure 25.33a), but in most microorganisms and green plants, the 
sulfhydrylation reaction requires an activated form of serine, O-acetylserine (Figure 
25.33b). O-acetylserine is made by serine acetyltransferase, with the transfer of an 
acetyl group from acetyl-CoA to the —OH of Ser. This enzyme is inhibited by Cys. 
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FIGURE 25.32 Biosynthesis of glycine from serine (a) via 


serine hydroxymethyltransferase and (b) via glycine 
oxidase. 
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O-Acetylserine then undergoes sulfhydrylation by HəS with elimination of acetate; the 
enzyme is O-acetylserine sulfhydrylase. 


Sulfide Synthesis from Sulfate Involves S-Containing ATP Derivatives Given the 
prevailing oxidative nature of our environment and the reactivity and toxicity of HəS, 
the source of sulfide for Cys synthesis merits discussion. In microorganisms and 
plants, sulfide is the product of sulfate assimilation. Sulfate is the common inorganic 
form of combined sulfur, and its assimilation involves several interesting ATP deriva- 
tives. ATP sulfurylase (Figure 25.34, reaction 1) catalyzes the formation of adenosine- 
5'-phosphosulfate (APS). Then, adenosine-5'-phosphosulfate-3'-phosphokinase cat- 
alyzes the formation of 3’-phosphoadenosine-5'-phosphosulfate (PAPS) from APS + 
ATP (Figure 25.34, reaction 2). Sulfite is then liberated from PAPS through the re- 
duction by reduced thioredoxin, leaving 3'-phosphoadenosine-5’-phosphate as a prod- 
uct (Figure 25.34, reaction 3). Sulfite (SOs_) is then reduced to sulfide (S7) in a mul- 
tielectron transfer reaction catalyzed by sulfite oxidase (Figure 25.34, reaction 4); 
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FIGURE 25.33 Cysteine biosynthesis. (a) Direct sulfhydrylation of serine by HS. (b) H2S-dependent sulfhydryla- 
tion of O-acetylserine. 


25.4 How Do Organisms Synthesize Amino Acids? 797 








NHo 
S 
N 
D <LI j 
I N ra 
P — OCH, 
| 
o- 
OH OH 
Adenosine 5'-phosphosulfate (APS) 
NHo 
N > 
N 
< d 
N7NyZ 
(0) 
O OH 
| 
OS P =O 
| 
o- 


3'-Phosphoadenosine 5'-phosphosulfate (PAPS) 


© 


PAPS + Thioredoxin 0,2 + Thioredoxin 


| | Sulfite Lg —S | 
SH SH 


| O 
om i P 
3'-Phosphoadenosine 
5'-phosphate 
O OH 
| 
TO 210 
| 
o- 
+ 3H* + s [sH + 3 NADP} + 3 H,O 
3 NADPH 3 a 3 Tiy FIGURE 25.34 Sulfate assimilation and the generation of 


Sulfide sulfide for synthesis of organic S compounds. 


NADPH is the electron donor. Sulfite reductase, like nitrite reductase, possesses siro- 
heme as a prosthetic group (see Figure 25.2). 3'-Phosphoadenosine-5’-phosphosulfate 
is not only an intermediate in sulfate assimilation; it also serves as the substrate for syn- 
thesis of sulfate esters, such as the sulfated polysaccharides found in the glycocalyx of 
animal cells. 


The Aromatic Amino Acids Are Synthesized from Chorismate 


The aromatic amino acids, phenylalanine, tyrosine, and tryptophan, are derived 
from a shared pathway that has chorismic acid (Figure 25.35) as a key intermediate. 
Indeed, chorismate is common to the synthesis of cellular compounds having ben- 
zene rings, including these amino acids, the fat-soluble vitamins E and K, folic acid, 
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FIGURE 25.35 Some of the aromatic compounds 
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and coenzyme Q and plastoquinone (the two quinones necessary to electron trans- 
port during respiration and photosynthesis, respectively). Lignin, a polymer of nine- 
carbon aromatic units, is also a derivative of chorismate. Lignin and related com- 
pounds can account for as much as 35% of the dry weight of higher plants; clearly, 
enormous amounts of carbon pass through the chorismate biosynthetic pathway. 


Chorismate Is Synthesized from PEP and Erythrose-4-P Chorismate biosynthesis 
occurs via the shikimate pathway (Figure 25.36). The precursors for this pathway 
are the common metabolic intermediates phosphoenolpyruvate and erythrose-4- 
phosphate. These intermediates are linked to form 3-deoxy-b-arabino-heptulosonate-7- 
phosphate (DAHP) by DAHP synthase (Figure 25.36, reaction 1). Although this 
reaction is remote from the ultimate aromatic amino acid end products, it is an im- 
portant point for regulation of aromatic amino acid biosynthesis, as we shall see. In 
the next step on the way to chorismate, DAHP is cyclized to form a six-membered 
saturated ring compound, 5-dehydroquinate (Figure 25.36, reaction 2), in a reaction 
catalyzed by dehydroquinate synthase (NAD* is a coenzyme in this reaction but is 
not modified by it). A sequence of reactions ensues that introduces unsaturations 
into the ring through dehydration (Figure 25.36, reaction 3, 5-dehydroquinate de- 
hydratase) and reduction (reaction 4, shikimate dehydrogenase), yielding shikimate. 
Phosphorylation of shikimate by shikimate kinase (reaction 5), then addition of 
PEP by 3-enolpyruvylshikimate-5-phosphate synthase (reaction 6), followed by cho- 
rismate synthase (reaction 7), gives chorismate. Thus, two equivalents of PEP are 
needed to form chorismate from erythrose-4-P. 


Phenylalanine and Tyrosine At chorismate, the pathway separates into three 
branches, each leading specifically to one of the aromatic amino acids. The branches 
leading to phenylalanine and tyrosine both pass through prephenate (Figure 25.37). In 
some organisms, such as E. coli, the branches are truly distinct because prephenate 
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FIGURE 25.36 The shikimate pathway leading to the 
Chorismate synthesis of chorismate. 


does not occur as a free intermediate but rather remains bound to the bifunctional 
enzyme that catalyzes the first two reactions after chorismate. In any case, chorismate 
mutase is the first reaction leading to Phe or Tyr (Figure 25.37, reaction 1). In the Phe 
branch, the —OH group para to the prephenate carboxyl is removed by prephenate 
dehydratase (Figure 25.37, reaction 2). In the Tyr branch, this —OH is retained; 
instead, an oxidative decarboxylation of prephenate catalyzed by prephenate 
dehydrogenase (Figure 25.37, reaction 4) yields 4-hydroxyphenylpyruvate. Glutamate- 
dependent aminotransferases (phenylalanine aminotransferase [Figure 25.27, reac- 
tion 3] and tyrosine aminotransferase [reaction 5]) introduce the amino groups into 
the two a-keto acids, phenylpyruvate and 4-hydroxyphenylpyruvate, to give Phe and Tyr, 
respectively. Some mammals can synthesize Tyr from Phe obtained in the diet via 
phenylalanine-4-monooxygenase (also known as phenylalanine hydroxylase), using O» 
and tetrahydrobioplerin, an analog of tetrahydrofolic acid, as co-substrates (Figure 25.38). 


Tryptophan The pathway of tryptophan synthesis is perhaps the most thoroughly 
studied of any biosynthetic sequence, particularly in terms of its genetic organiza- 
tion and expression. Synthesis of Trp from chorismate requires six steps (see Fig- 
ure 25.37). In most microorganisms, the first enzyme, anthranilate synthase (see 
Figure 25.37, reaction 6), is an a9Bo-type protein, with the B-subunit acting in a 
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FIGURE 25.37 The biosynthesis of phenylalanine, tyrosine, and tryptophan from chorismate. 
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Unlike animals, plants can synthesize all 20 of the common amino 
acids. Inhibitors acting specifically on the plant enzymes that are ca- 
pable of carrying out the biosynthesis of the “essential” amino acids 
(that is, enzymes that animals lack) have been developed. These 
substances appear to be ideal for use as herbicides because they 
should show no effect on animals. Glyphosate, sold commercially as 
RoundUp, is a PEP analog that acts as an uncompetitive inhibitor of 
3-enolpyruvylshikimate-5-P synthase (Figure 25.36). Sulfmeturon 


Amino Acid Biosynthesis Inhibitors as Herbicides 


thase, an enzyme common to Val, Leu, and Ile biosynthesis (Figure 
25.29), is the active ingredient in Oust. Aminotriazole, sold as 
Amitrole, blocks His biosynthesis by inhibiting imidazole glycerol-P 
dehydratase (Figure 25.40). PPT (phosphinothricin) is a potent in- 
hibitor of glutamine synthetase. Although Gln is a nonessential amino 
acid and glutamine synthetase is a ubiquitous enzyme, PPT is rela- 
tively safe for animals because it does not cross the blood-brain bar- 
rier and is rapidly cleared by the kidneys. 


methyl, a sulfonylurea herbicide that inhibits acetohydroxy acid syn- 
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glutamine-amidotransferase role to provide the —NHo group of anthranilate. Or, 
given high levels of NH,"*, the a-subunit can carry out the formation of anthranilate di- 
rectly by a process in which the activity of the 6-subunit is unnecessary. Furthermore, 
in certain enteric bacteria, such as E. coli and Salmonella typhimurium, the second reac- 
tion of the pathway, the phosphoribosyl-anthranilate transferase reaction (see Figure 
25.37, reaction 7), is an activity catalyzed by the a-subunit of anthranilate synthase. 
PRPP (5-phosphoribosyl-1-pyrophosphate), the substrate of this reaction, is also a precursor 
for purine biosynthesis (see Chapter 26). Phosphoribosyl-anthranilate then undergoes a 
rearrangement wherein the ribose moiety is isomerized to the ribulosyl form in enol-1- 
(o-carboxyphenylamino)-1-deoxyribulose-5-phosphate by N-(5'-phosphoribosyl)-anthranilate 


802 Chapter 25 Nitrogen Acquisition and Amino Acid Metabolism 


A DEEPER LOOK 


Intramolecular Tunnels Connect Distant Active Sites in Some Enzymes 


Molecular tunneling is the transfer of a reaction intermediate pro- 
duced at one active site to another active site in the same enzyme 
through an intramolecular tunnel that connects them. Trypto- 
phan synthase (Figure 25.39) was the first enzyme discovered with 
this structural feature. For tryptophan synthase, the intermediate 
is indole, but for most of these enzymes studied thus far, the in- 
termediate is ammonia (NH3) derived from glutamine. Two such 
enzymes have been presented earlier in this chapter: asparagine 
synthetase (see Figure 25.26) and glutamate synthase (see Figure 
25.12). Another will soon be considered: imidazole glycerol phos- 


phate synthase (Figure 25.40). Several enzymes in nucleotide 
metabolism (see Chapter 26) also have this attribute, including 
carbamoyl phosphate synthetase II (see Figure 26.14), glutamine 
5-phosphoribosyl-a-pyrophosphate amidotransferase (see Figure 
26.3), and CTP synthetase (see Figure 26.16). One advantage of 
molecular tunnels is that they sequester reactive intermediates 
from potentially unproductive side reactions in the intracellular 
environment (some of which might be harmful). Also, by direct- 
ing the intermediate from one active site to another, these tunnels 
favor a particular reaction sequence. 











FIGURE 25.39 Tryptophan synthase ribbon diagram. The 


a-subunit is in blue, and 


the B-subunit is in orange 


-terminal domain) and red (C-terminal domain). The 


tunnel connecting them is outlined by the yellow dot 
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isomerase (see Figure 25.37, reaction 8). Decarboxylation and ring closure ensue 
to yield the indole nucleus as indole-3-glycerol phosphate (indole-3-glycerol phos- 
phate synthase, reaction 9). The final two reactions (10 and 11 in Figure 25.37) are 
both catalyzed by tryptophan synthase, an &œəßə-type protein. The a-subunit cleaves 
indoleglycerol-3-phosphate to form indole and 3-glycerol phosphate. The indole is 
then passed to the 6-subunit, which adds serine in a PLP-dependent reaction. 

X-ray crystallographic analysis of S. typhimurium tryptophan synthase shows that 
the active sites of the a- and -subunits are separated from each other by 2.5 nm but 
are connected by a hydrophobic tunnel wide enough to accommodate indole (Fig- 
ure 25.39). Thus, indole, the product of the reaction catalyzed by the a-subunit (see 
Figure 25.37, reaction 10), can be transferred directly to the B-subunit, which cat- 
alyzes condensation with serine to yield Trp (see Figure 25.37, reaction 11). Thus, 
indole is not lost from the enzyme complex and diluted in the surrounding milieu. 
This phenomenon of direct transfer of enzyme-bound metabolic intermediates, or 
tunneling, increases the efficiency of the overall pathway by preventing loss and di- 
lution of the intermediate. 


Histidine Biosynthesis and Purine Biosynthesis Are Connected 
by Common Intermediates 


Like aromatic amino acid biosynthesis, histidine biosynthesis shares metabolic in- 
termediates with the pathway of purine nucleotide synthesis. The pathway involves 
ten separate steps, the first being an unusual reaction that links ATP and PRPP 


25.4 How Do Organisms Synthesize Amino Acids? 803 


:NHo 
aar. FY 


Oo N 
N 
H H + | | X 
HCN 
H 0-8-8 NT Te- 8-0- 
HO OH 
‘ ATP B 
5-Phosphoribosyl- 
a-pyrophosphate (PRPP) 
(1) > PE 
n7 n—By8-8-® H 
TS: oo. HO + 2NADH + 
zH 
BA N Cu 2 NAD? 3 Ht 





























HO OH HəCOH 
N!-5'-Phosphoribosyl-ATP LHistidinol Histidine 
HO 
(2) 2 
o 
PP H,O 
H H 
n\n t uc-N Hom, 
| fu a-Ketoglutarate Glutamate | CH 
— Cx u 
R, 7 
HN N ~ ~ 
i CHo = CH, 
®-o-HC o N-C | Ə 
H HC — NH4 C=O 
H H | | 
4 A CH, — O —® CH, — O0 —® 
HO OH L-Histidinol phosphate Imidazole acetol 
N!-5'-Phosphoribosyl-AMP phosphate 
ô H,O H,O (6) 


n\n ® n\n e H 
— O 














Cs 
- IN IN Glutamine Glutamate | N 
@®—O— HC o N—C HN—CH H— C—OH 
H H © > H—-C—H B th laa 
H H a 
C=O CH, —O — 
HO OH 2 ® 
N!-5'-Phosphoribosylformimino- H— C—OH mo hea 
5-aminoimidazole-4- pose 
carboxamide ribonucleotide H— C— OH 
CH, — O —® n7 Nt ® 
N1.5’-Phosphoribulosylformimino- = 
5-aminoimidazole-4- HN—C NH 
carboxamide ribonucleotide \ 2 
FIGURE 25.40 The pathway of histidine biosynthesis. The enzymes are (1) ATP-phosphoribosyl transferase, 5-Aminoimidazole-4- 
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phosphatase, and (9) histidinol dehydrogenase. To purine biosynthesis 
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(Figure 25.40). Five carbon atoms from PRPP and one from ATP end up in histi- 
dine. Step 5 involves some novel chemistry: The substrate, phosphoribulosylformimino- 
5-aminoimidazole-4-carboxamide ribonucleotide, picks up an amino group (from the 
amide of glutamine) in a reaction accompanied by cleavage and ring closure to 
yield two imidazole compounds—the histidine precursor, imidazole glycerol phosphate, 
and a purine nucleotide precursor, 5-aminoimidazole-4-carboxamide ribonucleotide 
(AICAR). Note that AICAR as a purine nucleotide precursor can ultimately re- 
plenish the ATP consumed in reaction 1. Nine enzymes act in histidine’s ten syn- 
thetic steps. Reactions 9 and 10, the successive NAD*-dependent oxidations of an 
alcohol to an aldehyde and then to a carboxylic acid, are catalyzed by the same 
dehydrogenase. 


25:5" How Does Amino Acid Catabolism Lead into Pathways 
of Energy Production? 


In normal human adults, close to 90% of the energy requirement is met by oxi- 
dation of carbohydrates and fats; the remainder comes from oxidation of the car- 
bon skeletons of amino acids. The primary physiological purpose of amino acids is 
to serve as the building blocks for protein biosynthesis. The dietary amount of free 
amino acids is trivial under most circumstances. However, if excess protein is con- 
sumed in the diet or if the amount of amino acids released during normal turnover 
of cellular proteins exceeds the requirements for new protein synthesis, the amino 
acid surplus must be catabolized. Also, if carbohydrate intake is insufficient (as dur- 
ing fasting or starvation) or if carbohydrates cannot be appropriately metabolized 
due to disease (as in diabetes mellitus), body protein becomes an important fuel for 
metabolic energy. 


The 20 Common Amino Acids Are Degraded by 20 Different Pathways 
That Converge to Just 7 Metabolic Intermediates 


Because the 20 common amino acids of proteins are distinctive in terms of their car- 
bon skeletons, each amino acid requires its own unique degradative pathway. Be- 
cause amino acid degradation normally supplies only 10% of the body’s energy, 
then, on average, degradation of any given amino acid will satisfy less than 1% of 
energy needs. Therefore, we will not discuss these pathways in detail. It so happens, 
however, that degradation of the carbon skeletons of the 20 common a-amino 
acids converges to just 7 metabolic intermediates: acetyl-CoA, succinyl-CoA, pyruvate, 
a-ketoglutarate, fumarate, oxaloacetate, and acetoacetate. Because succinyl-CoA, pyru- 
vate, a-ketoglutarate, fumarate, and oxaloacetate can serve as precursors for glucose 
synthesis, amino acids giving rise to these intermediates are termed glucogenic. 
Those degraded to yield acetyl-CoA or acetoacetate are termed ketogenic, because 
these substances can be used to synthesize fatty acids or ketone bodies. Some amino 
acids are both glucogenic and ketogenic (Figure 25.41). 


The C-3 Family of Amino Acids: Alanine, Serine, and Cysteine The carbon skele- 
tons of alanine, serine, and cysteine all converge to pyruvate (Figure 25.42). 
Transamination of alanine yields pyruvate: 





Alanine + a-ketoglutarate pyruvate + glutamate (25.10) 





Deamination of serine by serine dehydratase also yields pyruvate. Cysteine is con- 
verted to pyruvate via a number of paths. 

The carbon skeletons of three other amino acids also become pyruvate. Glycine is 
convertible to serine and thus to pyruvate. The three carbon atoms of tryptophan 
that are not part of its indole ring appear as alanine (and, hence, pyruvate) upon 
Trp degradation. Threonine by one of its degradation routes is cleaved to glycine and 
acetaldehyde. The glycine is then converted to pyruvate via serine; the acetaldehyde 
is oxidized to acetyl-CoA (Figure 25.42). 
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A DEEPER LOOK 


Histidine residues in the active sites of enzymes often act directly 
in the enzyme’s catalytic mechanism. Catalytic participation by the 
imidazole group of His and the presence of imidazole as part of 
the purine ring system support a current speculation that life 





Histidine—A Clue to Understanding Early Evolution? 


before the full evolution of protein molecules must have been 
RNA based. This notion correlates with the discovery that RNA 
molecules can have catalytic activity, an idea captured in the term 
ribozyme (see Chapter 13). 








The C-4 Family of Amino Acids: Aspartate and Asparagine Transamination of 
aspartate gives oxaloacetate: 





Aspartate + a-ketoglutarate oxaloacetate + glutamate (25.11) 





Hydrolysis of asparagine by asparaginase yields aspartate and NH,". Alternatively, 
aspartate degradation via the urea cycle leads to a different citric acid cycle inter- 
mediate, namely, fumarate (Figure 25.23). 


The C-5 Family of Amino Acids Is Converted to a-Ketoglutarate Via Glutamate 
The five-carbon citric acid cycle intermediate a-ketoglutarate is always a product 
of transamination reactions involving glutamate. Thus, glutamate and any amino 
acid convertible to glutamate are classified within the C-5 family. These amino 
acids include glutamine, proline, arginine, and histidine (Figure 25.43). 
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The histidine carbons, numbered 1 through 5, become 
carbons 1 through 5 of glutamate, as indicated. 


a-Ketoglutarate 
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The Serine Dehydratase Reaction—A £-Elimination 
The degradation of serine to pyruvate (see Figure 25.42) isan ex- goes a net two-electron reduction or oxidation (depending on the 
ample of a pyridoxal phosphate-catalyzed B-elimination reaction. direction), whereas B-eliminations occur with no net oxidation or 
B-Eliminations mediated by PLP yield products that have under- reduction of PLP. Note too that the aminoacrylate released from 
gone a two-electron oxidation at Cg. Serine is thus oxidized to pyru- PLP is unstable in aqueous solution. It rapidly tautomerizes to the 
vate, with release of ammonium ion (see accompanying figure). At preferred imine form, which is spontaneously hydrolyzed to yield 
first, this looks like a transaminase half-reaction, but there is anim- the a-keto acid product—pyruvate in this case. 
portant difference. In each transaminase half-reaction, PLP under- 
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Degradation of Valine, Isoleucine, and Methionine Leads to Succinyl-CoA The 
breakdown of valine, isoleucine, and methionine converges at propionyl-CoA (Figure 
25.44). Methionine first becomes Sadenosylmethionine and then homocysteine 
(see Figure 25.28). The carboxyl groups from all three are lost as COs. The two dis- 
tal carbon atoms of leucine become acetyl-CoA. Propionyl-CoA is subsequently con- 
verted to methylmalonyl-CoA and thence to succinyl-CoA via the same reactions me- 
diating the oxidation of fatty acids that have odd numbers of carbon atoms (see 
Chapter 23). 


Leucine Is Degraded to Acetyl-CoA and Acetoacetate Leucine is one of only two 
purely ketogenic amino acids; the other is lysine. Deamination of leucine via 
a transamination reaction yields a-ketoisocaproate, which is oxidatively decarboxylated 
to isovaleryl-CoA (Figure 25.45). Subsequent reactions, one of which is a biotin- 
dependent carboxylation, give B-hydroxy-B-methylglutaryl-CoA, which is then cleaved 
to yield acetyl-CoA and acetoacetate, a ketone body (see Figure 23.26). Neither of these 
products is convertible to glucose. 

The initial steps in valine, leucine, and isoleucine degradation are identical. All three 
are first deaminated to a-keto acids by the branched-chain amino acid aminotrans- 
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FIGURE 25.45 Leucine is degraded to acetyl-CoA and acetoacetate. 


ferase, a mitochondrial enzyme. The resulting a-keto acids are then oxidatively decar- 
boxylated by the CoA-dependent branched-chain @-keto acid dehydrogenase complex 
(BCKAD complex) to form CoA derivatives. Maple syrup urine disease is a hereditary 
defect in the oxidative decarboxylation of these branched-chain a-keto acids. The meta- 
bolic block created by this defect leads to elevated levels of valine, leucine, and 
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isoleucine (and their corresponding branched-chain a-keto acids) in the blood and 
urine. The urine of individuals with this disease smells like maple syrup. The defect is 
fatal unless dietary intake of these amino acids is greatly restricted early in life. 


Lysine Degradation Lysine degradation proceeds by several pathways, but the 
saccharopine pathway found in liver predominates (Figure 25.46). This degradative 
route proceeds backward along the lysine biosynthetic pathway through saccharopine 
and a-aminoadipate to a-ketoadipate (see Figure 25.24). Next, a-ketoadipate under- 
goes oxidative decarboxylation to glutaryl-CoA, which is then transformed into 
acetoacetyl-CoA and ultimately into the ketone body, acetoacetate. 

As indicated earlier, degradation of the nonindole carbons of tryptophan yields 
pyruvate. The indole ring of Trp is converted by a series of reactions to 
a-ketoadipate and ultimately acetoacetate by these same reactions of Lys degradation. 


Phenylalanine and Tyrosine Are Degraded to Acetoacetate and Fumarate The first 
reaction in phenylalanine degradation is the hydroxylation reaction of tyrosine biosyn- 
thesis (see Figure 25.38). Both these amino acids thus share a common degradative 
pathway. Transamination of Tyr yields the a-keto acid p-hydroxyphenylpyruvate (Figure 
25.47, reaction 1). p-Hydroxyphenylpyruvate dioxygenase, a vitamin C—dependent 
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HUMAN BIOCHEMISTRY 


Alkaptonuria and phenylketonuria are two human genetic dis- 
eases arising from specific enzyme defects in phenylalanine 
degradation. Alkaptonuria is characterized by urinary excretion 
of large amounts of homogentisate and results from a deficiency 
in homogentisate dioxygenase (step 3, Figure 25.47). Air oxida- 
tion of homogentisate causes urine to turn dark on standing, but 
the only malady suffered by carriers of this disease is a tendency 
toward arthritis later in life. 

In contrast, phenylketonurics, whose urine contains excessive 
phenylpyruvate (see accompanying figure), suffer severe mental re- 
tardation if the defect is not recognized immediately after birth 
and treated by putting the victim on a diet low in phenylalanine. 
These individuals are deficient in phenylalanine hydroxylase (Fig- 
ure 25.38), and the excess Phe that accumulates is transaminated 
to phenylpyruvate and excreted. 





Hereditary Defects in Phe Catabolism Underlie Alkaptonuria and Phenylketonuria 
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enzyme, then carries out a ring hydroxylation—oxidative decarboxylation to yield 
homogentisate (Figure 25.47, reaction 2). Ring opening and isomerization (Figure 
25.47, reactions 3 and 4) give 4-fumarylacetoacetate, which is hydrolyzed to acetoacetate 
and fumarate (reaction 6). 


Animals Differ in the Form of Nitrogen That They Excrete 


Animals often enjoy a dietary surplus of nitrogen. Excess nitrogen liberated upon 
metabolic degradation of amino acids is excreted by animals in three different ways, 
in accord with the availability of water. Aquatic animals simply release free ammonia 
to the surrounding water; such animals are termed ammonotelic (from the Greek 
telos, meaning “end”). On the other hand, terrestrial and aerial species employ mech- 
anisms that convert ammonium to less toxic waste compounds that require little HO 
for excretion. Many terrestrial vertebrates, including humans, are ureotelic, meaning 
that they excrete excess N as urea, a highly water-soluble nonionic substance. Urea is 
formed by ureoteles via the urea cycle (see Figure 25.23). The uricotelic organisms 
are those animals using the third means of N excretion, conversion to uric acid, a 
rather insoluble purine analog. Birds and reptiles are uricoteles. Uric acid metabo- 
lism is discussed in the next chapter. Some animals can switch from ammonotelic to 
ureotelic to uricotelic metabolism, depending on water availability. 


SUMMARY 


25.1 Which Metabolic Pathways Allow Organisms to Live on Inorganic 
Forms of Nitrogen? Nitrogen, an element essential to life, occurs in 


25.2 What Is the Metabolic Fate of Ammonium? Despite the great 
diversity of organic nitrogenous compounds found in cells, only a limited 


the environment principally as atmospheric Ny and as NOs~ ions in so- 
lution in soils and water. The metabolic pathways of nitrogen fixation 
and nitrate assimilation reduce these oxidized forms of nitrogen to the 
metabolically useful form, ammonium. Nitrate assimilation is a two- 
enzyme pathway: nitrate reductase and nitrite reductase. Nitrate reduc- 
tase is a molybdenum cofactor-dependent flavohemoprotein. Nitrite 
reductase catalyzes the six-electron reduction of NO, to NH,” via a 
siroheme-dependent reaction. Nitrogen fixation is carried out by the ni- 
trogenase system; biological reduction of Ny to 2 NH,* is an ATP- 
dependent eight-electron transfer reaction, with Hg as an obligatory by- 
product. Nitrogenase is a metal-rich enzyme having an 8Fe-7S cluster as 
well as a 7Fe-1Mo-9S cluster known as the FeMo-cofactor. Nitrogenase is 
regulated in two ways: Its activity is inhibited by ADP, and its synthesis is 
repressed by NH,*. 


set of reactions incorporate ammonium ions into organic linkage: 
(1) glutamate dehydrogenase (GDH), (2) glutamine synthetase (GS), 
and (3) carbamoyl-P synthetase. Of these, the first two are quantitatively 
more important. Glutamate dehdyrogenase, by adding NH,’ to the citric 
acid cycle intermediate a-ketoglutarate, sits at the interface between 
nitrogen metabolism and carbohydrate (and energy) metabolism. 
Glutamine synthetase catalyzes the ATP-dependent amidation of the 
y-carboxyl group of Glu. Glutamine is the major donor of —NH, groups 
for the synthesis of many nitrogen-containing organic compounds, in- 
cluding purines, pyrimidines, and other amino acids. As such, its activity 
is tightly regulated. Most ammonium assimilation into organic linkage 
proceeds by one of two routes, depending on NH,’ availability: the 
GDH-GS route when ammonium is abundant and the glutamate syn- 
thase (GOGAT)-GS route when [NH,*] is limiting. 


25.3 What Regulatory Mechanisms Act on Escherichia coli Glutamine 
Synthetase? Glutamine synthetase is a paradigm of enzyme regulation, 
because its activity can be modulated at three different levels: (1) allo- 
steric regulation by feedback inhibition, (2) covalent modification 
through adenylylation of Tyr?’ in each of the 12 GS polypeptide chains, 
and (3) regulation of gene expression by the phosphorylated form of the 
transcriptional enhancer NR,. Allosteric inhibitors of GS include five 
amino acids (Gly, Ser, Ala, His, and Trp), two nucleotides (one purine 
[AMP] and one pyrimidine [CTP]), one aminosugar (glucosamine-6-P), 
and carbamoyl-P. Adenylylation of GS converts it from a more active, allo- 
sterically unresponsive form to a less active, allosterically sensitive form. 
The ratio of adenylylated GS to deadenylylated GS is ultimately deter- 
mined by the [Gln]/[a-KG] ratio, with a low ratio favoring the deadeny- 
lylated state and thus greater synthesis of glutamine. 


25.4 How Do Organisms Synthesize Amino Acids? In many cases, 
amino acid biosynthesis is a matter of synthesizing the appropriate 
a-keto acid carbon skeleton for the amino acid and then transaminating 
this a-keto acid using Glu as amino donor by action of an aminotrans- 
ferase reaction. Mammals have retained the ability to synthesize the 
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a-keto acid analog for 10 of the 20 common amino acids (the so-called 
nonessential amino acids), but the ability to make the other 10 a-keto 
acid analogs has been lost over evolutionary time, rendering these 
10 amino acids as essential in the diet. The common amino acids can be 
grouped into families on the basis of the metabolic progenitor that 
serves as their precursor: The a-ketoglutarate family includes Glu, Gln, 
Pro, Arg, and (sometimes) Lys; the pyruvate family includes Ala, Val, and 
Leu; the aspartate family includes Asp, Asn, Met, Thr, Ile, and (some- 
times) Lys; the 3-phosphoglycerate family includes Ser, Gly, and Cys; and 
the PEP and erythrose-4-P family includes the aromatic amino acids Phe, 
Tyr, and Trp. Histidine is a special case—it is formed from PRPP and 
ATP. AICAR is a byproduct. 


25.5 How Does Amino Acid Catabolism Lead into Pathways of Energy 
Production? The 20 common amino acids are degraded by 20 differ- 
ent pathways that converge to just 7 metabolic intermediates: pyruvate, 
acetyl-CoA, acetoacetate, oxaloacetate, a-ketoglutarate, succinyl-CoA, 
and fumarate. All seven of these compounds are intermediates in or 
readily feed into the pathways of energy production (citric acid cycle 
and oxidative phosphorylation). 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 

chapter at www.cengage.com/login. 

1. What is the oxidation number of N in nitrate, nitrite, NO, NoO, 
and No? 

2. How many ATP equivalents are consumed per N atom of ammo- 
nium formed by (a) the nitrate assimilation pathway and (b) the 
nitrogen fixation pathway? (Assume for this problem NADH, 
NADPH, and reduced ferredoxin are each worth 3 ATPs.) 

3. Suppose at certain specific metabolite concentrations in vivo the 
cyclic cascade regulating E. coli glutamine synthetase has reached a 
dynamic equilibrium where the average state of GS adenylylation is 
poised at n = 6. Predict what change in n will occur if: 

a. [ATP] increases. 

b. Pya/Pup increases. 

c. [a-KG]/[Gln] increases. 
d. [P;] decreases. 

4. How many ATP equivalents are consumed in the production of 
1 equivalent of urea by the urea cycle? 

5. Why are persons on a high-protein diet (such as the Atkins diet) 
advised to drink lots of water? 

6. How many ATP equivalents are consumed in the biosynthesis of 
lysine from aspartate by the pathway shown in Figure 25.27? 

7. If PEP labeled with “C in the 2-position serves as the precursor to 
chorismate synthesis, which C atom in chorismate is radioactive? 

8. (Integrates with Chapter 22.) Write a balanced equation for the syn- 
thesis of glucose (by gluconeogenesis) from aspartate. 

9. For each of the 20 common amino acids, give the name of the 
enzyme that catalyzes the reaction providing its a-amino group. 

10. Which vitamin is central in amino acid metabolism? Why? 

11. Vitamins Bg, Bio, and folate may be recommended for individuals 
with high blood serum levels of homocysteine (a condition called 
hyperhomocysteinemia). How might these vitamins ameliorate homo- 
cysteinemia? 

12. (Integrates with Chapter 19.) On the basis of the following informa- 
tion, predict a reaction mechanism for the mammalian branched- 
chain a-keto acid dehydrogenase complex (the BCKAD complex). 
This complex carries out the oxidative decarboxylation of the a-keto 
acids derived from valine, leucine, and isoleucine. 

a. One form of maple syrup urine disease responds well to adminis- 
tration of thiamine. 


b. Lipoic acid is an essential coenzyme. 
c. The enzyme complex contains a flavoprotein. 

13. People with phenylketonuria must avoid foods containing the low- 
calorie sweetener Aspartame, also known as NutraSweet. Find the 
structure of Aspartame in the Merck Index (or other scientific 
source) and state why these people must avoid this substance. 

14. Glyphosate (otherwise known as RoundUp) is an analog of PEP. It 
acts as a noncompetitive inhibitor of 3-enolpyruvylshikimate-5-P 
synthase; it has the following structure in its fully protonated state: 


HOOC—CH,—NH—CH,—PO;H, 





Consult Figures 25.35 and 25.36 and construct a list of the diverse 
metabolic consequences that might be experienced by a plant cell 
exposed to glyphosate. 


15. (Integrates with Chapter 18.) When cells convert glucose to glycine, 
which carbon atoms of glucose are represented in glycine? 


16. Although serine is a nonessential amino acid, serine deficiency syn- 
drome has been observed in humans. One such form of the syn- 
drome is traceable to a deficiency in 3-phosphoglycerate dehydro- 
genase (see Figure 25.31). Individuals with this syndrome not only 
are serine-deficient but also are impaired in their ability to synthe- 
size another common amino acid, as well as a class of lipids. De- 
scribe why. 


17. Go to www.pdb.org and examine the pdb file 1LM1 for glutamate 
synthase. Find its iron-sulfur cluster and FMN prosthetic group. 
Discover how this enzyme is organized into an N-terminal domain 
that functions in ammonia removal from glutamine (the glutam- 
inase domain) and the a-ketoglutarate—binding site near the Fe/S 
and flavin prosthetic groups. Consult van den Heuvel, R. H. H., et 
al., 2002. Structural studies on the synchronization of catalytic cen- 
ters in glutamate synthase. Journal of Biological Chemistry 277: 
24579-24583, to see how these two sites are connected by a tunnel 
for passage of ammonia from glutamine to a-ketoglutarate. 


18. The thermic effect of food is a term used to describe the energy cost 
of processing the food we eat, digesting it, and either turning it into 
precursors for needed biosynthesis, usable energy in the form of 
ATP, or storing the excess intake as fat. The thermic effect is usually 
approximated at 10% of the total calories consumed, but the ther- 
mic effect of fat is only 2% to 3% of total fat calories and the ther- 
mic effect of protein is 30% or more of calories consumed as pro- 
tein. Why do you suppose dietary protein has a much higher 
thermic effect than either dietary carbohydrate or fat? 
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Preparing for the MCAT Exam 

19. From the dodecameric (œı9) structure of glutamine synthetase 
shown in Figure 25.14, predict the relative enzymatic activity of GS 
monomers (isolated a-subunits). 


20. Consider the synthesis and degradation of tyrosine as shown in 
Figures 25.37, 25.38, and 25.47 to determine where the carbon 
atoms in PEP and erythrose-4-P would end up in acetoacetate and 
fumarate. 
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Synthesis and Degradation 
of Nucleotides 





ESSENTIAL QUESTION 


Virtually all cells are capable of synthesizing purine and pyrimidine nucleotides. 
These compounds then serve as essential intermediates in metabolism and as the 
building blocks for DNA and RNA synthesis. 


How do cells synthesize purines and pyrimidines? 


Nucleotides are ubiquitous constituents of life, actively participating in the majority 
of biochemical reactions. Recall that ATP is the “energy currency” of the cell, that 
uracil nucleotide derivatives of carbohydrates are common intermediates in cellu- 
lar transformations of carbohydrates (see Chapter 22), and that biosynthesis of 
phospholipids proceeds via cytosine nucleotide derivatives (see Chapter 24). In 
Chapter 30, we will see that GTP serves as the immediate energy source driving the 
endergonic reactions of protein synthesis. Many of the coenzymes (such as coen- 
zyme A, NAD, NADP, and FAD) are derivatives of nucleotides. Nucleotides also act 
in metabolic regulation, as in the response of key enzymes of intermediary metabo- 
lism to the relative concentrations of AMP, ADP, and ATP (PFK is a prime example 
here; see also Chapter 18). Furthermore, cyclic derivatives of purine nucleotides 
such as cAMP and cGMP have no other role in metabolism than regulation. Last but 
not least, nucleotides are the monomeric units of nucleic acids. Deoxynucleoside 
triphosphates (dNTPs) and nucleoside triphosphates (NTPs) serve as the immedi- 
ate substrates for the biosynthesis of DNA and RNA, respectively (see Part 4). 


26v10 Can Cells Synthesize Nucleotides? 


Nearly all organisms can make the purine and pyrimidine nucleotides via so-called de 
novo biosynthetic pathways. (De novo means “anew”; a less literal but more apt transla- 
tion might be “from scratch” because de novo pathways are metabolic sequences that 
form complex end products from rather simple precursors.) Many organisms also have 
salvage pathways to recover purine and pyrimidine compounds obtained in the diet or 
released during nucleic acid turnover and degradation. Whereas the ribose of nu- 
cleotides can be catabolized to generate energy, the nitrogenous bases do not serve as 
energy sources; their catabolism does not lead to products used by pathways of energy 
conservation. Compared to slowly dividing cells, rapidly proliferating cells synthesize 
larger amounts of DNA and RNA per unit time. To meet the increased demand for nu- 
cleic acid synthesis, substantially greater quantities of nucleotides must be produced. 
The pathways of nucleotide biosynthesis thus become attractive targets for the clinical 
control of rapidly dividing cells such as cancers or infectious bacteria. Many antibiotics 
and anticancer drugs are inhibitors of purine or pyrimidine nucleotide biosynthesis. 


26:25 How Do Cells Synthesize Purines? 


Substantial insight into the de novo pathway for purine biosynthesis was provided in 
1948 by John Buchanan, who cleverly exploited the fact that birds excrete excess ni- 
trogen principally in the form of uric acid, a water-insoluble purine analog. Buchanan 
fed isotopically labeled compounds to pigeons and then examined the distribution of 
the labeled atoms in wric acid (Figure 26.1). By tracing the metabolic source of the var- 
ious atoms in this end product, he showed that the nine atoms of the purine ring 


© Adam Woolfitt/CORBIS 





Pigeon drinking at Gaia Fountain, Siena, Italy. The basic 
features of purine biosynthesis were elucidated ini- 
tially from metabolic studies of nitrogen metabolism 
in pigeons. Pigeons excrete excess N as uric acid,a 
purine analog. 





Guano, a substance found on some coasts 
frequented by sea birds, is composed chiefly of 
the birds’ partially decomposed excrement. .. . 
The name for the purine guanine derives 
from the abundance of this base in guano. 


J. C. Nesbit 
On Agricultural Chemistry and the Nature 
and Properties of Peruvian Guano (1850) 





KEY QUESTIONS 


26.1 Can Cells Synthesize Nucleotides? 





26.2 ow Do Cells Synthesize Purines? 
26.3 Can Cells Salvage Purines? 





26.4 ow Are Purines Degraded? 





26.5 ow Do Cells Synthesize Pyrimidines? 





26.6 ow Are Pyrimidines Degraded? 





26.7 ow Do Cells Form the Deoxyribo- 
ucleotides That Are Necessary for DNA 
Synthesis? 


=] 











26.8 ow Are Thymine Nucleotides Synthesized? 





CENGAGENOW” Create your own study path for 
this chapter with tutorials, simulations, animations, 
and Active Figures at www.cengage.com/login. 
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FIGURE 26.1 Nitrogen waste is excreted by birds princi- 
pally as the purine analog, uric acid. 





Human phosphoribosyl- 
pyrophosphate synthetase I 
(pdb id = 2H06) 
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FIGURE 26.2 The metabolic origin of the nine atoms in the purine ring system. 


system (Figure 26.2) are contributed by aspartic acid (N-1), glutamine (N-3 and N-9), 
glycine (C-4, C-5, and N-7), CO» (C-6), and THF one-carbon derivatives (C-2 and 
C-8). THF is tetrahydrofolate, a coenzyme serving as a one-carbon transfer agent, not 
only in purine ring synthesis but also in amino acid metabolism (see Figures 25.27 
and 25.32) and in synthesis of the pyrimidine thymine (see Figure 26.26). The for- 
mation and function of THF is summarized in A Deeper Look on pages 816-817. 


IMP Is the Immediate Precursor to GMP and AMP 


The de novo synthesis of purines occurs in an interesting manner: The atoms form- 
ing the purine ring are successively added to ribose-5-phosphate; thus, purines are di- 
rectly synthesized as nucleotide derivatives by assembling the atoms that comprise 
the purine ring system directly on the ribose. In step 1, ribose-5-phosphate is acti- 
vated via the direct transfer of a pyrophosphoryl group from ATP to C-1 of the 
ribose, yielding 5-phosphoribosyl-a-pyrophosphate (PRPP) (Figure 26.3). The enzyme 
is ribose-5-phosphate pyrophosphokinase. PRPP is the limiting substance in purine 
biosynthesis. The two major purine nucleoside diphosphates, ADP and GDP, are 
negative effectors of ribose-5-phosphate pyrophosphokinase. However, because 
PRPP serves additional metabolic needs, the next reaction is actually the committed 
step in the pathway. 

Step 2 (Figure 26.3) is catalyzed by glutamine phosphoribosyl pyrophosphate 
amidotransferase. The anomeric carbon atom of the substrate PRPP is in the 
a-configuration; the product is a B-glycoside (recall that all the biologically impor- 
tant nucleotides are B-glycosides). The N atom of this M-glycoside becomes N-9 of 
the nine-membered purine ring; it is the first atom added in the construction of this 
ring. Glutamine phosphoribosyl pyrophosphate amidotransferase is subject to feed- 
back inhibition by GMP, GDP, and GTP as well as AMP, ADP, and ATP. The G series 
of nucleotides interacts at a guanine-specific allosteric site on the enzyme, whereas 
the adenine nucleotides act at an A-specific site. The pattern of inhibition by these 
nucleotides is such that residual enzyme activity is expressed until sufficient 
amounts of both adenine and guanine nucleotides are synthesized. Glutamine 
phosphoribosyl pyrophosphate amidotransferase is also sensitive to inhibition by 
the glutamine analog azaserine (Figure 26.4). Azaserine has been used as an anti- 
tumor agent because it irreversibly inactivates glutamine-dependent enzymes by re- 
acting with nucleophilic groups at the glutamine-binding site. Two such enzymes 
are found at steps 2 and 5 of the purine biosynthetic pathway. 

Step 3 is carried out by glycinamide ribonucleotide synthetase (GAR synthetase) via 
its ATP-dependent condensation of the glycine carboxyl group with the amine of 
5-phosphoribosyl-B-amine (see Figure 26.3). The reaction proceeds in two stages. First, 
the glycine carboxyl group is activated via ATP-dependent phosphorylation. Next, an 
amide bond is formed between the activated carboxyl group of glycine and the 
B-amine. Glycine contributes C-4, C-5, and N-7 of the purine. 

Step 4 is the first of two THF-dependent reactions in the purine pathway. GAR 
transformylase transfers the N -formyl group of N'°-formyl-THF to the free amino 


> CENGAGENOW” ACTIVE FIGURE 26.3 The de novo pathway for purine synthesis. IMP (inosine monophos- 
phate or inosinic acid) serves as a precursor to AMP and GMP Test yourself on the concepts in this figure 
at www.cengage.com/login. 
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A DEEPER LOOK 


Tetrahydrofolate and One-Carbon Units 


Folic acid, a B vitamin found in green plants, fresh fruits, yeast, (a) 
and liver, takes its name from folium, Latin for “leaf.” Folic acid is 
a pterin (the 2-amino-4-oxo derivative of pteridine); pterins are 
named from the Greek word pté ryj, for “wing,” because these sub- 
stances were first identified as the pigments in insect wings. Mam- 
mals cannot synthesize pterins and thus cannot make folates; they 
derive folates from their diet or from microorganisms in their in- 
testines. (See A Deeper Look on page 818 for the complete struc- 
ture of folate.) 

Folates are acceptors and donors of one-carbon units for all ox- 
idation levels of carbon except CO, (for which biotin is the rele- 
vant carrier). The active form is tetrahydrofolate (THF). THF is 
formed through two successive reductions of folate by dihydrofolate 
reductase (panel a of figure). One-carbon units in three different 
oxidation states may be bound to THF at the N° or N” nitrogens H 
(table and panel b of figure). The one-carbon unit carried by THF 

HoN 


l 
can come from formate (HCOO7), the a-carbon of glycine, the N N. a 
B-carbon of serine (see Figure 25.32), or the 3-position carbon ae | 
in the imidazole ring of histidine. NADPH-dependent reactions HN L 
N“ “CH, —N —R 
5 | 


H 
9 1 


HoN N. N. 
2 T i SS 
HN Z 0 
N° “CHg=—N—RB 
ő | 


H 
Folate 


NADPH + H* 


NADP+ 


interconvert the oxidation states of the various THF-bound one- 


carbon units. 


H 
Dihydrofolate 


NADPH + H*. 






One-Carbon 
Formt 


Oxidation 


Number* Oxidation Level 


Tetrahydrofolate Form 





NADP* 


H 
| 
HN. N [iy 
y H 
HN H 
N CH —N—R 
O | 
H H 


Tetrahydrofolate 


=2 Methanol (most reduced) 
0 Formaldehyde 
2 Formate (most oxidized) 


—CH; 
—CH,— 
—CH=O 
—CH=O 
—CH=NH 
—CH= 


N*-Methyl-THF 
N°,N'°-Methylene-THF 
N°-Formyl-THE 
N!°-Formyl-THF 
N°-Formimino-THF 
N°,N'°-Methenyl-THF 











*Calculated by assigning valence bond electrons to the more electronegative atom and then counting the charge on the 
quasi ion. A carbon assigned four valence electrons would have an oxidation number of 0. The carbon in N>-methyl-THF 
is assigned six electrons from the three C—H bonds and thus has an oxidation number of —2. 

tNote: All vacant bonds in the structures shown are to atoms more electronegative than C. 








group of GAR to yield a-N-formylglycinamide ribonucleotide (FGAR). Thus, C-8 of the 

purine is “formyl-ly” introduced. Although all of the atoms of the imidazole portion 

of the purine ring are now present, the ring is not closed until Reaction 6. 

Step 5 is catalyzed by FGAR amidotransferase (also known as FGAM synthetase). ATP- 
dependent transfer of the glutamine amido group to the C-4-carbonyl of FGAR yields 
formylglycinamidine ribonucleotide (FGAM). The imino-N becomes N-3 of the purine. 

Step 6 is an ATP-dependent dehydration that leads to formation of the imidazole 

Il ye ring. ATP is used to phosphorylate the oxygen atom of the formyl group, activating 








FIGURE 26.4 The structure of azaserine. Azaserine acts 
as an irreversible inhibitor of glutamine-dependent 
enzymes by covalently attaching to nucleophilic 
groups in the glutamine-binding site. 


it for the ring closure step that follows. Because the product is 5-aminoimidazole ribo- 
nucleotide, or AIR, this enzyme is called AIR synthetase. In avian liver, the enzymatic 
activities for steps 3, 4, and 6 (GAR synthetase, GAR transformylase, and AIR syn- 
thetase) reside on a single, 110-kD multifunctional polypeptide. 

In step 7, carbon dioxide is added at the C-4 position of the imidazole ring by AIR 
carboxylase in an ATP-dependent reaction; the carbon of COs will become C-6 of the 
purine ring. The product is carboxyaminoimidazole ribonucleotide (CAIR). 

In step 8, the amino-N of aspartate provides N-1 through linkage to the C-6 carboxyl 
function of CAIR. ATP hydrolysis drives the condensation of Asp with CAIR. The 
product is N-succinylo-5-aminoimidazole-4-carboxamide ribonucleotide (SAICAR). SAICAR 
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synthetase catalyzes the reaction. The enzymatic activities for steps 7 and 8 reside on a 
single, bifunctional polypeptide in avian liver. 

Step 9 removes the four carbons of Asp as fumarate in a nonhydrolytic cleavage. 
The product is 5-aminoimidazole-4-carboxamide ribonucleotide (AICAR); the enzyme is 
adenylosuccinase (adenylosuccinate lyase). Adenylosuccinase acts again in that part of 
the purine pathway leading from IMP to AMP and takes its name from this latter re- 
action (see following). AICAR is also a byproduct of the histidine biosynthetic path- 
way (see Chapter 25), but because ATP is the precursor to AICAR in that pathway, 
no net purine synthesis is achieved. 

Step 10 adds the formyl carbon of N!°formyl-THF as the ninth and last atom nec- 
essary for forming the purine nucleus. The enzyme is called AICAR transformylase; 
the products are THF and N-formylaminoimidazole-4-carboxamide ribonucleotide (FAICAR). 

Step 11 involves dehydration and ring closure to form the purine nucleotide IMP 
(inosine-5'-monophosphate or inosinic acid); this completes the initial phase of purine 
biosynthesis. The enzyme is IMP cyclohydrolase (also known as JMP synthase and 
inosinicase). Unlike step 6, this ring closure does not require ATP. In avian liver, the en- 
zymatic activities catalyzing steps 10 and 11 (AICAR transformylase and inosinicase) ac- 
tivities reside on 67-kD bifunctional polypeptides organized into 135-kD dimers. 
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Note that 6 ATPs are required in the purine biosynthetic pathway from ribose-5- 
phosphate to IMP: one each at steps 1, 3, 5, 6, 7, and 8. However, 7 high-energy 
phosphate bonds (equal to 7 ATP equivalents) are consumed because a-PRPP for- 
mation in Reaction 1 followed by PP; release in Reaction 2 represents the loss of 


2 ATP equivalents. 


HUMAN BIOCHEMISTRY 


Folate Analogs as Antimicrobial and Anticancer Agents 


The dependence of de novo purine biosynthesis on folic acid com- 
pounds at steps 4 and 10 means that antagonists of folic acid me- 
tabolism indirectly inhibit purine formation and, in turn, nucleic 
acid synthesis, cell growth, and cell division. Clearly, rapidly divid- 
ing cells such as malignancies or infective bacteria are more sus- 
ceptible to these antagonists than slower-growing normal cells. 
Among the folic acid antagonists are sulfonamides (see accompa- 
nying figure). Folic acid is a vitamin for animals and is obtained in 
the diet. In contrast, bacteria synthesize folic acid from precursors, 
including p-aminobenzoic acid (PABA), and thus are more suscep- 
tible to sulfonamides than are animal cells. 


(a) 
Sulfonamides have the generic structure: HoN ~< \- 
O 
Ź 
PABA (p-aminobenzoic acid) HoN C 
Non 


THF (tetrahydrofolate) 


Formation of THF, the functional folate form, depends on re- 
duction of folate (and dihydrofolate or DHF) by dihydrofolate 
reductase, or DHFR (see A Deeper Look on page 816). Metho- 
trexate (amethopterin), aminopterin, and trimethoprim are three 
analogs of folic acid. The first two have been used in cancer 
chemotherapy and the treatment of autoimmune disorders. Each 
binds to DHFR with about 1000-fold greater affinity than folate or 
DHF, thus acting as a virtually irreversible inhibitor of THF for- 
mation. Trimethoprim acts more effectively on bacterial DHFR 
and is prescribed for infections of the urinary tract. 
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AMP and GMP Are Synthesized from IMP 


IMP is the precursor to both AMP and GMP. These major purine nucleotides are 
formed via distinct two-step metabolic pathways that diverge from IMP. The branch 
leading to AMP (adenosine 5’-monophosphate) involves the displacement of the 6-O 
group of inosine with aspartate (Figure 26.5) in a GIP-dependent reaction, followed 
by the nonhydrolytic removal of the four-carbon skeleton of Asp as fumarate; the Asp 
amino group remains as the 6-amino group of AMP. Adenylosuccinate synthetase and 
adenylosuccinase are the two enzymes. Recall that adenylosuccinase also acted at step 
9 in the pathway from ribose-5-phosphate to IMP. Fumarate production provides a con- 
nection between purine synthesis and the citric acid cycle. 

The formation of GMP from IMP requires oxidation at C-2 of the purine ring, fol- 
lowed by a glutamine-dependent amidotransferase reaction that replaces the oxygen 
on C-2 with an amino group to yield 2-amino, 6-oxy purine nucleoside monophosphate, or 
as this compound is commonly known, guanosine monophosphate (Figure 26.5). The 
enzymes in the GMP branch are IMP dehydrogenase and GMP synthetase. Note 
that, starting from ribose-5-phosphate, 8 ATP equivalents are consumed in the syn- 
thesis of AMP and 9 in the synthesis of GMP. 


The Purine Biosynthetic Pathway Is Regulated at Several Steps 


The regulatory network that controls purine synthesis is schematically represented 
in Figure 26.6. To recapitulate, the purine biosynthetic pathway from ribose-5- 
phosphate to IMP is allosterically regulated at the first two steps. Ribose-5-phosphate 
pyrophosphokinase, although not the committed step in purine synthesis, is subject 
to feedback inhibition by ADP and GDP. The enzyme catalyzing the next step, gluta- 
mine phosphoribosyl pyrophosphate amidotransferase, has two allosteric sites, one 
where the “A” series of nucleoside phosphates (AMP, ADP, and ATP) binds and feed- 
back-inhibits, and another where the corresponding “G” series binds and inhibits. 
Furthermore, PRPP is a “feed-forward” activator of this enzyme. Thus, the rate of IMP 
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CENGAGENOW™ ANIMATED FIGURE 26.5 The synthesis 
of AMP and GMP from IMP. (a) AMP synthesis: The two 

reactions of AMP synthesis mimic steps 8 and 9 in the 

purine pathway leading to IMP. (b) GMP synthesis. See 

this figure animated at www.cengage.com/login. 





Human adenylosuccinate 
lyase (AMP in red) 
(pdb id =2J91) 


820 Chapter 26 Synthesis and Degradation of Nucleotides 





pect ep N 
TA a as. 
NE 
i, 
a ` 
`x 
` 

















T SON 
a ee A ` 
al PR —_ N 
H “ 
/ Pd XMP > GMP —> GDP —> GTP 
Pe ei TA 7 r 
(7) “ * Ø 
v í y ! IMP dehydrogenase 
Ribose-5-phosphate i + Y 
pyrophosphokinase ' Gln-PRPP amidotransferase 
Ribose-5-P > a-PRPP > Phosphoribosyl-6-l-amine —> —> IMP 
A A 
1 1 
1 
@ © Adenylosuccinate 
\ 
\ k synthetase m @. 
\ ~Y ~X 
\ w ` 
\ oe Adenylosuccinate —> AMP —> ADP —> ATP 
~N Wa a, oer 
`, ` rf} 
N, See rf 
iy, re af 


FIGURE 26.6 The regulatory circuit controlling purine 
biosynthesis. 
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formation by this pathway is governed by the levels of the final end products, the ade- 
nine and guanine nucleotides. 

The purine pathway splits at IMP. The first enzyme in the AMP branch, adenyl- 
osuccinate synthetase, is competitively inhibited by AMP. Its counterpart in the GMP 
branch, IMP dehydrogenase, is inhibited in a similar fashion by GMP. Thus, the fate of 
IMP is determined by the relative levels of AMP and GMP, so any deficiency in the 
amount of either of the principal purine nucleotides is self-correcting. This reciproc- 
ity of regulation is an effective mechanism for balancing the formation of AMP and 
GMP to satisfy cellular needs. Note also that reciprocity is even manifested at the level 
of energy input: GTP provides the energy to drive AMP synthesis, whereas ATP serves 
this role in GMP synthesis (Figure 26.6). 


ATP-Dependent Kinases Form Nucleoside Diphosphates 
and Triphosphates from the Nucleoside Monophosphates 


The products of de novo purine biosynthesis are the nucleoside monophosphates 
AMP and GMP. These nucleotides are converted by successive phosphorylation re- 
actions into their metabolically prominent triphosphate forms, ATP and GTP. The 
first phosphorylation, to give the nucleoside diphosphate forms, is carried out by two 
base-specific, ATP-dependent kinases, adenylate kinase and guanylate kinase. 


Adenylate kinase: AMP + ATP —> 2 ADP 
Guanylate kinase: GMP + ATP —> GDP + ADP 





These nucleoside monophosphate kinases also act on deoxynucleoside mono- 
phosphates to give dADP or dGDP. 

Oxidative phosphorylation (see Chapter 20) is primarily responsible for the con- 
version of ADP into ATP. ATP then serves as the phosphoryl donor for synthesis of 
the other nucleoside triphosphates from their corresponding NDPs in a reaction 
catalyzed by nucleoside diphosphate kinase, a nonspecific enzyme. For example, 


GDP + ATP =—GTP + ADP 





Because this enzymatic reaction is readily reversible and nonspecific with respect to 
both phosphoryl acceptor and donor, in effect any NDP can be phosphorylated by 


any NTP, and vice versa. The preponderance of ATP over all other nucleoside 
triphosphates means that, in quantitative terms, it is the principal nucleoside 
diphosphate kinase substrate. The enzyme does not discriminate between the ri- 
bose moieties of nucleotides and thus functions in phosphoryl transfers involving 
deoxy-NDPs and deoxy-NTPs as well. 


26:38 Can Cells Salvage Purines? 


Nucleic acid turnover (synthesis and degradation) is an ongoing metabolic process 
in most cells. Messenger RNA in particular is actively synthesized and degraded. 
These degradative processes can lead to the release of free purines in the form of 
adenine, guanine, and hypoxanthine (the base in IMP). Purines represent a meta- 
bolic investment by cells. So-called salvage pathways exist to recover them in useful 
form. Salvage reactions involve resynthesis of nucleotides from bases via phospho- 
ribosyltransferases. 


Base + PRPP =— nucleoside-5’-phosphate + PP; 


The subsequent hydrolysis of PP; to inorganic phosphate by pyrophosphatases ren- 
ders the phosphoribosyltransferase reaction effectively irreversible. 

The purine phosphoribosyltransferases are adenine phosphoribosyltransferase 
(APRT), which mediates AMP formation, and hypoxanthine-guanine phosphoribosyl- 
transferase (HGPRT), which can act on either hypoxanthine to form IMP or guanine 
to form GMP (Figure 26.7). 


26:45 How Are Purines Degraded? 


Because nucleic acids are ubiquitous in cellular material, significant amounts are in- 
gested in the diet. Nucleic acids are degraded in the digestive tract to nucleotides by 
various nucleases and phosphodiesterases. Nucleotides are then converted to nucleo- 
sides by base-specific nucleotidases and nonspecific phosphatases. 
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Human HGPRT (Mg” in yellow, 
a-PRPP in blue, purine analog in red) 
(pdb id = 1D6N) 


FIGURE 26.7 Purine salvage by the HGPRT 
reaction. 
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HUMAN BIOCHEMISTRY 


Lesch-Nyhan Syndrome—HGPRT Deficiency Leads to a Severe Clinical Disorder 


The symptoms of Lesch-Nyhan syndrome are tragic: a crippling Ribose-5-P 

gouty arthritis due to excessive uric acid accumulation (uric acid 

is a purine degradation product, discussed in the next section) Qa 
and, worse, severe malfunctions in the nervous system that lead SoS 
to mental retardation, spasticity, aggressive behavior, and self- y 


mutilation. Lesch-Nyhan syndrome results from a complete defi- 
ciency in HGPRT activity. The structural gene for HGPRT is 
located on the X chromosome, and the disease is a congenital, re- 
cessive, sex-linked trait manifested only in males. The severe con- 
sequences of HGPRT deficiency argue that purine salvage has 
greater metabolic importance than simply the energy-saving re- 
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covery of bases. Although HGPRT might seem to play a minor Blocked by HGPRT genetic Nucleotides ----7 a 
role in purine metabolism, its absence has profound conse- deficiency in Lesch-Nyhan 

quences: De novo purine biosynthesis is dramatically increased, syndrome Y 

and uric acid levels in the blood are elevated. Presumably, these HGPRT y Purine turnover 
changes ensue because lack of consumption of PRPP by HGPRT JN 

elevates its availability for glutamine-PRPP amidotransferase, en- A y 

hancing overall de novo purine synthesis and, ultimately, uric a-PRPP Hypoxanthine 

acid production (see accompanying figure). The dramatically el- Guanine 


evated uric acid levels lead to the particular neurological aberra- 
tions characteristic of the syndrome. Fortunately, deficiencies in 
HGPRT activity in fetal cells can be detected following amniocen- 
tesis. However, no medication ameliorates the neurological and 
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behavioral consequences of this disease. 











Nucleosides are hydrolyzed by nucleosidases or nucleoside phosphorylases to re- 
lease the purine base: 


nucleosidase 


Nucleoside + HO base + ribose 


nucleoside phosphorylase 
E ee a 


Nucleoside + P; base + ribose-1-P 


The pentoses liberated in these reactions provide the only source of metabolic en- 
ergy available from purine nucleotide degradation. 

Feeding experiments using radioactively labeled nucleic acids as metabolic tracers 
have demonstrated that little of the nucleotide ingested in the diet is incorporated 
into cellular nucleic acids. Dietary purines are converted to uric acid (see following 
discussion) in the gut and excreted, and pyrimidine nucleosides are inefficiently ab- 
sorbed into the bloodstream. These findings confirm the de novo pathways of nu- 
cleotide biosynthesis as the primary source of nucleic acid precursors. Ingested bases 
are, for the most part, excreted. Nevertheless, cellular nucleic acids do undergo 
degradation in the course of the continuous recycling of cellular constituents. 


The Major Pathways of Purine Catabolism Lead to Uric Acid 


The major pathways of purine catabolism in animals lead to uric acid formation (Fig- 
ure 26.8). The various nucleotides are first converted to nucleosides by intracellular 
nucleotidases. These nucleotidases are under strict metabolic regulation to ensure 
that their substrates, which act as intermediates in many vital processes, are not 
depleted below critical levels. Nucleosides are then degraded by the enzyme purine 
nucleoside phosphorylase (PNP) to release the purine base and ribose-I-P. Note that 
neither adenosine nor deoxyadenosine is a substrate for PNP. Instead, these nucleo- 
sides are first converted to inosine by adenosine deaminase. The PNP products are 
merged into xanthine by guanine deaminase and xanthine oxidase, and xanthine is 
then oxidized to uric acid by this latter enzyme. 
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The Purine Nucleoside Cycle in Skeletal Muscle Serves 
as an Anaplerotic Pathway 


Deamination of AMP to IMP by AMP deaminase (Figure 26.8) followed by resyn- 
thesis of AMP from IMP by the de novo purine pathway enzymes, adenylosuccinate 
synthetase and adenylosuccinate lyase, constitutes a purine nucleoside cycle (Figure 
26.9). This cycle has the net effect of converting aspartate to fumarate plus 
NH,*. Although this cycle might seem like senseless energy consumption, it plays 
an important role in energy metabolism in skeletal muscle: The fumarate that it 
generates replenishes the levels of citric acid cycle intermediates lost in amphi- 
bolic side reactions (see Chapter 19). Skeletal muscle lacks the usual comple- 
ment of anaplerotic enzymes and relies on enhanced levels of AMP deaminase, 
adenylosuccinate synthetase, and adenylosuccinate lyase to compensate. 


Xanthine Oxidase 


Xanthine oxidase (Figure 26.8) is present in large amounts in liver, intestinal mu- 
cosa, and milk. It oxidizes hypoxanthine to xanthine and xanthine to uric acid. 
Xanthine oxidase is a rather indiscriminate enzyme, using molecular oxygen to 
oxidize a wide variety of purines, pteridines, and aldehydes, producing H2O, as a 
product. Xanthine oxidase possesses FAD, nonheme Fe-S centers, and molybdenum 
cofactor (a molybdenum-containing pterin complex; see Figure 25.2) as electron- 
transferring prosthetic groups. Its mechanism of action is diagrammed in Figure 
26.10. In humans and other primates, uric acid is the end product of purine catab- 
olism and is excreted in the urine. Birds, terrestrial reptiles, and many insects also 
excrete uric acid, but in these organisms, uric acid represents the major nitrogen 


HUMAN BIOCHEMISTRY 





Severe Combined Immunodeficiency Syndrome—A Lack of Adenosine Deaminase 
Is One Cause of This Inherited Disease 


Severe combined immunodeficiency syndrome, or SCID, is a 
group of related inherited disorders characterized by the lack of 
an immune response to infectious disease. This immunological in- 
sufficiency is attributable to the inability of B and T lymphocytes 
to proliferate and produce antibodies in response to an antigenic 
challenge. About 30% of SCID patients suffer from a deficiency in 
the enzyme adenosine deaminase (ADA). ADA deficiency is also im- 
plicated in a variety of other diseases, including AIDS, anemia, and 
various lymphomas and leukemias. Gene therapy, the repair of a ge- 
netic deficiency through introduction of a functional recombinant version 
of the gene, has been attempted on individuals with SCID due to a 
defective ADA gene (see Section 12.6). ADA is a Zn?*-dependent 


enzyme, and Zn?* deficiency can also lead to reduced immune 
function. 

In the absence of ADA, deoxyadenosine is not deaminated to 
deoxyinosine as normal (see Figure 26.8). Instead, it is salvaged by 
a nucleoside kinase, which converts it to dAMP. Phosphorylation 
of dAMP leads to dATP, a potent feedback inhibitor of deoxynu- 
cleotide biosynthesis (discussed later in this chapter). Without 
deoxyribonucleotides, DNA cannot be replicated and cells cannot 
divide (see accompanying figure). Rapidly proliferating cell types 
such as lymphocytes are particularly susceptible if DNA synthesis is 
impaired. 





Nucleoside 
. kinase 
Deoxyadenosine > dAMP —> —> dATP 
I 
1 
ADA @ 
deficiency i 
Deoxyinosine ADP 3 dADP dATP DNA 
GDP dGDP dGTP replication 
CDP dCDP dCTP stalled 
UDP. dUDP dTTP 
X v 2 





Deoxynucleoside 5'-triphosphate 





(dNTP) synthesis 





824 Chapter 26 Synthesis and Degradation of Nucleotides 


© 


Pir 
DA Q 
ow oy at 
AMP IMP 
H,O HO 


Nucleotidase 


Nucleotidase 


p ® ® ji 


Adenosine 























deaminase \ A HN N 
Adenosine > Inosine | \ 
LN Paine Sy < ~y Hypoxanthine Adenosine deaminase. Bound 
HO + š H inhibitor (purple) identifies the 
2 NH nucleoside PULP 
phosphorylase active site; Zn2* ion (cyan). 
j ee 
(PNP) er (pdb id = 1WXY) 
2 2 
Xanthine 
oxidase 
HO, 
V 
) O2 
o O o + 
H,O ® ® wy 
N eee 
HN \ fF i HN HN 
A. | \ Xanthosine > asl. | > eee an = 
O N N Nucleotidase Purine f o N Xanthine 
H nucleoside d H oxidase 
phosphorylase a a 
XMP (PNP) Xanthine ric aci 
NH,* 
Guanine 
deaminase 
HO 


o ew 1 
$ De LA. TS 





Nucleotidase Purine s HoN N N 
nucleoside 
phosphorylase ; 
(PNP) Guanine 


GMP 
FIGURE 26.8 The major pathways for purine catabolism. 


excretory compound, because, unlike mammals, they do not also produce urea 
(see Chapter 25). Instead, the catabolism of all nitrogenous compounds, includ- 
ing amino acids, is channeled into uric acid. This route of nitrogen catabolism 
allows these animals to conserve water by excreting crystals of uric acid in paste- 
like solid form. 


Gout Is a Disease Caused by an Excess of Uric Acid 


Gout is the clinical term describing the physiological consequences accompanying 
excessive uric acid accumulation in body fluids. Uric acid and urate salts are rather 
insoluble in water and tend to precipitate from solution if produced in excess. The 
most common symptom of gout is arthritic pain in the joints as a result of urate de- 
position in cartilaginous tissue. The joint of the big toe is particularly susceptible. 
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the urinary tract. Hyperuricemia, chronic elevation of blood uric acid levels, occurs (enol tautomer) (fully oxidized Mo”! form) 
in about 3% of the population as a consequence of impaired excretion of uric acid 
or overproduction of purines. Purine-rich foods such as caviar (fish eggs rich in nu- 
cleic acids) may exacerbate the condition. The biochemical causes of gout are var- 
ied. However, a common treatment is allopurinol (Figure 26.11). This hypoxanthine 
analog binds tightly to xanthine oxidase, thereby inhibiting its activity and prevent- 
ing uric acid formation. Hypoxanthine and xanthine do not accumulate to harmful 
concentrations because they are more soluble and thus more easily excreted. 


FIGURE 26.10 Xanthine oxidase catalyzes a hydroxylase- 
type reaction. 


Different Animals Oxidize Uric Acid to Form Excretory Products 


The subsequent metabolism of uric acid in organisms that don’t excrete it is 
shown in Figure 26.12. In molluscs and in mammals other than primates, uric acid 
is oxidized by urate oxidase to allantoin and excreted. In bony fishes (teleosts), 
uric acid degradation proceeds through yet another step wherein allantoin is hy- 
drolyzed to allantoic acid by allantoinase before excretion. Cartilaginous fish 
(sharks and rays) and amphibians further degrade allantoic acid via the enzyme 
allantoicase to liberate glyoxylic acid and 2 equivalents of urea. Even simpler 
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FIGURE 26.11 Allopurinol, an analog of hypoxanthine, is a potent inhibitor of xanthine oxidase. 
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«í FIGURE 26.12 The catabolism of uric acid to allantoin, allantoic acid, urea, or ammonia in various animals. 


animals, such as most marine invertebrates (crustacea and so forth), use urease to 
hydrolyze urea to CO» and ammonia. In contrast to animals that must rid them- 
selves of potentially harmful nitrogen waste products, microorganisms often are 
limited in growth by nitrogen availability. Many possess an identical pathway of 
uric acid degradation, using it instead to liberate NH; from uric acid so that it can 
be assimilated into organic-N compounds essential to their survival. 


26:55 How Do Cells Synthesize Pyrimidines? 


In contrast to purines, pyrimidines are not synthesized as nucleotide derivatives. In- 
stead, the pyrimidine ring system is constructed before a ribose-5-P moiety is 
attached. Also, only two precursors, carbamoyl-P and aspartate, contribute atoms to 
the six-membered pyrimidine ring (Figure 26.13), compared to seven precursors for 
the nine purine atoms. 

Mammals have two enzymes for carbamoyl phosphate synthesis. Carbamoyl phos- 
phate for pyrimidine biosynthesis is formed by carbamoyl phosphate synthetase II 
(CPS-II), a cytosolic enzyme. Recall that carbamoyl phosphate synthetase I is a mi- 
tochondrial enzyme dedicated to the urea cycle and arginine biosynthesis (see Fig- 
ures 25.22 and 25.23). The substrates of carbamoyl phosphate synthetase II are 
HCO; ", HO, glutamine, and two ATPs (Figure 26.14). Thus, the N atom in car- 
bamoyl phosphate made by CPS-II comes from the amide group of glutamine. The 
first ATP is used to form carboxy phosphate, an activated form of COs (step 1, Fig- 
ure 26.14). The glutamine amide displaces phosphate from carboxy phosphate to 
give carbamate (step 2). Phosphorylation of carbamate by the second ATP in the 
overall reaction yields carbamoyl phosphate (step 3). Because carbamoyl phosphate 
made by CPS-I in mammals has no fate other than incorporation into pyrimidines, 
mammalian CPS-II can be viewed as the committed step in the pyrimidine de novo 
pathway (Figure 26.15, step 1). Bacteria and plants have but one CPS, and its car- 
bamoyl phosphate product is incorporated into arginine as well as pyrimidines. 
Thus, the committed step in bacterial pyrimidine synthesis is the next reaction, 
which is mediated by aspartate transcarbamoylase (ATCase). 

ATCase catalyzes the condensation of carbamoyl phosphate with aspartate to 
form carbamoyl aspartate (Figure 26.15, step 2). No ATP input is required at this 
step because carbamoyl phosphate represents an “activated” carbamoyl group. 

Step 3 of pyrimidine synthesis involves ring closure and dehydration via linkage of 
the —NHb group introduced by carbamoyl phosphate with the former B-COO™ of 
aspartate; this reaction is mediated by the enzyme dihydroorotase. The product of 
the reaction is dihydroorotate (DHO), a six-membered ring compound. Dihydroorotate 
is not a true pyrimidine, but its oxidation yields orotate, which is. This oxidation (step 
4) is catalyzed by dihydroorotate dehydrogenase. Bacterial dihydroorotate dehydro- 
genases are NAD*-linked flavoproteins which are somewhat unusual in possessing 
both FAD and FMN; these enzymes also have nonheme Fe-S centers as additional re- 
dox prosthetic groups. The eukaryotic version of dihydroorotate dehydrogenase is a 
protein component of the inner mitochondrial membrane; its immediate e~ accep- 
tor is a quinone, and oxidation of the reduced quinone by the mitochondrial e 
transport chain can drive ATP synthesis via oxidative phosphorylation. At this stage, 
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FIGURE 26.13 The metabolic origin of the six atoms of the pyrimidine ring. 
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FIGURE 26.15 The de novo pyrimidine biosynthetic pathway. 
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FIGURE 26.16 (a) CTP synthesis from UTP. (b) Structure 
of CTP synthetase dimer with bound Mg? ions (pdb 
id = 181M). 


ribose-5-phosphate is joined to N-1 of orotate in appropriate N-G-glycosidic configu- 
ration, giving the pyrimidine nucleotide orotidine-5’-monophosphate, or OMP (step 5, 
Figure 26.15). The ribose phosphate donor is PRPP; the enzyme is orotate phospho- 
ribosyltransferase. The next reaction is catalyzed by OMP decarboxylase. Decar- 
boxylation of OMP gives UMP (uridine-5’-monophosphate, or uridylic acid), one of the 
two common pyrimidine ribonucleotides. 


“Metabolic Channeling” by Multifunctional Enzymes of Mammalian 
Pyrimidine Biosynthesis 

CPS-II (Figure 26.15), glutamine 5-phosphoribosyl-a-pyrophosphate amidotrans- 
ferase (see Figure 26.3), and CTP synthetase (Figure 26.16) are enzymes with metabo- 
lite tunnels (see A Deeper Look on page 802). In a subtle distinction, metabolic 
channeling refers to the transfer of metabolites between different enzymic sites on 
a multifunctional polypeptide chain. 

In bacteria, the six enzymes of de novo pyrimidine biosynthesis exist as distinct 
proteins, each independently catalyzing its specific step in the overall pathway. In 
contrast, in mammals, the six enzymatic activities are distributed among only three 
proteins, two of which are multifunctional polypeptides. The first three steps of 
pyrimidine synthesis, CPS-II, aspartate transcarbamoylase, and dihydroorotase, are 
all localized on a single 210-kD cytosolic polypeptide. This multifunctional enzyme 
is the product of a solitary gene, yet it is equipped with the active sites for all three 
enzymatic activities. Step 4 in Figure 26.15 is catalyzed by DHO dehydrogenase, a 
separate enzyme associated with the outer surface of the inner mitochondrial mem- 
brane, but the enzymatic activities mediating steps 5 and 6, namely, orotate phos- 
phoribosyltransferase and OMP decarboxylase in mammals, are also found on a sin- 
gle cytosolic polypeptide known as UMP synthase. 

The purine biosynthetic pathway of avian liver also provides examples of meta- 
bolic channeling. Recall that steps 3, 4, and 6 of de novo purine synthesis are cat- 
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alyzed by three enzymatic activities localized on a single multifunctional polypep- 
tide, and steps 7 and 8 and steps 10 and 11 by respective bifunctional polypeptides 
(see Figure 26.3). Fatty acid biosynthesis in animals is another instance of this mode 
of metabolic organization (see Figure 24.9). 

Such multifunctional enzymes confer an advantage: The product of one reaction 
in a pathway is the substrate for the next. In multifunctional enzymes, such prod- 
ucts remain bound and are channeled directly to the next active site, rather than 
dissociated into the surrounding medium for diffusion to the next enzyme. This 
metabolic channeling is more efficient for a variety of reasons: Transit time for 
movement from one active site to the next is shortened, substrates are not diluted 
into the solvent phase, chemically reactive intermediates are protected from de- 
composition in the aqueous milieu, no pools of intermediates accumulate, and in- 
termediates are shielded from interactions with other enzymes that might metabo- 
lize them. 


UMP Synthesis Leads to Formation of the Two Most Prominent 
Ribonucleotides—UTP and CTP 


The two prominent pyrimidine ribonucleotide products are derived from UMP 
via the same unbranched pathway. First, UDP is formed from UMP via an ATP- 
dependent nucleoside monophosphate kinase. 


UMP + ATP == UDP + ADP 





Then, UTP is formed by nucleoside diphosphate kinase. 
UDP + ATP == UTP + ADP 








Amination of UTP at the 6-position gives CTP. The enzyme CTP synthetase is a 
glutamine amidotransferase (Figure 26.16). ATP hydrolysis provides the energy to 
drive the reaction. 


Pyrimidine Biosynthesis Is Regulated at ATCase in Bacteria 
and at CPS-II in Animals 


Pyrimidine biosynthesis in bacteria is allosterically regulated at aspartate trans- 
carbamoylase (ATCase). Escherichia coli ATCase is feedback-inhibited by the end 
product, CTP. ATP, which can be viewed as a signal of both energy availability and 
purine sufficiency, is an allosteric activator of ATCase. CTP and ATP compete for a 
common allosteric site on the enzyme. In many bacteria, UTP, not CTP, acts as the 
ATCase feedback inhibitor. 

In animals, CPS-II catalyzes the committed step in pyrimidine synthesis and 
serves as the focal point for allosteric regulation. UDP and UTP are feedback 


HUMAN BIOCHEMISTRY 





Mammalian CPS-II Is Activated In Vitro by MAP Kinase and In Vivo by Epidermal 
Growth Factor 





The rate-limiting step in mammalian de novo pyrimidine synthesis 
is catalyzed by CPS-I, and proliferating cells require lots of pyrim- 
idine nucleotides for growth and cell division. Normally, CPS-II 
is feedback-inhibited by UTP, but in vitro phosphorylation of 
CPS-II by MAP kinase (mitogen-activated protein kinase) creates a co- 
valently modified (phosphorylated) CPS-II that is no longer sensi- 
tive to UTP inhibition. Furthermore, phosphorylated CPS-II is 
more responsive to PRPP activation. Both of these responses favor 
enhanced pyrimidine biosynthesis. This regulation occurs in vivo 


when epidermal growth factor (EGF), a mitogen, initiates an in- 
tracellular cascade of reactions that culminates in MAP kinase ac- 
tivation. By this action, pyrimidine nucleotides are made available 
for RNA and DNA synthesis, processes that are central to the cell 
proliferation that follows EGF activation. 


A mitogen is a hormone that stimulates mitosis (cell division). 
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FIGURE 26.18 Pyrimidine degradation. 
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FIGURE 26.17 A comparison of the regulatory circuits that control pyrimidine synthesis in E.coli and animals. 


inhibitors of CPS-II, whereas PRPP and ATP are allosteric activators. With the ex- 
ception of ATP, none of these compounds are substrates of CPS-II or of either of 
the two other enzymic activities residing with it on the trifunctional polypeptide. 
Figure 26.17 compares the regulatory circuits governing pyrimidine synthesis in 
bacteria and animals. 


26:6: How Are Pyrimidines Degraded? 


In some organisms, free pyrimidines, like purines, are salvaged and recycled to 
form nucleotides via phosphoribosyltransferase reactions similar to those discussed 
earlier. In humans, however, pyrimidines are recycled from nucleosides, but free 
pyrimidine bases are not salvaged. Pyrimidine catabolism results in degradation of 
the pyrimidine ring to products reminiscent of the original substrates, aspartate, 
COs, and ammonia (Figure 26.18). -Alanine can be recycled into the synthesis of 
coenzyme A. Catabolism of the pyrimidine base, thymine (5-methyluracil), yields 
B-aminoisobutyric acid instead of B-alanine. 

Pathways presented thus far in this chapter account for the synthesis of the four 
principal ribonucleotides: ATP, GTP, UTP, and CTP. These compounds serve im- 
portant coenzymic functions in metabolism and are the immediate precursors for 
ribonucleic acid (RNA) synthesis. Roughly 90% of the total nucleic acid in cells is 
RNA, with the remainder being deoxyribonucleic acid (DNA). DNA differs from 
RNA in being a polymer of deoxyribonucleotides, one of which is deoxythymidylic 
acid. We now turn to the synthesis of these compounds. 


2627/5 How Do Cells Form the Deoxyribonucleotides 
That Are Necessary for DNA Synthesis? 


The deoxyribonucleotides have only one metabolic purpose: to serve as precursors 
for DNA synthesis. In most organisms, ribonucleoside diphosphates (NDPs) are the 
substrates for deoxyribonucleotide formation. Reduction at the 2'-position of the ri- 
bose ring in NDPs produces 2’-deoxy forms of these nucleotides (Figure 26.19). 
This reaction involves replacement of the 2’-OH by a hydride ion (H:~) and is cat- 
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alyzed by an enzyme known as ribonucleotide reductase. Enzymatic ribonucleotide re- 
duction involves a free radical mechanism, and three classes of ribonucleotide re- 
ductases are known, differing from each other in their mechanisms of free radical 
generation. Class I enzymes, found in E. coli and virtually all eukaryotes, are 
Fe-dependent and generate the required free radical on a specific tyrosyl side chain. 


E. coli Ribonucleotide Reductase Has Three Different 
Nucleotide-Binding Sites 


The enzyme system for dNDP formation consists of four proteins, two of which con- 
stitute the ribonucleotide reductase proper, an enzyme of the ay Bp» type. The other 
two proteins, thioredoxin and thioredoxin reductase, function in the delivery of re- 
ducing equivalents, as we shall see shortly. The two proteins of ribonucleotide re- 
ductase are designated R1 (86 kD) and R2 (43.5 kD), and each is a homodimer in 
the holoenzyme (Figure 26.20). The RI homodimer carries two types of regulatory 
sites in addition to the catalytic site (the active site). Substrates (ADP, CDP, GDP, 
and UDP) bind at the catalytic site. One regulatory site—the substrate specificity 
site—binds ATP, dATP, dGTP, or dTTP, and which of these nucleotides is bound 
there determines which nucleoside diphosphate is bound at the catalytic site. The 
other regulatory site, the overall activity site, binds either the activator ATP or the 
negative effector dATP; the nucleotide bound here determines whether the enzyme 
is active or inactive. Activity depends also on residues Cys**®, Cys**>, and Cys*? in R1. 
The 2 Fe atoms within the single active site formed by the R2 homodimer generate 
the free radical required for ribonucleotide reduction on a specific R2 residue, 
Tyr!®?, which in turn generates a thiyl free radical (Cys-S-) on Cys**®. Cys*-S- initi- 
ates ribonucleotide reduction by abstracting the 3'-H from the ribose ring of the nu- 
cleoside diphosphate substrate (Figure 26.21) and forming a free radical on C-3’. 
Subsequent dehydration forms the deoxyribonucleotide product. 


Thioredoxin Provides the Reducing Power 
for Ribonucleotide Reductase 


NADPH is the ultimate source of reducing equivalents for ribonucleotide reduc- 
tion, but the immediate source is reduced thioredoxin, a small (12-kD) protein with 
reactive Cys-sulfhydryl groups situated near one another in the sequence Cys-Gly- 
Pro-Cys. These Cys residues are able to undergo reversible oxidation—reduction be- 
tween (—S—S—) and (—SH HS—.) and, in their reduced form, serve as primary 


(a) 
ATP, dATP, } Specificity-determining 


R dGTP. dTTP f effectors 







A~— ATP, dATp $ Activity-determining 
effectors 





ADP, CDP, } Substrates 


we UDP, GDP 
G 


R2 (Bo) 








O Oo Base 
TO 
Ribonucleotide OH 
reductase 
oO oO Base 








H 


3 2 
dNDP OH H 


FIGURE 26.19 Deoxyribonucleotide synthesis involves 
reduction at the 2’-position of the ribose ring of 
nucleoside diphosphates. 


(b) 








One aß unit of S. typhimurium class 1b ribo- 
nucleotide reductase with bound dGTP (green) 


FIGURE 26.20 E.coli ribonucleotide reductase. and 2 Fe atoms (yellow). (pdb id = 2BQ1) 
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FIGURE 26.21 The free radical mechanism of ribonucleotide reduction. H, designates the C-3’ hydrogen and 


H, the C-2' hydrogen atom. 


electron donors to regenerate the reactive —SH pair of the ribonucleotide reduc- 
tase active site (Figure 26.21). In turn, the sulfhydryls of thioredoxin must be re- 
stored to the (—SH HS—) state for another catalytic cycle. Thioredoxin reductase, 
an @o-type enzyme composed of 58-kD flavoprotein subunits, mediates the NADPH- 
dependent reduction of thioredoxin (Figure 26.22). Thioredoxin functions in a 
number of metabolic roles besides deoxyribonucleotide synthesis, the common de- 
nominator of which is reversible sulfide : sulfhydryl transitions. 

The substrates for ribonucleotide reductase are CDP, UDP, GDP, and ADP, and 
the corresponding products are (CDP, dUDP, dGDP, and dADP. Because CDP is not 
an intermediate in pyrimidine nucleotide synthesis, it must arise by dephosphory- 
lation of CTP, for instance, via nucleoside diphosphate kinase action. Although uri- 
dine nucleotides do not occur in DNA, UDP is a substrate. The formation of (UDP 
is justified because it is a precursor to dTTP, a necessary substrate for DNA synthe- 


sis (see following discussion). 


Both the Specificity and the Catalytic Activity of Ribonucleotide 
Reductase Are Regulated by Nucleotide Binding 


Ribonucleotide reductase activity must be modulated in two ways in order to 
maintain an appropriate balance of the four deoxynucleotides essential to DNA 
synthesis, namely, dATP, dGTP, dCTP, and dTTP. First, the overall activity of the 


NADPH 
+ Ht 
NADP+ 
Thioredoxin reductase Thioredoxin Ribonucleotide 
reductase 





FIGURE 26.22 (a) The (—S—S—)/(—SH HS—) oxidation-reduction cycle involving ribonucleotide reductase, 
thioredoxin, thioredoxin reductase, and NADPH. (b) Thioredoxin reductase (œ, dimer) with bound ligands: 
FAD (gold) and NADP analog (blue). Each a-subunit is associated with a thioredoxin molecule (yellow) (pdb id 
= 1F6M). 
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Energy status of cell is robust; [ATP] is high. Make DNA: 


@ ATP occupies activity site A: ribonucleotide reductase ON 
(2) ATP in specificity site S favors CDP or UDP in catalytic site C —> [dCDP], [dUDP] 4 


@ dcppe 
dUDP 


@ dTTP occupies specificity site S, favoring GDP or ADP in catalytic site C 
GDP—+» dGDP—+» dGTP 


@® dGTP occupies specificity site S, favoring ADP in catalytic site C —> [dADP] t 
@ dATP replaces ATP in activity site A: ribonucleotide reductase OFF 


Saeed —»> dUMP—-> dTMP — > —_> dTTP 


enzyme must be turned on and off in response to the need for dNTPs. Second, 
the relative amounts of each NDP substrate transformed into dNDP must be con- 
trolled so that the right balance of d(ATP:dGTP:dCTP:dTTP is produced. The 
two different effector-binding sites on ribonucleotide reductase, discrete from the 
substrate-binding catalytic site, are designed to serve these purposes. As noted previ- 
ously, these two regulatory sites are designated the overall activity site and the sub- 
strate specificity site. Only ATP and dATP are able to bind at the overall activity site. 
ATP is an allosteric activator and dATP is an allosteric inhibitor, and they compete 
for the same site. If ATP is bound, the enzyme is active, whereas if its deoxy coun- 
terpart, dATP, occupies this site, the enzyme is inactive. That is, ATP is a positive 
effector and dATP is a negative effector with respect to enzyme activity, and they 
compete for the same site. 

The second regulatory site, the substrate specificity site, can bind either ATP, 
dTTP, dGTP, or dATP, and the substrate specificity of the enzyme is determined 
by which of these nucleotides occupies this site. If ATP is in the substrate specificity 
site, ribonucleotide reductase preferentially binds pyrimidine nucleotides (UDP 
or CDP) at its active site and reduces them to (UDP and dCDP. With dTTP in the 
specificity-determining site, GDP is the preferred substrate. When dGTP binds to 
the specificity site, ADP becomes the favored substrate for reduction. The ratio- 
nale for these varying affinities is as follows (Figure 26.23): High [ATP] is consis- 
tent with cell growth and division and, consequently, the need for DNA synthesis. 
Thus, ATP binds in the overall activity site of ribonucleotide reductase, turning it 
on and promoting production of dNTPs for DNA synthesis. Under these condi- 
tions, ATP is also likely to occupy the substrate specificity site, so UDP and CDP are 
bound at the catalytic site and reduced to d(UDP and dCDP. Both of these pyrimi- 
dine deoxynucleoside diphosphates are precursors to dTTP. Thus, elevation of 
dUDP and dCDP levels leads to an increase in [dTTP]. High dTTP levels increase 
the likelihood that it will occupy the substrate specificity site, in which case GDP be- 
comes the preferred substrate and dGTP levels rise. Upon dGTP association with 
the substrate specificity site, ADP is the favored substrate, leading to ADP reduction 
and the eventual accumulation of dATP. Binding of dATP to the overall activity site 
then shuts the enzyme down. In summary, the relative affinities of the three 
classes of nucleotide binding sites in ribonucleotide reductase for the various sub- 
strates, activators, and inhibitors are such that the formation of dNDPs proceeds 
in an orderly and balanced fashion. As these dNDPs are formed in amounts con- 
sistent with cellular needs, their phosphorylation by nucleoside diphosphate ki- 
nases produces dNTPs, the actual substrates of DNA synthesis. 


26:8: How Are Thymine Nucleotides Synthesized? 


The synthesis of thymine nucleotides proceeds from other pyrimidine deoxyri- 
bonucleotides. Cells have no requirement for free thymine ribonucleotides and 
do not synthesize them. Small amounts of thymine ribonucleotides do occur in 
tRNA (tRNA is notable for having unusual nucleotides), but these Ts arise via 
methylation of U residues already incorporated into the tRNA. Both dUDP and 


FIGURE 26.23 Regulation of deoxynucleotide biosyn- 
thesis: the rationale for the various affinities displayed 
by the two nucleotide-binding regulatory sites on ribo- 
nucleotide reductase. 
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FIGURE 26.24 Pathways of dTMP synthesis. dTMP pro- Proma dunn dune URE 5 dunu K ami 
duction is dependent on dUMP formation from dCDP From dCDP: dCDP dCMP > dUMP > dTMP 
and dUDP. 


dCDP can lead to formation of dUMP, the immediate precursor for dTMP syn- 
thesis (Figure 26.24). Interestingly, formation of dUMP from dUDP passes 
through dUTP, which is then cleaved by dUTPase, a pyrophosphatase that re- 
moves PP; from dUTP. The action of dUTPase prevents dUTP from serving as a 
substrate in DNA synthesis. An alternative route to (UMP formation starts with 


A DEEPER LOOK 


Fluoro-Substituted Analogs as Therapeutic Agents 


Carbon-fluorine bonds are exceedingly rare in nature, and fluo- 
rine is an uncommon constituent of biological molecules. F has 
three properties attractive to drug designers: (1) It is the smallest 
replacement for an H atom in organic synthesis, (2) fluorine is 
the most electronegative element, and (3) the F—C bond is rela- 
tively unreactive. This steric compactness and potential for strong 
inductive effects through its electronegativity renders F a useful 
substituent in the construction of inhibitory analogs of enzyme 
substrates. One interesting strategy is to devise fluorinated pre- 
cursors that are taken up and processed by normal metabolic 
pathways to generate a potent antimetabolite. A classic example is 
fluoroacetate. FCH,COO™ is exceptionally toxic because it is read- 
ily converted to fluorocitrate by citrate synthase of the citric acid 
cycle (see Chapter 19). In turn, fluorocitrate is a powerful in- 
hibitor of aconitase. The metabolic transformation of an other- 
wise innocuous compound into a poisonous derivative is termed 
lethal synthesis. 5-Fluorouracil and 5-fluorocytosine are also exam- 
ples of this strategy (see Human Biochemistry on page 835). oR Cie 
Unlike hydrogen, which is often abstracted from substrates as 5 
H*, electronegative fluorine cannot be readily eliminated as the 
corresponding F*. Thus, enzyme inhibitors can be fashioned in 
which F replaces H at positions where catalysis involves H removal ’,N'°-methylene- THF 
as H*. Thymidylate synthase catalyzes removal of H from dUMP 
as H* through a covalent catalysis mechanism. A thiol group on this 
enzyme normally attacks the 6-position of the uracil moiety 
of 2'-deoxyuridylic acid so that C-5 can act as a carbanion in attack 
on the methylene carbon of N°,N!°-methylene-THF (see accompa- 
nying figure). Regeneration of free enzyme then occurs through 
loss of the C-5 H atom as H* and dissociation of product dTMP. If 
F replaces H at C-5 as in 2’-deoxy-5-fluorouridylate (FAUMP), the 
enzyme is immobilized in a very stable ternary [enzyme:FdUMP: 
methylene-THF] complex and effectively inactivated. Enzyme in- 
hibitors like FAUMP whose adverse properties are elicited only 
through direct participation in the catalytic cycle are variously 
called mechanism-based inhibitors, suicide substrates, or Trojan 
horse substrates. 











» The effect of the 5-fluoro substitution on the mechanism of action 

of thymidylate synthase. An enzyme thiol group (from a Cys side chain) 
ordinarily attacks the 6-position of (UMP so that C-5 can react as a 
carbanion with N®°,N!°-methylene-THF. Normally, free enzyme is 
regenerated following release of the hydrogen at C-5 as a proton. 
Because release of fluorine as F* cannot occur, the ternary (three-part) 
complex of [enzyme : fluorouridylate : methylene-THF] is stable and per- 
sists, preventing enzyme turnover. (The N°,N!°-methylene-THF structure 
is given in abbreviated form.) Ternary complex 
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FIGURE 26.25 (a) The dCMP deaminase reaction. (b) Trimeric (CMP deaminase. Each chain has a bound dCTP 
molecule (purple) and a Mg?* ion (orange) (pdb id = 1XS4). 


dCDP, which is dephosphorylated to dCMP and then deaminated by dCMP 
deaminase (Figure 26.25), leaving (UMP. dCMP deaminase provides a second 
point for allosteric regulation of dNTP synthesis; it is allosterically activated by 
dCTP and feedback-inhibited by dTTP. Of the four dNTPs, only dCTP does not 
interact with either of the regulatory sites on ribonucleotide reductase (see Fig- 
ure 26.20). Instead, it acts upon dCMP deaminase. 

Synthesis of dTMP from dUMP is catalyzed by thymidylate synthase (Figure 
26.26). This enzyme methylates (UMP at the 5-position to create dTMP; the 
methyl donor is the one-carbon folic acid derivative N°,N'°-methylene-THF. The 
reaction is actually a reductive methylation in which the one-carbon unit is trans- 
ferred at the methylene level of reduction and then reduced to the methyl level. 
The THF cofactor is oxidized at the expense of methylene reduction to yield 
DHF. DHFR then reduces DHF back to THF for service again as a one-carbon 
vehicle (see panel a of the figure in A Deeper Look on page 816). Thymidylate 
synthase sits at a junction connecting dNTP synthesis with folate metabolism. 


HUMAN BIOCHEMISTRY 





Fluoro-Substituted Pyrimidines in Cancer Chemotherapy, Fungal Infections, 
and Malaria 


5-Fluorouracil (5-FU; see part a of the figure) is a thymine analog. It (a) (b) 
is converted in vivo to 5’-fluorouridylate by a PRPP-dependent phos- 9 

phoribosyltransferase and passes through the reactions of ANTP H N ® N” ® 
potent inhibitor of dTMP synthase (see A Deeper Look on page | | 
834). 5-FU is used as a chemotherapeutic agent in the treatment of (0) N oO 

human cancers. Similarly, 5-/lworocytosine (see part b) is used as an l 

antifungal drug because fungi, unlike mammals, can convert it to 
2'-deoxy-5-fluorouridylate. Furthermore, malarial parasites can use 5-Fluorouracil 5-Fluorocytosine 
exogenous orotate to make pyrimidines for nucleic acid synthesis, (c) 

whereas mammals cannot. Thus, 5-/lworoorotate (see part c) is an O 


effective antimalarial drug because it is selectively toxic to these H & 
parasites. 


synthesis, culminating ultimately as 2’-deoxy-5-fluorouridylic acid, a a. 





5-Fluoroorotate 
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FIGURE 26.26 (a) The thymidylate synthase reaction. The 5-CH; group is ultimately derived from the B-carbon 
of serine. (b) Thymidylate synthase dimer. Each monomer has a bound folate analog (green) and dUMP (light 


blue) (pdb id = 1JUJ). 


It has become a preferred target for inhibitors designed to disrupt DNA synthe- 
sis. An indirect approach is to employ folic acid precursors or analogs as anti- 
metabolites of dTMP synthesis (see panel b of the figure in A Deeper Look on 
page 818). Purine synthesis is affected as well because it is also dependent on 
THF (see Figure 26.3). 


SUMMARY 





26.1 Can Cells Synthesize Nucleotides? Nucleotides are ubiquitous 
constituents of life and nearly all cells are capable of synthesizing them 
“from scratch” via de novo pathways. Rapidly proliferating cells must 
make lots of purine and pyrimidine nucleotides to satisfy demands for 
DNA and RNA synthesis. Nucleotide biosynthetic pathways are attractive 
targets for the clinical control of rapidly dividing cells such as cancers or 
infectious bacteria. Many antibiotics and anticancer drugs are inhibitors 
of purine or pyrimidine nucleotide biosynthesis. 


26.2 How Do Cells Synthesize Purines? The nine atoms of the purine 
ring system are derived from aspartate (N-1), glutamine (N-3 and N-9), 
glycine (C-4, C-5, and N-7), COs (C-6), and THF one-carbon derivatives 
(C-2 and C-8). The atoms of the purine ring are successively added to 
ribose-5-phosphate, so purines begin as nucleotide derivatives through as- 
sembly of the purine ring system directly on the ribose. Because purine 
biosynthesis depends on folic acid derivatives, it is sensitive to inhibition 
by folate analogs. Distinct, two-step metabolic pathways diverge from IMP, 


one leading to AMP and the other to GMP. Purine biosynthesis is regu- 
lated at several stages: Reaction 1 (ribose-5-phosphate pyrophosphoki- 
nase) is feedback-inhibited by ADP and GDP; the enzyme catalyzing re- 
action 2 (glutamine phosphoribosyl pyrophosphate amidotransferase) 
has two inhibitory allosteric sites, one where adenine nucleotides bind 
and another where guanine nucleotides bind. PRPP is a “feed-forward” 
activator of this enzyme. The first reaction in the conversion of IMP to 
AMP involves adenylosuccinate synthetase, which is inhibited by AMP; the 
first step in the conversion of IMP to GMP is catalyzed by IMP dehydro- 
genase and is inhibited by GMP. ATP-dependent kinases form nucleoside 
diphosphates and triphosphates from AMP and GMP. 


26.3 Can Cells Salvage Purines? Purine ring systems represent a meta- 
bolic investment by cells, and salvage pathways exist to recover them 
when degradation of nucleic acids releases free purines in the form of 
adenine, guanine, and hypoxanthine (the base in IMP). Hypoxanthine- 
guanine phosphoribosyltransferase (HGPRT) acts on either hypoxan- 


thine to form IMP or guanine to form GMP; an absence of HGPRT is the 
basis of Lesch-Nyhan syndrome. 


26.4 How Are Purines Degraded? Dietary nucleic acids are digested 
to nucleotides by various nucleases and phosphodiesterases, the nu- 
cleotides are converted to nucleosides by base-specific nucleotidases 
and nonspecific phosphatases, and then nucleosides are hydrolyzed to 
release the purine base. Only the pentoses of nucleotides serve as 
sources of metabolic energy. In humans, the purine ring is oxidized to 
uric acid by xanthine oxidase and excreted. Gout occurs when bodily 
fluids accumulate an excess of uric acid. Skeletal muscle operates a 
purine nucleoside cycle as an anaplerotic pathway. 


26.5 How Do Cells Synthesize Pyrimidines? In contrast to formation of 
the purine ring system, the pyrimidine ring system is completed before a 
ribose-5-P moiety is attached. Only two precursors, carbamoyl-P and as- 
partate, contribute atoms to the six-membered pyrimidine ring. The first 
step in humans is catalyzed by CPS-II. ATCase then links carbamoyl-P with 
aspartate. Subsequent reactions close the ring and oxidize it before 
adding ribose-5-P, using a-PRPP as donor. Decarboxylation gives UMP. In 
mammals, the six enzymatic activities of pyrimidine biosynthesis are dis- 
tributed among only three proteins, two of which are multifunctional 
polypeptides. Purine and pyrimidine synthesis in mammals are two 
prominent examples of metabolic channeling. UMP leads to UTP, the 
substrate for formation of CTP via CTP synthetase. Regulation of pyrimi- 
dine synthesis in animals occurs at CPS-II. UDP and UTP are feedback 
inhibitors, whereas PRPP and ATP are allosteric activators. In bacteria, 
regulation acts at ATCase through feedback inhibition by CTP (or UTP) 
and activation by ATP. 
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26.6 How Are Pyrimidines Degraded? Degradation of the pyrimi- 
dine ring generates B-alanine, CO», and ammonia. In humans, pyrim- 
idines are recycled from nucleosides, but free pyrimidine bases are not 
salvaged. 


26.7 How Do Cells Form the Deoxyribonucleotides That Are Necessary 
for DNA Synthesis? 2’-Deoxyribonucleotides are formed from ribonu- 
cleotides through reduction at the 2’-position of the ribose ring in NDPs. 
The reaction, catalyzed by ribonucleotide reductase, involves a free radi- 
cal mechanism that replaces the 2’-OH by a hydride ion (H:~). Thiore- 
doxin provides the reducing power for ribonucleotide reduction. Class Ia 
ribonucleotide reductases have three different nucleotide-binding sites: 
the catalytic site (or active site), which binds substrates (ADP, CDP, GDP, 
and UDP); the substrate specificity site, which can bind ATP, dATP, dGTP, 
or dTTP; and the overall activity site, which binds either the activator ATP 
or the negative effector dATP. The relative affinities of the three classes of 
nucleotide binding sites in ribonucleotide reductase for the various sub- 
strates, activators, and inhibitors are such that the various dNDPs are 
formed in amounts consistent with cellular needs. 


26.8 How Are Thymine Nucleotides Synthesized? Both dUDP and 
dCDP can lead to formation of (UMP, the immediate precursor for dTMP 
synthesis. Formation of dTMP from dUMP is catalyzed by thymidylate syn- 
thase through reductive methylation of dUMP at the 5-position. The 
methyl donor is the one-carbon folic acid derivative N°,N'°-methylene- 
THF. Fluoro-substituted pyrimidine analogs such as 5-fluorouracil (5-FU), 
5-fluorocytosine, and 5-fluoroorotate can be converted to FAUMP, which 
inhibits thymidylate synthase. These fluoro compounds have found a 
range of therapeutic uses in treating diseases from cancer to malaria. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. Draw the purine and pyrimidine ring structures, indicating the 
metabolic source of each atom in the rings. 

2. Starting from glutamine, aspartate, glycine, CO, and N 10_-formyl- 
THF, how many ATP equivalents are expended in the synthesis of 
(a) ATP, (b) GTP, (c) UTP, and (d) CTP? 

3. Illustrate the key points of regulation in (a) the biosynthesis of IMP, 
AMP, and GMP; (b) E. coli pyrimidine biosynthesis; and (c) mam- 
malian pyrimidine biosynthesis. 

4. Indicate which reactions of purine or pyrimidine metabolism are af- 
fected by the inhibitors (a) azaserine, (b) methotrexate, (c) sulfon- 
amides, (d) allopurinol, and (e) 5-fluorouracil. 

5. Since dUTP is not a normal component of DNA, why do you sup- 
pose ribonucleotide reductase has the capacity to convert UDP to 
dUDP? 

6. Describe the underlying rationale for the regulatory effects exerted 
on ribonucleotide reductase by ATP, dATP, dTTP, and dGTP. 


7. (Integrates with Chapters 18-20 and 22.) By what pathway(s) does 
the ribose released upon nucleotide degradation enter intermediary 
metabolism and become converted to cellular energy? How many 
ATP equivalents can be recovered from one equivalent of ribose? 

8. (Integrates with Chapter 25.) At which steps does the purine biosyn- 
thetic pathway resemble the pathway for biosynthesis of the amino 
acid histidine? 

9. Write reasonable chemical mechanisms for steps 6, 8, and 9 in 
purine biosynthesis (see Figure 26.3). 

10. Write a balanced equation for the conversion of aspartate to fu- 
marate by the purine nucleoside cycle in skeletal muscle. 

11. Write a balanced equation for the oxidation of uric acid to glyoxylic 
acid, CO», and NHs, showing each step in the process and naming 
all of the enzymes involved. 


12. (Integrates with Chapter 15.) E. coli aspartate transcarbamoylase 
(ATCase) displays classic allosteric behavior. This a¢8, enzyme is 
activated by ATP and feedback-inhibited by CTP. In analogy with 
the behavior of glycogen phosphorylase shown in Figure 15.14, il- 
lustrate the allosteric v versus [aspartate] curves for ATCase (a) in 
the absence of effectors, (b) in the presence of CTP, and (c) in the 
presence of ATP. 


*13. (Integrates with Chapter 15.) Unlike its allosteric counterpart glyco- 
gen phosphorylase (an œ enzyme), E. coli ATCase has a heteromeric 
(a¢B5) organization. The a-subunits bind aspartate and are consid- 
ered catalytic subunits, whereas the -subunits bind CTP or ATP and 
are considered regulatory subunits. How would you describe the sub- 
unit organization of ATCase from a functional point of view? 

14. (Integrates with Chapter 20.) Starting from HCOs_, glutamine, as- 
partate, and ribose-5-P, how many ATP equivalents are consumed in 
the synthesis of dTTP in a eukaryotic cell, assuming dihydroorotate 
oxidation is coupled to oxidative phosphorylation? How does this 
result compare with the ATP costs of purine nucleotide biosynthe- 
sis calculated in problem 2? 

15. Write a balanced equation for the synthesis of dTMP from UMP and 
N®,N'°methylene-THF. Thymidylate synthase has four active-site 
arginine residues (Arg, Arg!”*, Arg!”", and Arg?!*) involved in 
substrate binding. Postulate a role for the side chains of these Arg 
residues. 

16. Enzymes that bind phosphoribosyl-5-phosphate (PRPP) have a com- 
mon structural fold, the PRT fold, which unites them as a structural 
family. PRT here refers to the phosphoribosyl transferase activity dis- 
played by some family members. Typically, in such reactions, PP; is 
displaced from PRPP by a nitrogen-containing nucleophile. Several 
such reactions occur in purine metabolism. Identify two such reac- 
tions where the enzyme involved is likely to be a PRT family member. 


17. The crystal structure of E. coli dihydrofolate reductase (DFR) with 
NADP* and folate bound can be found in the Protein Data Bank 
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(www.rcsb.org/pdb) as file 7DFR. Go to this website, enter “7DFR” 
in the search line, and click on “KiNG” under “Display options” 
when the 7DFR page comes up. Explore the KiNG graphic of the 
DFR structure to visualize how its substrates are bound. (If you hold 
down the left button on your mouse and move the cursor over the 
image, you can rotate the structure to view it from different per- 
spectives.) Note in particular the spatial relationship between the 
nicotinamide ring of NADP* and the pterin ring of folate. Do you 
now have a better appreciation for how this enzyme works? Note 
also the location of polar groups on the two substrates in relation to 
the DFR structure. 

18. E. coli aspartate transcarbamoylase is an allosteric enzyme (see 
problem 12) composed of six catalytic (C) subunits and six regula- 
tory (R) subunits. Protein Data Bank file 1RAA shows one-third of 
the ATCase holoenzyme (two C subunits and two R subunits; CTP 
molecules are bound to the R subunits). Explore this structure us- 
ing the KiNG display option. What secondary structural motif dom- 
inates the R subunit structure? Protein Data Bank file 2IPO also 


shows one-third of the ATCase holoenzyme (two C subunits and 
two R subunits), but in this structure molecules of the substrate 
analog N-(2-phosphonoacetyl)-L-asparagine are bound to the C 
subunits. Explore this structure using the KiNG display option. 
Note the distance separating the ATCase active site from its al- 
losteric site. Interpret what you see in terms of the Monod- 
Wyman-—Changeux model for allosteric regulation (see Chapter 
15). Which of these structures corresponds to the MWC R:state, 
and which corresponds to the T-state? 


Preparing for the MCAT Exam 


19. Examine Figure 26.6 and predict the relative rates of the regulated 
reactions in the purine biosynthetic pathway from ribose-5-P to 
GMP and AMP under conditions in which GMP levels are very high. 

20. Decide from Figures 18.1, 25.31, 26.26, and the Deeper Look box 
on page 817 which carbon atom(s) in glucose would be most likely 
to end up as the 5-CH; carbon in dTMP. 
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Metabolic Integration 
and Organ Specialization 





ESSENTIAL QUESTIONS 


Cells are systems in a dynamic steady state, maintained by a constant flux of nutri- 
ents that serve as energy sources or as raw material for the maintenance of cellular 
structures. Catabolism and anabolism are ongoing, concomitant processes. 


What principles underlie the integration of catabolism and energy production 
with anabolism and energy consumption? How is metabolism integrated in 
complex organisms with multiple organ systems? 


In the preceding chapters, we have explored the major metabolic pathways— 
glycolysis, the citric acid cycle, electron transport and oxidative phosphorylation, 
photosynthesis, gluconeogenesis, fatty acid oxidation, lipid biosynthesis, amino 
acid metabolism, and nucleotide metabolism. Several of these pathways are cata- 
bolic and serve to generate chemical energy useful to the cell; others are anabolic 
and use this energy to drive the synthesis of essential biomolecules. Despite their 
opposing purposes, these reactions typically occur at the same time as nutrient 
molecules are broken down to provide the building blocks and energy for ongoing 
biosynthesis. Cells maintain a dynamic steady state through processes that involve 
considerable metabolic flux. The metabolism that takes place in just a single cell is 
so complex that it defies detailed quantitative description. However, an apprecia- 
tion of overall relationships can be achieved by stepping back and considering in- 
termediary metabolism at a systems level of organization. 


27l" Can Systems Analysis Simplify the Complexity 
of Metabolism? 


The metabolism of a typical heterotrophic cell can be portrayed by a schematic 
diagram consisting of just three interconnected functional blocks: (1) catabolism, 
(2) anabolism, and (3) macromolecular synthesis and growth (Figure 27.1). 


1.Catabolism Energy-yielding nutrients are oxidized to CO, and H,O in catabo- 
lism, and most of the electrons liberated are passed to oxygen via an electron- 
transport pathway coupled to oxidative phosphorylation, resulting in the forma- 
tion of ATP. Some electrons go to reduce NADP* to NADPH, the source of 
reducing power for anabolism. Glycolysis, the citric acid cycle, electron transport 
and oxidative phosphorylation, and the pentose phosphate pathway are the prin- 
cipal pathways within this block. The metabolic intermediates in these pathways 
also serve as substrates for processes within the anabolic block. 


2.Anabolism The biosynthetic reactions that form the many cellular molecules 
collectively comprise anabolism. For thermodynamic reasons, the chemistry of an- 
abolism is more complex than that of catabolism (that is, it takes more energy [and 
often more steps] to synthesize a molecule than can be produced from its degrada- 
tion). Metabolic intermediates derived from glycolysis and the citric acid cycle are 
the precursors for this synthesis, with NADPH supplying the reducing power and 
ATP the coupling energy. 


3. Macromolecular Synthesis and Growth The organic molecules produced in 
anabolism are the building blocks for creation of macromolecules. Like anabolism, 
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The Washington, D.C., Metro map. The coordinated 
flow of passengers along different transit lines is an 
apt metaphor for metabolic regulation. 
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the hierarchy of function is kept in mind. 
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27.1 Can Systems Analysis Simplify the 
Complexity of Metabolism? 


27.2 What Underlying Principle Relates ATP 
Coupling to the Thermodynamics of 

etabolism? 

27.3 Is There a Good Index of Cellular Energy 

Status? 
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macromolecular synthesis is driven by energy from ATP, although indirectly in some 
cases: GTP is the principal energy source for protein synthesis, CTP for phospholipid 
synthesis, and UTP for polysaccharide synthesis. However, keep in mind that ATP is 
the principal phosphoryl donor for formation of GTP, CTP, and UTP from GDP, CDP, 
and UDP, respectively. Macromolecules are the agents of biological function and 
information—proteins, nucleic acids, lipids that self-assemble into membranes, and 
so on. Growth can be represented as cellular accumulation of macromolecules and 
the partitioning of these materials of function and information into daughter cells in 
the process of cell division. 


Only a Few Intermediates Interconnect the Major Metabolic Systems 


Despite the complexity of processes taking place within each block, the connections 
between blocks involve only a limited number of substances. Just ten or so kinds of 
catabolic intermediates from glycolysis, the pentose phosphate pathway, and the cit- 
ric acid cycle serve as the raw material for most of anabolism: four kinds of sugar 
phosphates (triose-P, tetrose-P, pentose-P, and hexose-P), three a-keto acids (pyru- 
vate, oxaloacetate, and a-ketoglutarate), two coenzyme A derivatives (acetyl-CoA and 
succinyl-CoA), and PEP (phosphoenolpyruvate). 


ATP and NADPH Couple Anabolism and Catabolism 


Metabolic intermediates are consumed by anabolic reactions and must be continu- 
ously replaced by catabolic processes. In contrast, the energy-rich compounds ATP 
and NADPH are recycled rather than replaced. When these substances are used in 
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biosynthesis, the products are ADP and NADP*, and ATP and NADPH are regener- 
ated during catabolism. ATP and NADPH are unique in that they are the only com- 
pounds whose purpose is to couple the energy-yielding processes of catabolism to the 
energy-consuming reactions of anabolism. Certainly, other coupling agents serve es- 
sential roles in metabolism. For example, NADH and [FADHs] participate in the 
transfer of electrons from substrates to Os during oxidative phosphorylation. How- 
ever, these reactions are solely catabolic, and the functions of NADH and [FADH»] 
are fulfilled within the block called catabolism. 


Phototrophs Have an Additional Metabolic System— 
The Photochemical Apparatus 


The systems in Figure 27.1 reviewed thus far are representative only of metabolism as 
it exists in aerobic heterotrophs. The photosynthetic production of ATP and NADPH 
in photoautotrophic organisms entails a fourth block, the photochemical system (Fig- 
ure 27.1). This block consumes HO and releases Og. When this fourth block operates, 
energy production within the catabolic block can be largely eliminated. Yet another 
block, one to account for the fixation of carbon dioxide into carbohydrates, is also re- 
quired for photoautotrophs. The inputs to this fifth block are the products of the pho- 
tochemical system (ATP and NADPH) and CO, derived from the environment. The 
carbohydrate products of this block may enter catabolism, but not primarily for energy 
production. In photoautotrophs, carbohydrates are fed into catabolism to generate 
the metabolic intermediates needed to supply the block of anabolism. Although these 
diagrams are oversimplifications of the total metabolic processes in heterotrophic or 
phototrophic cells, they are useful illustrations of functional relationships between the 
major metabolic subdivisions. This general pattern provides an overall perspective on 
metabolism, making its purpose easier to understand. 


27228 What Underlying Principle Relates ATP Coupling 
to the Thermodynamics of Metabolism? 


Virtually every metabolic pathway either consumes or produces ATP. The amount of 
ATP involved—that is, the stoichiometry of ATP synthesis or hydrolysis—lies at the 
heart of metabolic relationships. The overall thermodynamic efficiency of any meta- 
bolic sequence, be it catabolic or anabolic, is determined by ATP coupling. In every 
case, the overall reaction mediated by any metabolic pathway is energetically favorable because of 
its particular ATP stoichiometry. In the highly exergonic reactions of catabolism, much of 
the energy released is captured in ATP synthesis. In turn, the thermodynamically un- 
favorable reactions of anabolism are driven by energy released upon ATP hydrolysis. 

To illustrate this principle, we must first consider the three types of stoichi- 
ometries. The first two are fixed by the laws of chemistry, but the third is unique to 
living systems and reveals a fundamental difference between the inanimate world of 
chemistry and physics and the world of biological function, as shaped by evolution— 
that is, the world of living organisms. The fundamental difference is the stoichiome- 
try of ATP coupling. 


1.Reaction Stoichiometry This is simple chemical stoichiometry—the number of 
each kind of atom in any chemical reaction remains the same, and thus equal num- 
bers must be present on both sides of the equation. This requirement holds even 
for a process as complex as cellular respiration: 


CsH90,5 + 6 Op — 6 CO, + 6 H,O 


The six carbons in glucose appear as 6 COs, the 12 H of glucose appear as the 12 H 
in six molecules of water, and the 18 oxygens are distributed between CO, and H,O. 


2. Obligate Coupling Stoichiometry Cellular respiration is an oxidation—reduction 
process, and the oxidation of glucose is coupled to the reduction of NAD* and [FAD]. 
(Brackets here denote that the relevant FAD is covalently linked to succinate dehy- 


Stoichiometry is the measurement of the 
amounts of chemical elements and molecules 
involved in chemical reactions (from the Greek 
stoicheion, meaning “element,” and metria, mean- 
ing “measure’). 
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drogenase; see Chapter 20). The NADH and [FADH,] thus formed are oxidized in the 
electron-transport pathway: 


(a) C6H1206 + 10 NAD* + 2 [FAD] + 6 HO — 
6 CO, + 10 NADH + 10 H+ + 2 [FADH.] 


(b) 10 NADH + 10 H* + 2 [FADH,] + 6 Og. —>12 HO + 10 NAD* + 2 [FAD] 


Sequence (a) accounts for the oxidation of glucose via glycolysis and the citric acid 
cycle. Sequence (b) is the overall equation for electron transport per glucose. The 
stoichiometry of coupling by the biological e carriers, NAD* and FAD, is fixed by the 
chemistry of electron transfer; each of the coenzymes serves as an ¢ pair acceptor. Re- 
duction of each O atom takes an e pair. Metabolism must obey these facts of chem- 
istry: Biological oxidation of glucose releases 12 ¢ pairs, creating a requirement for 
12 equivalents of € pair acceptors, which transfer the electrons to 12 O atoms. By evo- 
lutionary chance, NAD*/NADH and FAD/FADH carry these electrons, but the stoi- 
chiometry is fixed by the chemistry. 


3. Evolved Coupling Stoichiometries The participation of ATP is fundamentally 
different from the role played by pyridine nucleotides and flavins. The stoichiome- 
try of adenine nucleotides in metabolic sequences is not fixed by chemical neces- 
sity. Instead, the “stoichiometries” we observe are the consequences of evolutionary 
design. The overall equation for cellular respiration,! including the coupled for- 
mation of ATP by oxidative phosphorylation, is 


CsH1206 + 6 Oo + 38 ADP + 38 P; —>6 CO; + 38 ATP + 44 HO 


The “stoichiometry” of ATP formation, 38 ADP + 38 P;—>38 ATP + 38 HO, can- 
not be predicted from any chemical considerations. The value of 38 ATP is an 
end result of biological adaptation. It is a trait that evolved through interactions 
between chemistry, heredity, and the environment over the course of evolution. 
Like any evolved character, ATP stoichiometry is the result of compromise. The 
final trait is one particularly suited to the fitness of the organism. 

The number 38 is not magical. Recall that in eukaryotes, the consensus value for 
the net yield of ATP per glucose is 30 to 32, not 38 (see Table 20.4). Also, the value 
of 38 was established a long time ago in evolution, when the prevailing atmospheric 
conditions and the competitive situation were undoubtedly very different from 
those today. The significance of this number is that it provides a high yield of ATP 
for each glucose molecule, yet the yield is still low enough that essentially all of the 
glucose is metabolized. 


ATP Coupling Stoichiometry Determines the K,,, for Metabolic Sequences 


The fundamental biological purpose of ATP as an energy-coupling agent is to drive 
thermodynamically unfavorable reactions. In effect, the energy release accompa- 
nying ATP hydrolysis is transmitted to the unfavorable reaction so that the overall 
free energy change for the coupled process is negative (that is, favorable). The in- 
volvement of ATP serves to alter the free energy change for a reaction; or to put it 
another way, the role of ATP is to change the equilibrium ratio of [reactants] to 
[products] for a reaction. (See the A Deeper Look box on page 67.) 

Another way of viewing these relationships is to note that, at equilibrium, the 
concentrations of ADP and P; will be vastly greater than that of ATP because A G°’ 
for ATP hydrolysis is a large negative number.” However, the cell where this reaction 


'This overall equation for cellular respiration is for the reaction within an uncompartmentalized 
(prokaryotic) cell. In eukaryotes, where much of the cellular respiration is compartmentalized within 
mitochondria, mitochondrial ADP/ATP exchange imposes a metabolic cost on the proton gradient 
of 1 H* per ATP, so the overall yield of ATP per glucose is 32, not 38. 

*Since AG” = —30.5 kJ/mol, In Keg = 12.3. So Keg = 2.2 X 10°. Choosing starting conditions of 
[ATP] = 8 mM, [ADP] = 8 mM, and [P;] = 1 mM, we can assume that, at equilibrium, [ATP] has 
fallen to some insignificant value x, [ADP] = approximately 16 mM, and [P;] = approximately 9 mM. 
The concentration of ATP at equilibrium, x, then calculates to be about 1 nM. 


27.3 Is There a Good Index of Cellular Energy Status? 


is at equilibrium is a dead cell. Living cells break down energy-yielding nutrient 
molecules to generate ATP. These catabolic reactions proceed with a very large 
overall decrease in free energy. Kinetic controls over the rates of the catabolic path- 
ways are designed to ensure that the [ATP]/([ADP][P;]) ratio is maintained very 
high. The cell, by employing kinetic controls over the rates of metabolic pathways, maintains 
a very high [ATP]/([ADP][P;]) ratio so that ATP hydrolysis can serve as the driving force for 
virtually all biochemical events. 


ATP Has Two Metabolic Roles 


The role of ATP in metabolism is twofold: 


1. It serves in a stoichiometric role to establish large equilibrium constants for meta- 
bolic conversions and to render metabolic sequences thermodynamically favor- 
able. This is the role referred to when we call ATP the energy currency of the cell. 

2. ATP also serves as an important allosteric effector in the kinetic regulation of me- 
tabolism. Its concentration (relative to those of ADP and AMP) is an index of the 
energy status of the cell and determines the rates of regulatory enzymes situated at 
key points in metabolism, such as PFK in glycolysis and FBPase in gluconeogenesis. 


27:30 Is There a Good Index of Cellular Energy Status? 


Energy transduction and energy storage in the adenylate system—ATP, ADP, and 
AMP—lie at the very heart of metabolism. The amount of ATP a cell uses per 
minute is roughly equivalent to the steady-state amount of ATP it contains. Thus, 
the metabolic lifetime of an ATP molecule is brief. ATP, ADP, and AMP are all im- 
portant effectors in exerting kinetic control on regulatory enzymes situated at key 
points in metabolism, so uncontrolled changes in their concentrations could have 
drastic consequences. The regulation of metabolism by adenylates in turn requires 
close control of the relative concentrations of ATP, ADP, and AMP. Some ATP- 
consuming reactions produce ADP; PFK and hexokinase are examples. Others lead 
to the formation of AMP, as in fatty acid activation by acyl-CoA synthetases: 


Fatty acid + ATP + coenzyme A——>AMP + PP; + fatty acyl-CoA 


Adenylate Kinase Interconverts ATP, ADP, and AMP 


Adenylate kinase (see Chapter 18), by catalyzing the reversible phosphorylation of 
AMP by ATP, provides a direct connection among all three members of the adeny- 
late pool: 


ATP + AMP =— 2 ADP 


The free energy of hydrolysis of a phosphoanhydride bond is essentially the same in 
ADP and ATP (see Chapter 3), and the standard free energy change for this reac- 
tion is close to zero. 


Energy Charge Relates the ATP Levels to the Total Adenine 
Nucleotide Pool 


The role of the adenylate system is to provide phosphoryl groups at high group- 
transfer potential in order to drive thermodynamically unfavorable reactions. The 
capacity of the adenylate system to fulfill this role depends on how fully charged it 
is with phosphoric anhydrides. Energy charge is an index of this capacity: 








1 2 [ATP] + [ADP] 
Energy charge ( ) 


2\ [ATP] + [ADP] + [AMP] 


The denominator represents the total adenylate pool ([ATP] + [ADP] + [AMP] ); the 
numerator is the number of phosphoric anhydride bonds in the pool, two for each 
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FIGURE 27.2 Relative concentrations of AMP ADP and 
ATP as a function of energy charge. (This graph was 
constructed assuming that the adenylate kinase reac- 
tion is at equilibrium and that AG” for the reaction is 
—473 J/mol; Keg = 1.2.) 
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FIGURE 27.3 Responses of regulatory enzymes to varia- 
tion in energy charge. 


ATP and one for each ADP. The factor } normalizes the equation so that energy 
charge, or E.C., has the range 0 to 1.0. If all the adenylate is in the form of ATP, E.C. = 
1.0, and the potential for phosphoryl transfer is maximal. At the other extreme, if AMP 
is the only adenylate form present, E.C. = 0. It is reasonable to assume that the adeny- 
late kinase reaction is never far from equilibrium in the cell. Then the relative amounts 
of the three adenine nucleotides are fixed by the energy charge. Figure 27.2 shows the 
relative changes in the concentrations of the adenylates as energy charge varies from 
0 to 1.0. 


Key Enzymes Are Regulated by Energy Charge 


Regulatory enzymes typically respond in reciprocal fashion to adenine nucleotides. 
For example, PFK is stimulated by AMP and inhibited by ATP. If the activities of var- 
ious regulatory enzymes are examined in vitro as a function of energy charge, an in- 
teresting relationship appears. Regulatory enzymes in energy-producing catabolic 
pathways show greater activity at low energy charge, but the activity falls off abruptly 
as E.C. approaches 1.0. In contrast, regulatory enzymes of anabolic sequences are 
not very active at low energy charge, but their activities increase exponentially as E.C. 
nears 1.0. These contrasting responses are termed R, for ATP-regenerating, and U, 
for ATP-utilizing (Figure 27.3). Regulatory enzymes such as PFK and pyruvate kinase 
in glycolysis follow the R response curve as E.C. is varied. Note that PFK itself is an 
ATP-utilizing enzyme, using ATP to phosphorylate fructose-6-phosphate to yield 
fructose-1,6-bisphosphate. Nevertheless, because PFK acts physiologically as the valve 
controlling the flux of carbohydrate down the catabolic pathways of cellular respira- 
tion that lead to ATP regeneration, it responds as an “R” enzyme to energy charge. 
Regulatory enzymes in anabolic pathways, such as acetyl-CoA carboxylase, which ini- 
tiates fatty acid biosynthesis, respond as “U” enzymes. 

The overall purposes of the R and U pathways are diametrically opposite in terms 
of ATP involvement. Note in Figure 27.3 that the R and U curves intersect at a 
rather high E.C. value. As E.C. increases past this point, R activities decline precip- 
itously and U activities rise. That is, when E.C. is very high, biosynthesis is acceler- 
ated while catabolism diminishes. The consequence of these effects is that ATP is 
used up faster than it is regenerated, and so E.C. begins to fall. As E.C. drops below 
the point of intersection, R processes are favored over U. Then, ATP is generated 
faster than it is consumed, and E.C. rises again. The net result is that the value of 
energy charge oscillates about a point of steady state (Figure 27.3). The experi- 
mental results obtained from careful measurement of the relative amounts of AMP, 
ADP, and ATP in living cells reveals that normal cells have an energy charge in the 
neighborhood of 0.85 to 0.88. Maintenance of this steady-state value is one criterion 
of cell health and normalcy. 


Phosphorylation Potential Is a Measure of Relative ATP Levels 


Because energy charge is maintained at a relatively constant value in normal cells, it 
is not an informative index of cellular capacity to carry out phosphorylation reactions. 
The relative concentrations of ATP, ADP, and P; do provide such information, and a 
function called phosphorylation potential has been defined in terms of these con- 
centrations: 


ADP + P; == ATP + HO 


Phosphorylation potential, I’, is equal to [ATP] /([ADP] [P;]). 

Note that this expression includes a term for the concentration of inorganic 
phosphate. [P;] has substantial influence on the thermodynamics of ATP hydroly- 
sis. In contrast with energy charge, phosphorylation potential varies over a signifi- 
cant range as the actual proportions of ATP, ADP, and P; in cells vary in response to 
metabolic state. [ ranges from 200 to 800 M“', or more, with higher levels signify- 
ing more ATP and correspondingly greater phosphorylation potential. 
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27:4: How Is Overall Energy Balance Regulated in Cells? 


AMP-activated protein kinase (AMPK) is the cellular energy sensor. Metabolic in- 
puts to this sensor determine whether its output, protein kinase activity, takes place. 
When cellular energy levels are high, as signaled by high ATP concentrations, 
AMPK is inactive. When cellular energy levels are depleted, as signaled by high 
[AMP], AMPK is allosterically activated and phosphorylates many targets control- 
ling cellular energy production and consumption. Recall that, due to the nature of 
the adenylate kinase equilibrium (see pages 542-543), AMP levels increase expo- 
nentially as ATP levels decrease. AMP is an allosteric activator of AMPK, whereas 
ATP at high levels acts as an allosteric inhibitor by displacing AMP from the allo- 
steric site. Thus, competition between AMP and ATP for binding to the AMPK 
allosteric sites determines the activity of AMPK. Activation of AMPK (1) sets in mo- 
tion catabolic pathways leading to ATP synthesis and (2) shuts down pathways that 
consume ATP energy, such as biosynthesis and cell growth. 

AMPK is an afy-heterotrimer (Figure 27.4). The a-subunit is the catalytic sub- 
unit; it has an N-terminal Ser/Thr protein kinase domain and a C-terminal 
By-binding domain. The -subunit has at its C-terminus an ay-binding domain. 
The y-subunit is the regulatory subunit; it has a pair of allosteric sites where either 
AMP or ATP binds. These sites are located toward its C-terminus in the form of 
four CBS domains (so named for their homology to cystathionine-B-synthase). These 
CBS domains act in pairs to form structures known as Bateman modules. The 
Bateman modules provide the binding sites for the allosteric ligands, AMP and 
ATP. AMP binding to these sites is highly cooperative, such that binding of AMP to 
one module markedly enhances AMP-binding at the other. This cooperativity ren- 
ders AMPK exquisitely sensitive to changes in AMP concentration. 

AMP binding to AMPK increases its protein kinase activity by more than 1000-fold. 
The underlying mechanism involves a pseudosubstrate sequence (see the Protein 
Kinases: Target Recognition and Intrasteric Control section, page 461) within CBS do- 
main 2 that fits into the a-subunit catalytic site. When AMP binds to the Bateman mod- 
ules, conformational changes in the y-subunit displace the pseudosubstrate sequence 
from the kinase catalytic site, freeing it to act. The structural relationships between the 
AMPK subunits can be seen in the Schizosaccharomyces pombe aß y complex (Figure 27.5). 


FIGURE 27.4 Domain structure of the AMP-activated 
protein kinase subunits. (Adapted from Figure 1 in Hardie, 
D.G., Hawley, S. A., and Scott, J.W., 2006. AMP-activated protein 
kinase: Development of the energy sensor concept. Journal of 
Physiology 574:7-15.) 





FIGURE 27.5 Core structure of the Schizosaccharomyces 
pombe AMPK heterotrimer. The a-subunit is green, the 
B-subunit is yellow, and the y-subunit is white. A bound 
AMP (red) is also shown. A second AMP-binding site 
(vacant) lies directly above this AMP (pdb id = 200X). 
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FIGURE 27.6 AMPK regulation of energy production and 
consumption in mammals. (Adapted from Figure 1 in Kahn, 
B. B., Alquier, T., Carling, D., and Hardie, D. G., 2005. AMP-activated 
protein kinase: Ancient energy gauge provides clues to modern 
understanding of metabolism. Cell Metabolism 1:15-25.) 


Actually, AMP activates AMPK in two ways: First, it is an allosteric activator; 
second, AMP binding favors phosphorylation of Thr!” within the a-subunit kinase 
domain. Phosphorylation of Thr!” is necessary for a-subunit protein kinase activity. 
Thr!” lies within the activation loop of the kinase; activation loops are common fea- 
tures of protein kinases whose activation requires phosphorylation by other protein 
kinases. Both of these favorable actions by AMP are reversed if ATP displaces AMP 
from the allosteric site. 


AMPK Targets Key Enzymes in Energy Production and Consumption 


Activation of AMPK leads to phosphorylation of many key enzymes in energy me- 
tabolism. Those involved in energy production that are activated upon phosphory- 
lation by AMPK include phosphofructokinase-2 (PFK-2; see Chapter 22). In contrast 
to protein kinase A phosphorylation of PFK-2, AMPK phosphorylation of liver 
PFK-2 enhances fructose-2,6-bisphosphate synthesis, which in turn stimulates gly- 
colysis. Enzymes involved in energy consumption that are down-regulated upon 
phosphorylation by AMPK include glycogen synthase (see Chapter 22), acetyl-CoA 
carboxylase (which catalyzes the committed step in fatty acid biosynthesis; see Chap- 
ter 24), and 3-hydroxy-3-methylglutaryl-CoA reductase, which carries out the key 
regulatory reaction in cholesterol biosynthesis (see Chapter 24). Further, AMPK 
phosphorylation of various transcription factors leads to diminished expression of 
genes encoding biosynthetic enzymes and elevated expression of catabolic genes. 


AMPK Controls Whole-Body Energy Homeostasis 


Beyond these cellular effects, AMPK plays a central role in energy balance in multi- 
cellular organisms (Figure 27.6). AMPK in skeletal muscle is activated by hormones 
such as adiponectin and leptin, adipocyte-derived hormones that govern eating be- 
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havior and energy homeostasis (see section 27.6). Physical activity (exercise) also ac- 
tivates muscle AMPK. In turn, skeletal muscle AMPK activates glucose uptake, fatty 
acid oxidation, and mitochondrial biogenesis through its phosphorylation of meta- 
bolic enzymes and transcription factors that control expression of genes involved in 
energy production and consumption. AMPK’s actions in the liver lead to lowered 
ATP (energy) consumption through down-regulation of fatty acid synthesis, choles- 
terol synthesis, and gluconeogenesis. Metformin, a widely used drug for the treat- 
ment of type 2 diabetes (page 668), lowers blood glucose levels through inhibition 
of liver gluconeogenesis; metformin achieves this result through activation of 
AMPK. AMPK blocks insulin secretion by pancreatic B-cells; insulin is a hormone 
that favors energy storage (glycogen and fat synthesis). AMPK is also a master reg- 
ulator of eating behavior through its activity in the hypothalamus, the key center for 
regulation of food intake. These effects of AMPK are described in section 27.6. 


27255 How Is Metabolism Integrated 
in a Multicellular Organism? 


In complex multicellular organisms, organ systems have arisen to carry out specific 
physiological functions. Each organ expresses a repertoire of metabolic pathways that 
is consistent with its physiological purpose. Such specialization depends on coordina- 
tion of metabolic responsibilities among organs so that the organism as a whole may 
thrive. Essentially all cells in animals have the set of enzymes common to the central 
pathways of intermediary metabolism, especially the enzymes involved in the forma- 
tion of ATP and the synthesis of glycogen and lipid reserves. Nevertheless, organs dif- 
fer in the metabolic fuels they prefer as substrates for energy production. Important 
differences also occur in the ways ATP is used to fulfill the organs’ specialized meta- 
bolic functions. To illustrate these relationships, we will consider the metabolic inter- 
actions among the major organ systems found in humans: brain, skeletal muscle, heart, 
adipose tissue, and liver. In particular, the focus will be on energy metabolism in these 
organs (Figure 27.7). The major fuel depots in animals are glycogen in liver and mus- 
cle; triacylglycerols (fats) stored in adipose tissue; and protein, most of which is in skele- 
tal muscle. In general, the order of preference for the use of these fuels is the order 
given: glycogen > triacylglycerol > protein. Nevertheless, the tissues of the body work 
together to maintain energy homeostasis (caloric homeostasis), defined as a constant 
availability of fuels in the blood. 


The Major Organ Systems Have Specialized Metabolic Roles 


Table 27.1 summarizes the energy metabolism of the major human organs. 


Brain The brain has two remarkable metabolic features. First, it has a very high 
respiratory metabolism. In resting adult humans, 20% of the oxygen consumed is 
used by the brain, even though it constitutes only 2% or so of body mass. Interest- 
ingly, this level of oxygen consumption is independent of mental activity, continu- 
ing even during sleep. Second, the brain is an organ with no significant fuel re- 
serves—no glycogen, expendable protein, or fat (even in “fatheads”!). Normally, 
the brain uses only glucose as a fuel and is totally dependent on the blood for a con- 
tinuous incoming supply. Interruption of glucose supply for even brief periods of 
time (as in a stroke) can lead to irreversible losses in brain function. The brain uses 
glucose to carry out ATP synthesis via cellular respiration. High rates of ATP pro- 
duction are necessary to power the plasma membrane Na*,K*-ATPase so that the 
membrane potential essential for transmission of nerve impulses is maintained. 
During prolonged fasting or starvation, the body’s glycogen reserves are depleted. 
Under such conditions, the brain adapts to use B-hydroxybutyrate (Figure 27.8) as 
a source of fuel, converting it to acetyl-CoA for energy production via the citric acid 
cycle. 6-Hydroxybutyrate (see Chapter 23) is formed from fatty acids in the liver. 
Although the brain cannot use free fatty acids or lipids directly from the blood as 
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FIGURE 27.7 Metabolic relationships among the major 
human organs. 












TABLE 27.1 jy Metab i 
Energy Energy Sources 
Organ Reservoir Preferred Substrate Exported 
Brain None Glucose (ketone bodies None 
during starvation) 
Skeletal muscle Glycogen Fatty acids None 
(resting) 
Skeletal muscle None Glucose from glycogen Lactate 
(strenuous 
exercise) 
Heart muscle Glycogen Fatty acids None 
Adipose tissue Triacylglycerol Fatty acids Fatty acids, 
glycerol 
Liver Glycogen, Amino acids, glucose, Fatty acids, 
triacylglycerol fatty acids glucose, ketone 
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fuel, the conversion of these substances to 6-hydroxybutyrate in the liver allows the 
brain to use body fat as a source of energy. The brain’s other potential source of fuel 
during starvation is glucose obtained from gluconeogenesis in the liver (see Chapter 
22), using the carbon skeletons of amino acids derived from muscle protein break- 
down. The adaptation of the brain to use B-hydroxybutyrate from fat spares protein 
from degradation until lipid reserves are exhausted. 


Muscle Skeletal muscle is responsible for about 30% of the Oo consumed by the 
human body at rest. During periods of maximal exertion, skeletal muscle can ac- 
count for more than 90% of the total metabolism. Muscle metabolism is primarily 
dedicated to the production of ATP as the source of energy for contraction and re- 
laxation. Muscle contraction occurs when a motor nerve impulse causes Ca?* re- 
lease from specialized endomembrane compartments (the transverse tubules and 
sarcoplasmic reticulum). Ca?* floods the sarcoplasm (the term denoting the cytosolic 
compartment of muscle cells), where it binds to troponin C, a regulatory protein, 
initiating a series of events that culminate in the sliding of myosin thick filaments 
along actin thin filaments. This mechanical movement is driven by energy released 
upon hydrolysis of ATP (see Chapter 16). The net result is that the muscle shortens. 
Relaxation occurs when the Ca?* ions are pumped back into the sarcoplasmic retic- 
ulum by the action of a Ca?*-transporting membrane ATPase. Two Ca?* ions are 
translocated per ATP hydrolyzed. The amount of ATP used during relaxation is al- 
most as much as that consumed during contraction. 

Because muscle contraction is an intermittent process that occurs upon de- 
mand, muscle metabolism is designed for a demand response. Muscle at rest uses 
free fatty acids, glucose, or ketone bodies as fuel and produces ATP via oxidative 
phosphorylation. Resting muscle also contains about 2% glycogen and about 
0.08% phosphocreatine by weight (Figure 27.9). When ATP is used to drive 
muscle contraction, the ADP formed can be reconverted to ATP by creatine kinase 
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CENGAGENOW” ANIMATED FIGURE 27.9 Phospho- 
creatine serves as a reservoir of ATP-synthesizing poten- 
tial. See this figure animated at www.cengage.com/ 
login. 
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at the expense of phosphocreatine. Muscle phosphocreatine can generate enough 
ATP to power about 4 seconds of exertion. During strenuous exertion, such as a 
100-meter sprint, once the phosphocreatine is depleted, muscle relies solely on its 
glycogen reserves, making the ATP for contraction via glycolysis. In contrast with 
the citric acid cycle and oxidative phosphorylation pathways, glycolysis is capable 
of explosive bursts of activity, and the flux of glucose-6-phosphate through this 
pathway can increase 2000-fold almost instantaneously. The triggers for this activa- 
tion are Ca** and the “fight or flight” hormone epinephrine (see Chapters 22 and 
32). Little interorgan cooperation occurs during strenuous (anaerobic) exercise. 
Muscle fatigue is the inability of a muscle to maintain power output. During max- 
imum exertion, the onset of fatigue takes only 20 seconds or so. Fatigue is not the 
result of exhaustion of the glycogen reserves, nor is it a consequence of lactate ac- 
cumulation in the muscle. Instead, it is caused by a decline in intramuscular pH as 
protons are generated during glycolysis. (The overall conversion of glucose to 2 lac- 
tate in glycolysis is accompanied by the release of 2 H*.) The pH may fall as low as 
6.4. It is likely that the decline in PFK activity at low pH leads to a lowered flux of 
hexose through glycolysis and inadequate ATP levels, causing a feeling of fatigue. 
One benefit of PFK inhibition is that the ATP remaining is not consumed in the 
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Athletic Performance Enhancement with Creatine Supplements? 


The creatine pool in a 70-kg (1541b) human body is about 120 
grams. This pool includes dietary creatine (from meat) and crea- 
tine synthesized by the human body from its precursors (arginine, 
glycine, and methionine). Of this creatine, 95% is stored in the 
skeletal and smooth muscles, about 70% of which is in the form of 
phosphocreatine. Supplementing the diet with 20 to 30 grams of 
creatine per day for 4 to 21 days can increase the muscle creatine 
pool by as much as 50% in someone with a previously low creatine 
level. Thereafter, supplements of 2 grams per day will maintain ele- 
vated creatine stores. Studies indicate that creatine supplementa- 
tion gives some improvement in athletic performance during high- 
intensity, short-duration events (such as weight lifting), but no 
benefit in endurance events (such as distance running). The dis- 
tinction makes sense in light of phosphocreatine’s role as the sub- 
strate that creatine kinase uses to regenerate ATP from ADP. In- 
tense muscular activity quickly (less than 2 seconds) exhausts ATP 
supplies; [phosphocreatine ]muscie is sufficient to restore ATP levels 
for a few extra seconds, but no more. The U.S. Food and Drug Ad- 
ministration advises consumers to consult with their doctors before 
using creatine as a dietary supplement. 
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PFK reaction, thereby sparing the cell from the more serious consequences of los- 
ing all of its ATP. 

During fasting or excessive activity, skeletal muscle protein is degraded to 
amino acids so that their carbon skeletons can be used as fuel. Many of the skele- 
tons are converted to pyruvate, which can be transaminated back into alanine for 
export via the circulation (Figure 27.10). Alanine is carried to the liver, which in 
turn deaminates it back into pyruvate so that it can serve as a substrate for gluco- 
neogenesis. Although muscle protein can be mobilized as an energy source, it is 
not efficient for an organism to consume its muscle and lower its overall fitness 
for survival. Muscle protein represents a fuel of last resort. 


Heart In contrast with the intermittent work of skeletal muscle, the activity of heart 
muscle is constant and rhythmic. The range of activity in heart is also much less than 
that in muscle. Consequently, the heart functions as a completely aerobic organ and, 
as such, is very rich in mitochondria. Roughly half the cytoplasmic volume of heart 
muscle cells is occupied by mitochondria. Under normal working conditions, the 
heart prefers fatty acids as fuel, oxidizing acetyl-CoA units via the citric acid cycle and 
producing ATP for contraction via oxidative phosphorylation. Heart tissue has min- 
imal energy reserves: a small amount of phosphocreatine and limited quantities of 
glycogen. As a result, the heart must be continually nourished with oxygen and free 
fatty acids, glucose, or ketone bodies as fuel. 


Adipose Tissue Adipose tissue is an amorphous tissue that is widely distributed 
about the body—around blood vessels, in the abdominal cavity and mammary 
glands, and most prevalently, as deposits under the skin. Long considered merely a 
storage depot for fat, adipose tissue is now appreciated as an endocrine organ re- 
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Fat-Free Mice—A Snack Food for Pampered Pets? No, A Model for One 
Form of Diabetes 





Scientists at the National Institutes of Health have created transgenic 
mice that lack white adipose tissue throughout their lifetimes. These 
mice were created by blocking the normal differentiation of stem 
cells into adipocytes so that essentially no white adipose tissue can be 
formed in these animals. These “fat-free” mice have double the food 
intake and five times the water intake of normal mice. Fat-free mice 
also show decreased physical activity and must be kept warm on little 
heating pads to survive, because they lack insulating fat. They are 
also diabetic, with three times normal blood glucose and triacylglyc- 
erol levels and only 5% of normal leptin levels; they die prematurely. 
Like type 2 diabetic patients, fat-free mice have markedly elevated in- 


sulin levels (50-400 times normal) but are unresponsive to insulin. 
These mice serve as an excellent model for the disease lipoatrophic 
diabetes, an inherited disease characterized by the absence of adipose 
tissue and severe diabetic symptoms. Indeed, transplantation of adi- 
pose tissue into these fat-free mice cured their diabetes. As the ma- 
jor organ for triacylglycerol storage, white adipose tissue helps con- 
trol energy homeostasis (food intake and energy expenditure) via 
the release of leptin and other hormonelike substances (see the dis- 
cussion on page 855). Clearly, absence of adipose tissue has wide- 
spread, harmful consequences for metabolism. 





852 Chapter 27 Metabolic Integration and Organ Specialization 


sponsible for secretion of a variety of hormones that govern eating behavior and 
caloric homeostasis. It consists principally of cells known as adipocytes that no 
longer replicate. However, adipocytes can increase in number as adipocyte precur- 
sor cells divide, and obese individuals tend to have more of them. As much as 65% 
of the weight of adipose tissue is triacylglycerol that is stored in adipocytes, essen- 
tially as oil droplets. The average 70-kg man has enough caloric reserve stored as fat 
to sustain a 6000 kJ/day rate of energy production for 3 months, which is adequate 
for survival, assuming no serious metabolic aberrations (such as nitrogen, mineral, 
or vitamin deficiencies). Despite their role as energy storage depots, adipocytes 
have a high rate of metabolic activity, synthesizing and breaking down triacyl- 
glycerol so that the average turnover time for a triacylglycerol molecule is just a few 
days. Adipocytes actively carry out cellular respiration, transforming glucose to en- 
ergy via glycolysis, the citric acid cycle, and oxidative phosphorylation. If glucose lev- 
els in the diet are high, glucose is converted to acetyl-CoA for fatty acid synthesis. 
However, under most conditions, free fatty acids for triacylglycerol synthesis are ob- 
tained from the liver. Because adipocytes lack glycerol kinase, they cannot recycle 
the glycerol of triacylglycerol but rather depend on glycolytic conversion of glucose 
to dihydroxyacetone-3-phosphate (DHAP) and the reduction of DHAP to glycerol- 
3-phosphate for triacylglycerol biosynthesis. Adipocytes also require glucose to feed 
the pentose phosphate pathway for NADPH production. 

Glucose plays a pivotal role for adipocytes. If glucose levels are adequate, 
glycerol-3-phosphate is formed in glycolysis and the free fatty acids liberated in tri- 
acylglycerol breakdown are re-esterified to glycerol to re-form triacylglycerols. How- 
ever, if glucose levels are low, [glycerol-3-phosphate] falls and free fatty acids are re- 
leased to the bloodstream (see Chapter 23). 


“Brown Fat” A specialized type of adipose tissue, so-called brown fat, is found in new- 
borns and hibernating animals. The abundance of mitochondria, which are rich in 
cytochromes, is responsible for the brown color of this fat. As usual, these mito- 
chondria are very active in electron transport-—driven proton translocation, but these 
particular mitochondria contain in their inner membranes a protein, thermogenin, 
also known as uncoupling protein 1 (see Chapter 20), that creates a passive proton 
channel, permitting the H* ions to reenter the mitochondrial matrix without gen- 
erating ATP. Instead, the energy of oxidation is dissipated as heat. Indeed, brown fat 
is specialized to oxidize fatty acids for heat production rather than ATP synthesis. 


Liver The liver serves as the major metabolic processing center in vertebrates. Ex- 
cept for dietary triacylglycerols, which are metabolized principally by adipose tissue, 
most of the incoming nutrients that pass through the intestinal tract are routed via 
the portal vein to the liver for processing and distribution. Much of the liver’s activity 
centers around conversions involving glucose-6-phosphate (Figure 27.11). Glucose-6- 
phosphate can be converted to glycogen, released as blood glucose, used to generate 
NADPH and pentoses via the pentose phosphate cycle, or catabolized to acetyl-CoA 
for fatty acid synthesis or for energy production via oxidative phosphorylation. Most 
of the liver glucose-6-phosphate arises from dietary carbohydrate, from degradation 
of glycogen reserves, or from muscle lactate that enters the gluconeogenic pathway. 

The liver plays an important regulatory role in metabolism by buffering the 
level of blood glucose. Liver has two enzymes for glucose phosphorylation: hexo- 
kinase and glucokinase (type-IV hexokinase). Unlike hexokinase, glucokinase has 
a low affinity for glucose. Its K„ for glucose is high, on the order of 10 mM. When 
blood glucose levels are high, glucokinase activity augments hexokinase in phos- 
phorylating glucose as an initial step leading to its storage in glycogen. The major 
metabolic hormones—epinephrine, glucagon, and insulin—all influence glucose 
metabolism in the liver to keep blood glucose levels relatively constant (see Chap- 
ters 22 and 32). 

The liver is a major center for fatty acid turnover. When the demand for meta- 
bolic energy is high, triacylglycerols are broken down and fatty acids are degraded 
in the liver to acetyl-CoA to form ketone bodies, which are exported to the heart, 
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brain, and other tissues. If energy demands are low, fatty acids are incorporated into 
triacylglycerols that are carried to adipose tissue for deposition as fat. Cholesterol is 
also synthesized in the liver from two-carbon units derived from acetyl-CoA. 

In addition to these central functions in carbohydrate and fat-based energy me- 
tabolism, the liver serves other purposes. For example, the liver can use amino acids 
as metabolic fuels. Amino acids are first converted to their corresponding a-keto acids 
by aminotransferases. The amino group is excreted after incorporation into urea in 
the urea cycle. The carbon skeletons of glucogenic amino acids can be used for glu- 
cose synthesis, whereas those of ketogenic amino acids appear in ketone bodies (see 
Figure 25.41). The liver is also the principal detoxification organ in the body. The 
endoplasmic reticulum of liver cells is rich in enzymes that convert biologically active 
substances such as hormones, poisons, and drugs into less harmful by-products. 

Liver disease leads to serious metabolic derangements, particularly in amino acid 
metabolism. In cirrhosis, the liver becomes defective in converting NH," to urea for 
excretion, and blood levels of NH4* rise. Ammonia is toxic to the central nervous 
system, and coma ensues. 


27265 What Regulates Our Eating Behavior? 


Approximately 65% of Americans are overweight, and one in three Americans is clin- 
ically obese (overweight by 20% or more). Obesity is the single most important cause 
of type 2 (adult-onset insulin-independent) diabetes. Research into the regulatory 
controls that govern our feeding behavior has become a medical urgency with great 
financial incentives, given the epidemic proportions of obesity and widespread pre- 
occupation with dieting and weight loss. 


The Hormones That Control Eating Behavior Come From 
Many Different Tissues 


Appetite and weight regulation are determined by a complex neuroendocrine sys- 
tem that involves hormones produced in the stomach, small intestines, pancreas, adi- 
pose tissue, and central nervous system. These hormones act in the brain, principally 
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FIGURE 27.11 Metabolic conversions of glucose-6- 
phosphate in the liver. 
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on neurons within the arcuate nucleus region of the hypothalamus. The arcuate nu- 
cleus is an anatomically distinct brain area that functions in homeostasis of body 
weight, body temperature, blood pressure, and other vital functions (Figure 27.12). 
The neurons respond to these signals by activating, or not, pathways involved in eat- 
ing (food intake) and energy expenditure. Hormones that regulate eating behavior 
can be divided into short-term regulators that determine individual meals and long- 
term regulators that act to stabilize the levels of body fat deposits. Two subsets of 
neurons are involved: (1) the NPY/AgRP-producing neurons that release NPY 
(neuropeptide Y), the protein that stimulates the neurons that trigger eating behav- 
ior, and (2) the melanocortin-producing neurons, whose products inhibit the neu- 
rons initiating eating behavior. AgRP is agouti-related peptide, a protein that blocks 
the activity of melanocortin-producing neurons. Melanocortins are a group of peptide 
hormones that includes aœ- and B-melanocyte-stimulating hormones (@-MSH and 
B-MSH). Melanocortins act on melanocortin receptors (MCRs), which are members 
of the 7-TMS G-protein-coupled receptor (GPCR) family of membrane receptors; 
MCRs trigger cellular responses through adenylyl cyclase activation (see Chapters 15 
and 32). 


Ghrelin and Cholecystokinin Are Short-Term Regulators 
of Eating Behavior 


Short-term regulators of eating include ghrelin and cholecystokinin. Ghrelin is an 
appetite-stimulating peptide hormone produced in the stomach. Production of 
ghrelin is maximal when the stomach is empty, but ghrelin levels fall quickly once 
food is consumed. Cholecystokinin is a peptide hormone released from the gas- 
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FIGURE 27.12 The regulatory pathways that control eating. (Adapted from Figure 1 in Schwartz, M.W, and Morton, 
G. J, 2002. Obesity: Keeping hunger at bay. Nature 418:595-597.) 
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HUMAN BIOCHEMISTRY 


Ethanol metabolism alters the NAD*/NADH ratio. Ethanol is 
metabolized to acetate in the liver by alcohol dehydrogenase and 
aldehyde dehydrogenase: 


CH;CH,OH + NAD+ == CH;CHO + NADH + H+ 
CH;CHO + NAD* + HO == CH;COO- + NADH + 2H* 


Acetate is then converted to acetyl-CoA. Excessive conversion of 
available NAD* to NADH impairs NAD*-requiring reactions, such 
as the citric acid cycle, gluconeogenesis, and fatty acid oxidation. 
Accumulation of acetyl-CoA favors fatty acid synthesis, which, 





The Metabolic Effects of Alcohol Consumption 


glycerol levels in the liver. Over time, these triacylglycerols accu- 
mulate as fatty deposits, which ultimately contribute to cirrhosis of 
the liver. Impairment of gluconeogenesis leads to buildup of this 
pathway’s substrate, lactate. Lactic acid accumulation in the blood 
causes acidosis. A further consequence is that acetaldehyde can 
form adducts with protein —NHg, groups, which may inhibit pro- 
tein function. Because gluconeogenesis is limited, alcohol con- 
sumption can cause hypoglycemia (low blood sugar) in someone 
who is undernourished. In turn, hypoglycemia can cause irre- 
versible damage to the central nervous system. 


along with blockage of fatty acid oxidation, causes elevated triacyl- 








trointestinal tract during eating. In contrast to ghrelin, cholecystokinin signals sati- 
ety (the sense of fullness) and tends to curtail further eating. Together, ghrelin and 
cholecystokinin constitute a meal-to-meal control system that regulates the onset 
and end of eating behavior. The activity of this control system is also modulated by 
the long-term regulators. 


Insulin and Leptin Are Long-Term Regulators of Eating Behavior 


Long-term regulators include insulin and leptin, both of which inhibit eating and 
promote energy expenditure. Insulin is produced in the B-cells of the pancreas 
when blood glucose levels rise. A major role of insulin is to stimulate glucose uptake 
from the blood into muscle, fat, and other tissues. Blood insulin levels correlate with 
body fat amounts. Insulin also stimulates fat cells to make leptin. Leptin (from the 
Greek word lepto, meaning “thin”) is a 16-kD, 146—amino acid residue protein pro- 
duced principally in adipocytes (fat cells). Leptin has a four-helix bundle tertiary 
structure similar to that of cytokines (protein hormones involved in cell-cell com- 
munication). Normally, as fat deposits accumulate in adipocytes, more and more lep- 
tin is produced in these cells and spewed into the bloodstream. Leptin levels in the 
blood communicate the status of triacylglycerol levels in the adipocytes to the cen- 
tral nervous system so that appropriate changes in appetite take place. If leptin lev- 
els are low (“starvation”), appetite increases; if leptin levels are high (“overfeeding”), 
appetite is suppressed. Leptin also regulates fat metabolism in adipocytes, inhibiting 
fatty acid biosynthesis and stimulating fat metabolism. In the latter case, leptin in- 
duces synthesis of the enzymes in the fatty acid oxidation pathway and increases ex- 
pression of uncoupling protein 2 (UCP2), a mitochondrial protein that uncouples oxi- 
dation from phosphorylation so that the energy of oxidation is lost as heat 
(thermogenesis). Leptin binding to leptin receptors in the hypothalamus inhibits re- 
lease of NPY. Because NPY is a potent orexic (appetite-stimulating) peptide hormone, 
leptin is therefore an anorexic (appetite-suppressing) agent. Functional leptin recep- 
tors are also essential for pituitary function, growth hormone secretion, and normal 
puberty. When body fat stores decline, the circulating levels of leptin and insulin also 
decline. Hypothalamic neurons sense this decline and act to increase appetite to re- 
store body fat levels. 

Intermediate regulation of eating behavior is accomplished by the gut hormone 
PYY3_35. PYY3.36 is produced in endocrine cells found in distal regions of the small in- 
testine, areas that receive ingested food some time after a meal is eaten. PYY336 
inhibits eating for many hours after a meal by acting on the NPY/AgRP-producing 
neurons in the arcuate nucleus. Clearly, the regulatory controls that govern eating 
are complex and layered. Some believe that defects in these controls are common 
and biased in favor of overeating, an advantageous evolutionary strategy that may 
have unforeseen consequences in these bountiful times. 
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AMPK Mediates Many of the Hypothalamic Responses 
to These Hormones 


The actions of leptin, ghrelin, and NPY converge at AMPK. Leptin inhibits AMPK 
activity in the arcuate nucleus, and this inhibition underlies the anorexic effects of 
leptin. Leptin action on AMPK depends on a particular melanocortin receptor type 
known as the melanocortin-4 receptor (MC4R). On the other hand, ghrelin and 
NPY activate hypothalamic AMPK, which stimulates food intake and leads, over 
time, to increased body weight. The effects of AMPK in the hypothalamus that lead 
to alterations in eating behavior may be mediated through changes in malonyl-CoA 
levels. Low [malonyl-CoA] in hypothalamic neurons is associated with increased 
food intake, and elevated malonyl-CoA levels are associated with suppression of eat- 
ing. The inhibition of acetyl-CoA carboxylase (and thus, malonyl-CoA synthesis) as 
a result of phosphorylation by AMPK plays an important part in the regulation of 
our eating behavior. 


271% Can You Really Live Longer by Eating Less? 


Caloric Restriction Leads to Longevity 


Nutritional studies published in 1935 showed that rats fed a low-calorie, but balanced 
and nutritious diet lived nearly twice as long as rats with unlimited access to food 
(2.4 years versus 1.3 years). Subsequent research over the ensuing 70 years has shown 
that the relationship between diet and longevity is a general one for organisms from 
yeast to mammals. To achieve this effect of caloric restriction (CR), animals are given 
a level of food that amounts to 60% to 70% of what they would eat if they were al- 
lowed free access to food. In animals, CR results in lower blood glucose levels, de- 
clines in glycogen and fat stores, enhanced responsiveness to insulin, lower body 
temperature, and diminished reproductive capacity. The extended life span given by 
CR offers a definite evolutionary advantage: Any animal that could slow the aging 
process and postpone reproduction in times of food scarcity and then resume re- 
production when food became available would out-compete animals without such 
ability. 

Another remarkable feature of CR is that it diminishes the likelihood for develop- 
ment of many age-related diseases, such as cancer, diabetes, and atherosclerosis. Is this 
benefit simply the result of lowered caloric intake, or does CR lead to significant reg- 
ulatory changes that affect many aspects of an organism’s physiology? The answer to 
this and other questions is emerging from vigorous research efforts toward under- 
standing CR. 


Mutations in the S/R2 Gene Decrease Life Span 


Many important clues came from genetic investigations. Deletion of a gene termed 
SIR2 (for silent information regulator 2) abolished the ability of CR to lengthen life 
span in yeast and roundworms, implicating the S/R2 gene product in longevity. SIR2 
originally was discovered through its ability to silence the transcription of genes that 
encode rRNA. S/R2-related genes are found in some prokaryotes and virtually all 
eukaryotes, including yeast, nematodes, fruit flies, and humans. The human gene is 
designated SIRT], for sirtuin 1; sirtuin is the generic name for proteins encoded by 
SIR2 genes. Sirtuins are NAD*-dependent protein deacetylases (Figure 27.13). Cleav- 
age of acetyl groups from proteins is an exergonic reaction, as is cleavage of the 
Nelycosidic bond in NAD* (AG = —34 kJ/mol). Thus, involvement of NAD* in 
the reaction is not a thermodynamic necessity. However, NAD* involvement does 
couple the reaction to an important signal of metabolic status, namely, the 
NAD*/NADH ratio. Furthermore, both nicotinamide and NADH are potent in- 
hibitors of the deacelylase reaction. Thus, the NAD*/NADH ratio controls sirtuin 
protein deacetylase activity, so oxidative metabolism, which drives conversion of 
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FIGURE 27.13 The NAD*-dependent protein deacetyl- 
ase reaction of sirtuins. Acetylated peptides include 
N-acetyl lysine side chains of histone H3 and H4 and 
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NADH to NAD+, enhances sirtuin activity. One adaptive response found with CR is 
increased mitochondrial biogenesis in liver, fat, and muscle, which is a response that 
would raise the NAD*/NADH ratio. 

Sirtuin-catalyzed removal of acetyl groups from lysine residues of histones, the 
core proteins of nucleosomes, allows the nucleosomes to interact more strongly 
with DNA, making transcription more difficult (see Chapter 29 for a discussion of 
the relationship between histone acetylation and transcriptional activity of genes). 


SIRT1 Is a Key Regulator in Caloric Restriction 


As a key regulator in CR, the human sirtuin protein SIRT] connects nutrient avail- 
ability to the expression of metabolic genes. A striking feature of CR is the loss of fat 
stores and reduction in white adipose tissue (WAT). SIRTI participates in the tran- 
scriptional regulation of adipogenesis through interaction with PPARy (peroxisome 
proliferator-activator receptor-y), a nuclear hormone receptor that activates tran- 
scription of genes involved in adipogenesis and fat storage. SIRT1 binding to PPARy 
represses transcription of these genes, leading to loss of fat stores. Because adipose 
tissue functions as an endocrine organ, this loss of fat has significant hormonal con- 
sequences for energy metabolism. 

In liver, SIRT1 interacts with and deacetylates PGC-1 (peroxisome proliferator- 
activator receptor-y coactivator-1), a transcriptional regulator of genes involved in 
glucose production. Thus, CR leads to increased transcription of the genes encod- 
ing the enzymes of gluconeogenesis and repression of genes encoding glycolytic en- 
zymes. Acting in these roles, SIRT] connects nutrient availability to the regulation 
of major pathways of energy storage (glycogen and fat) and fuel utilization. 


Resveratrol, a Compound Found in Red Wine, Is a Potent Activator 
of Sirtuin Activity 


Resveratrol (trans3,4’ ,5-trihydroxystilbene, Figure 27.14) is a phytoalexin. Phyto- 
alexins are compounds produced by plants in response to stress, injury, or fungal 
infection. Resveratrol is abundant in wine grape skins as a result of common envi- 
ronmental stresses, such as infection by Botrytis cinerea, a fungus important in mak- 
ing certain wines. Because the skins are retained when grapes are processed to 


acetylated p53 (p53 is the protein product of the p53 
tumor suppressor gene). 
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FIGURE 27.14 Resveratrol, a phytoalexin, is a member of 
the polyphenol class of natural products. As a polyphe- 
nol, resveratrol is a good free-radical scavenger, which 
may account for its cancer preventative properties. 
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make red wines, red wine is an excellent source of resveratrol. Resveratrol might be 
the basis of the French paradox—the fact that the French people enjoy longevity 
and relative freedom from heart disease despite a high-fat diet. When resveratrol is 
given to yeast cultures, roundworms (Caenorhabditis elegans), or fruit flies (Drosophila 
melanogaster), it has the same life-extending effects as CR. Resveratrol increased the 
replicative life span (the number of times a cell can divide before dying) of yeast by 
70%. Resveratrol activates SIRT1 NAD*-dependent deacetylase activity. Further- 
more, resveratrol activates AMPK in the brain and in cultured neurons. Because 
AMPK is a key energy sensor, resveratrol’s influences on longevity may arise through 
its effects on caloric homeostasis. 

Because the effects of CR and resveratrol on longevity are not additive, a reason- 
able conclusion is that they operate via a common mechanism. If this is so, then if you 
want to enjoy longevity, the appropriate advice would be “Eat less or drink red wine,” 
not “Eat less and drink red wine”! 


SUMMARY 


27.1 Can Systems Analysis Simplify the Complexity of Metabolism? 
Cells are in a dynamic steady state maintained by considerable metabolic 
flux. The metabolism going on in even a single cell is so complex that it 
defies meaningful quantitative description. Nevertheless, overall relation- 
ships become more obvious by a systems analysis approach to intermedi- 
ary metabolism. The metabolism of a typical heterotrophic cell can be 
summarized in three interconnected functional blocks: (1) catabolism, 
(2) anabolism, and (3) macromolecular synthesis and growth. Photo- 
trophic cells require a fourth block: photosynthesis. Only a few metabolic 
intermediates connect these systems, and ATP and NADPH serve as the 
carriers of chemical energy and reducing power, respectively, between 
these various blocks. 


27.2 What Underlying Principle Relates ATP Coupling to the Thermo- 
dynamics of Metabolism? ATP coupling determines the thermody- 
namics of metabolic sequences. The ATP coupling stoichiometry can- 
not be predicted from chemical considerations; instead it is a quantity 
selected by evolution and the fundamental need for metabolic se- 
quences to be emphatically favorable from a thermodynamic perspec- 
tive. Catabolic sequences generate ATP with an overall favorable Keq 
(and hence, a negative AG) and anabolic sequences consume this en- 
ergy with an overall favorable Keq, even though such sequences may 
span the same starting and end points (as in fatty acid oxidation of 
palmitoyl-CoA to 8 acetyl-CoA versus synthesis of palmitoyl-CoA from 
8 acetyl-CoA). ATP has two metabolic roles: a stoichiometric role in 
rendering metabolic sequences thermodynamically favorable and a reg- 
ulatory role as an allosteric effector. 


27.3 Is There a Good Index of Cellular Energy Status? The level of 
phosphoric anhydride bonds in the adenylate system of ATP, ADP, and 
AMP can be expressed in terms of the energy charge equation: 

2 [ATP] + [ADP] 
[ATP] + [ADP] + [AMP] 
More revealing of the potential for an ATP-dependent reaction to occur 
is I’, the phosphorylation potential: = [ATP]/([ADP] [Pj]. 








1 
Energy charge 3 


27.4 How Is Overall Energy Balance Regulated in Cells? AMP-activated 
protein kinase (AMPK) is the cellular energy sensor. When cellular 
energy levels are high, as signaled by high ATP concentrations, AMPK 
is inactive. When cellular energy levels are depleted, as signaled by 
high [AMP], AMPK is allosterically activated and phosphorylates many 
enzymes involved in cellular energy production and consumption. 
Competition between AMP and ATP for binding to the AMPK allosteric 
sites determines AMPK activity. AMPK activation leads to elevated 
energy production and diminished energy consumption. AMPK is an 
aBy-heterotrimer. The a-subunit is the catalytic subunit. AMP or ATP 


binding to the y-subunit determines AMPK activity, with AMP binding 
increasing activity by more than 1000-fold. 

Activation of AMPK leads to phosphorylation of key enzymes in en- 
ergy metabolism, activating those involved in ATP production and inac- 
tivating those in ATP consumption. In addition, AMPK phosphorylation 
of various transcription factors leads to elevated expression of catabolic 
genes and diminished expression of genes encoding biosynthetic en- 
zymes. Beyond these cellular effects, AMPK plays a central role in en- 
ergy balance in multicellular organisms, because its activity is responsive 
to hormones that govern eating behavior and energy homeostasis. 


27.5 How Is Metabolism Integrated in a Multicellular Organism? Organ 
systems in complex multicellular organisms carry out specific physiologi- 
cal functions, with each expressing those metabolic pathways appropriate 
to its physiological purpose. Essentially all cells in animals carry out the 
central pathways of intermediary metabolism, especially the reactions of 
ATP synthesis. Nevertheless, organs differ in the metabolic fuels they pre- 
fer as substrates for energy production. The major fuel depots in animals 
are glycogen in liver and muscle; triacylglycerols (fats) in adipose tissue; 
and protein, most of which is in skeletal muscle. The order of preference 
for the use of these fuels is glycogen > triacylglycerol > protein. Never- 
theless, the tissues of the body work together to maintain caloric home- 
ostasis: Constant availability of fuels in the blood. The major organ systems 
have specialized metabolic roles within the organism. 

The brain has a strong reliance on glucose as fuel. Muscle at rest pri- 
marily relies on fatty acids, but under conditions of strenuous contrac- 
tion when Os is limiting, muscle shifts to glycogen as its primary fuel. 
The heart is a completely aerobic organ, rich in mitochondria, with a 
preference for fatty acids as fuel under normal operating conditions. 
The liver is the body’s metabolic processing center, taking in nutrients 
and sending out products such as glucose, fatty acids, and ketone bod- 
ies. Adipose tissue takes up glucose and, to a lesser extent, fatty acids for 
the synthesis and storage of triacylglycerols. 


27.6 What Regulates Our Eating Behavior? Appetite and weight reg- 
ulation are governed by hormones produced in the stomach, small in- 
testines, pancreas, adipose tissue, and central nervous system. These 
hormones act on neurons within the arcuate nucleus region of the hy- 
pothalamus that control pathways involved in eating (food intake) and 
energy expenditure. Hormones that regulate eating behavior can be di- 
vided into short-term regulators that determine individual meals and 
long-term regulators that act to stabilize the levels of body fat deposits. 
Short-term regulators of eating include ghrelin and cholecystokinin. 
Ghrelin is produced when the stomach is empty, but ghrelin levels fall 
quickly once food is consumed. Cholecystokinin is released from the 
gastrointestinal tract during eating. In contrast to ghrelin, chole- 


cysokinin signals satiety (the sense of fullness) and tends to curtail fur- 
ther eating. Together, ghrelin and cholecystokinin constitute a meal-to- 
meal control system that regulates the onset and end of eating behavior. 

Long-term regulators include insulin and leptin, both of which in- 
hibit eating and promote energy expenditure. Blood insulin levels cor- 
relate with body fat amounts. Insulin also stimulates fat cells to make 
leptin. Leptin is produced principally in adipocytes. As fat accumulates 
in adipocytes, more leptin is released into the bloodstream to commu- 
nicate the status of adipocyte fat to the central nervous system. If leptin 
levels are low (“starvation”), appetite increases; if leptin levels are high 
(“overfeeding”), appetite is suppressed. Leptin binding to its receptors 
in the hypothalamus inhibits release of NPY. NPY is a potent orexic hor- 
mone; therefore, leptin is an anorexic agent. When body fat stores de- 
cline, the circulating levels of leptin and insulin also decline. Hypothal- 
amic neurons sense this decline and act to increase appetite to restore 
body fat levels. 

Intermediate regulation of eating behavior is accomplished by the 
gut hormone PYY;~35. Produced in distal regions of the intestines, 
PYY; 36 delays eating for many hours after a meal by inhibiting the 
NPY/AgRP-producing neurons in the arcuate nucleus. The regulatory 
controls that govern eating are complex and layered. 


27.7 Can You Really Live Longer by Eating Less? Caloric restriction 
(CR) prolongs the longevity of organisms from yeast to mammals. CR 
results in lower blood glucose levels, decline in glycogen and fat stores, 
enhanced responsiveness to insulin, lower body temperature, and di- 
minished reproductive capacity. CR also diminishes the likelihood for 
development of many age-related diseases, such as cancer, diabetes, and 
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atherosclerosis. Genetic investigations revealed that mutations in the 
SIR2 gene abolish the extension of life span by CR. The human SIR2 
gene equivalent is SIRT1. SIRT genes encode sirtuins, a family of NAD*- 
dependent protein deacetylases. The NAD*/NADH ratio controls sir- 
tuin protein deacetylase activity, so oxidative metabolism, which drives 
conversion of NADH to NAD‘, enhances sirtuin action. CR increases 
mitochondrial biogenesis in liver, fat, and muscle, a response that would 
raise the NAD*/NADH ratio. SIRTI is a key regulator in CR. The phys- 
iological responses caused by CR are the result of a tightly regulated 
program that connects nutrient availability to the expression of meta- 
bolic genes. A striking feature of CR is the loss of fat stores and reduc- 
tion in white adipose tissue. SIRT] binding to PPARy, a nuclear hor- 
mone receptor that activates transcription of genes involved in 
adipogenesis and fat storage, represses transcription of these genes, 
leading to the loss of fat stores. Because adipose tissue functions as an 
endocrine organ, this loss of fat has significant hormonal consequences 
for energy metabolism. In liver, SIRT] interacts with and deacetylates 
PGC-1, a transcriptional regulator of glucose production. Transcription 
of genes encoding the enzymes of gluconeogenesis and repression of 
genes encoding glycolytic enzymes are increased upon PGC-1 deacety- 
lation. Thus, SIRT] connects nutrient availability to the regulation of 
major pathways of energy storage and fuel use. 

Resveratrol, a phytoalexin, has the same life-extending effects as CR. 
Resveratrol activates both SIRT1 activity and brain AMPK, a key energy 
sensor. The influences of resveratrol on longevity may arise through its 
effects on caloric homeostasis. 


PROBLEMS 


CENGAGENOW" Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. (Integrates with Chapters 3, 18, and 22.) The conversion of PEP 
to pyruvate by pyruvate kinase (glycolysis) and the reverse reac- 
tion to form PEP from pyruvate by pyruvate carboxylase and PEP 
carboxykinase (gluconeogenesis) represent a so-called substrate 
cycle. The direction of net conversion is determined by the rela- 
tive concentrations of allosteric regulators that exert kinetic con- 
trol over pyruvate kinase, pyruvate carboxylase, and PEP car- 
boxykinase. Recall that the last step in glycolysis is catalyzed by 
pyruvate kinase: 


PEP + ADP == pyruvate + ATP 


The standard free energy change is —31.7 kJ/mol. 

a. Calculate the equilibrium constant for this reaction. 

b. If [ATP] = [ADP], by what factor must [pyruvate] exceed [PEP] 
for this reaction to proceed in the reverse direction? 

The reversal of this reaction in eukaryotic cells is essential to gluco- 

neogenesis and proceeds in two steps, each requiring an equivalent 

of nucleoside triphosphate energy: 


Pyruvate carboxylase 
Pyruvate + CO, + ATP —> oxaloacetate + ADP + P; 


PEP carboxykinase 
Oxaloacetate + GTP — > PEP + CO, + GDP 


Net: Pyruvate + ATP + GTP —> PEP + ADP + GDP + P; 


c. The AG” for the overall reaction is +0.8 kJ/mol. What is the value 
of Keg? 

d. Assuming [ATP] = [ADP], [GTP] = [GDP], and P;=1 mM 
when this reaction reaches equilibrium, what is the ratio of 
[PEP]/ [pyruvate]? 

e. Are both directions in the substrate cycle likely to be strongly 
favored under physiological conditions? 











2. (Integrates with Chapter 3.) Assume the following intracellular con- 
centrations in muscle tissue: ATP = 8 mM, ADP = 0.9 mM, AMP = 
0.04 mM, P; = 8 mM. What is the energy chargein muscle? What is the 
phosphorylation potential? 

3. Strenuous muscle exertion (as in the 100-meter dash) rapidly depletes 
ATP levels. How long will 8 mM ATP last if 1 gram of muscle consumes 
300 umol of ATP per minute? (Assume muscle is 70% water.) Muscle 
contains phosphocreatine as a reserve of phosphorylation potential. 
Assuming [phosphocreatine] = 40 mM, [creatine] = 4 mM, and 
AG” (phosphocreatine + HO == creatine + P;) = —43.3 kJ/mol, 
how low must [ATP] become before it can be replenished by the 
reaction: phosphocreatine + ADP == ATP + creatine? [Remember, 
AG” (ATP hydrolysis) = —30.5 kJ/mol.] 

4. (Integrates with Chapter 20.) The standard reduction potentials for 
the (NAD*/NADH) and (NADP*/NADPH) couples are identical, 
namely, —320 mV. Assuming the in vivo concentration ratios NAD*/ 
NADH = 20 and NADP*/NADPH = 0.1, what is AG for the follow- 
ing reaction? 


NADPH + NAD+ == NADP+ + NADH 


Assuming standard state conditions for the reaction, ADP + P;—> 
ATP + HO, calculate how many ATP equivalents can be formed 
from ADP + P; by the energy released in this reaction. 


5. (Integrates with Chapter 3.) Assume the total intracellular pool of 
adenylates (ATP + ADP + AMP) = 8 mM, 90% of which is ATP. 
What are [ADP] and [AMP] if the adenylate kinase reaction is at 
equilibrium? Suppose [ATP] drops suddenly by 10%. What are the 
concentrations now for ADP and AMP, assuming that the adenylate 
kinase reaction is at equilibrium? By what factor has the AMP con- 
centration changed? 

6. (Integrates with Chapters 18 and 22.) The reactions catalyzed by 
PFK and FBPase constitute another substrate cycle. PFK is AMP ac- 
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10. 


11. 


12. 


13. 


tivated; FBPase is AMP inhibited. In muscle, the maximal activity of 
PFK (mmol of substrate transformed per minute) is ten times 
greater than FBPase activity. If the increase in [AMP] described in 
problem 5 raised PFK activity from 10% to 90% of its maximal value 
but lowered FBPase activity from 90% to 10% of its maximal value, 
by what factor is the flux of fructose-6-P through the glycolytic path- 
way changed? (Hint: Let PFK maximal activity = 10, FBPase maxi- 
mal activity = 1; calculate the relative activities of the two enzymes 
at low [AMP] and at high [AMP]; let J, the flux of F-6-P through the 
substrate cycle under any condition, equal the velocity of the PFK 
reaction minus the velocity of the FBPase reaction.) 


. (Integrates with Chapters 23 and 24.) Leptin not only induces syn- 


thesis of fatty acid oxidation enzymes and uncoupling protein 2 in 
adipocytes, but it also causes inhibition of acetyl-CoA carboxylase, 
resulting in a decline in fatty acid biosynthesis. This effect on acetyl- 
CoA carboxylase, as an additional consequence, enhances fatty acid 
oxidation. Explain how leptin-induced inhibition of acetyl-CoA car- 
boxylase might promote fatty acid oxidation. 


. (Integrates with Chapters 19 and 20.) Acetate produced in ethanol 


metabolism can be transformed into acetyl-CoA by the acetyl thio- 
kinase reaction: 


Acetate + ATP + CoASH —> acetyl-CoA + AMP + PP; 


Acetyl-CoA then can enter the citric acid cycle and undergo oxida- 
tion to 2 COs. How many ATP equivalents can be generated in a 
liver cell from the oxidation of one molecule of ethanol to 2 CO» by 
this route, assuming oxidative phosphorylation is part of the 
process? (Assume all reactions prior to acetyl-CoA entering the cit- 
ric acid cycle occur outside the mitochondrion.) Per carbon atom, 
which is a better metabolic fuel, ethanol or glucose? That is, how 
many ATP equivalents per carbon atom are generated by combus- 
tion of glucose versus ethanol to CO? 


. (Integrates with Chapter 23.) Assuming each NADH is worth 3 ATP, 


each FADH; is worth 2 ATP, and each NADPH is worth 4 ATP: 
How many ATP equivalents are produced when one molecule of 
palmitoyl-CoA is oxidized to 8 molecules of acetyl-CoA by the fatty 
acid -oxidation pathway? How many ATP equivalents are con- 
sumed when 8 molecules of acetyl-CoA are transformed into one 
molecule of palmitoyl-CoA by the fatty acid biosynthetic pathway? 
Can both of these metabolic sequences be metabolically favorable 
at the same time if AG for ATP synthesis is +50 kJ/mol? 
(Integrates with Chapters 18-21.) If each NADH is worth 3 ATP, 
each FADH; is worth 2 ATP, and each NADPH is worth 4 ATP, cal- 
culate the equilibrium constant for cellular respiration, assuming 
synthesis of each ATP costs 50 kJ/mol of energy. Calculate the equi- 
librium constant for CO» fixation under the same conditions, ex- 
cept here ATP will be hydrolyzed to ADP + P; with the release of 
50 kJ/mol. Comment on whether these reactions are thermody- 
namically favorable under such conditions. 

(Integrates with Chapter 22.) In type 2 diabetics, glucose produc- 
tion in the liver is not appropriately regulated, so glucose is over- 
produced. One strategy to treat this disease focuses on the devel- 
opment of drugs targeted against regulated steps in glycogenolysis 
and gluconeogenesis, the pathways by which liver produces glucose 
for release into the blood. Which enzymes would you select for as 
potential targets for such drugs? 

As chief scientist for drug development at PhatFarmaceuticals, Inc., 
you want to create a series of new diet drugs. You have a grand plan 
to design drugs that might limit production of some hormones or 
promote the production of others. Which hormones are on your 
“limit production” list and which are on your “raise levels” list? 
The existence of leptin was revealed when the ob/ob genetically 
obese strain of mice was discovered. These mice have a defective 


14. 


15, 


16. 


17. 


18. 


leptin gene. Predict the effects of daily leptin injections into ob/ob 
mice on food intake, fatty acid oxidation, and body weight. Similar 
clinical trials have been conducted on humans, with limited success. 
Suggest a reason why this therapy might not be a miracle cure for 
overweight individuals. 

Would it be appropriate to call neuropeptide Y (NPY) the obesity- 
promoting hormone? What would be the phenotype of a mouse 
whose melanocortin-producing neurons failed to produce melano- 
cortin? What would be the phenotype of a mouse lacking a func- 
tional MC3R gene? What would be the phenotype of a mouse lack- 
ing a functional leptin receptor gene? 

The Human Biochemistry box, The Metabolic Effects of Alcohol 
Consumption, points out that ethanol is metabolized to acetate in 
the liver by alcohol dehydrogenase and aldehyde dehydrogenase: 


CH;CH,OH + NAD+ —— CH;CHO + NADH + H+ 
CH,CHO + NAD+ + H,O — = CH,COO- + NADH + 2H+ 





These reactions alter the NAD*/NADH ratio in liver cells. From 
your knowledge of glycolysis, gluconeogenesis, and fatty acid oxida- 
tion, what might be the effect of an altered NAD*/NADH ratio on 
these pathways? What is the basis of this effect? 


A T172D mutant of the AMPK is locked in a permanently active 
state. Explain. 


a. Some scientists support the “malonyl-CoA hypothesis,” which 
suggests that malonyl-CoA is a key indicator of nutrient avail- 
ability and the brain uses its abundance to assess whole-body en- 
ergy homeostasis. Others have pointed out that malonyl-CoA is a 
significant inhibitor of carnitine acyltransferase-1 (see Figure 
24.16). Thus, malonyl-CoA may be influencing the levels of an- 
other metabolite whose concentration is more important as a sig- 
nal of energy status. What metabolite might that be? 

b. Another test of the malonyl-CoA hypothesis was conducted 
through the creation of a transgenic strain of mice that lacked 
functional hypothalamic fatty acid synthase (see Chapter 24). Pre- 
dict the effect of this genetic modification on cellular malonyl- 
CoA levels in the hypothalamus, the eating behavior of these 
transgenic mice, their body fat content, and their physical activity 
levels. Defend your predictions. 


a. Leptin was discovered when a congenitally obese strain of mice 
(ob/ob mice) was found to lack both copies of a gene encoding a 
peptide hormone produced mainly by adipose tissue. The pep- 
tide hormone was named leptin. Leptin is an anorexic (appetite- 
suppressing) agent; its absence leads to obesity. Propose an ex- 
periment to test these ideas. 

b. A second strain of obese mice (db/db mice) produces leptin in 
abundance but fails to respond to it. Assuming the db mutation 
leads to loss of function in a protein, what protein is likely to be 
nonfunctional or absent? How might you test your idea? 


Preparing for the MCAT Exam 


19. 


20. 


Consult Figure 27.7 and answer the following questions: Which or- 
gans use both fatty acids and glucose as a fuel in the well-fed state, 
which rely mostly on glucose, which rely mostly on fatty acids, which 
one never uses fatty acids, and which one produces lactate. 


Figure 27.3 illustrates the response of R (ATP-regenerating) and U 

(ATP-utilizing) enzymes to energy charge. 

a. Would hexokinase be an R enzyme or a U enzyme? Would glu- 
tamine:PRPP amidotransferase, the second enzyme in purine 
biosynthesis, be an R enzyme or a U enzyme? 

b. If energy charge = 0.5: Is the activity of hexokinase high or low? Is 
ribose-5-P pyrophosphokinase activity high or low? 

c. If energy charge = 0.95: Is the activity of hexokinase high or low? 
Is ribose-5-P pyrophosphokinase activity high or low? 
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Julie Newdoll's painting “Dawn of the Double Helix” 
composes the DNA duplex as human figures. Her 
theme in this painting is “Life Forms: The basic struc- 
tures that make our existence possible.” 


“Dawn of the Double Helix,” by Julie Newdoll 





Heredity 


I am the family face; 

Flesh perishes, I live on, 
Projecting trait and trace 
Through time to times anon, 
And leaping from place to place 
Over oblivion. 


The years-heired feature that can 
In curve and voice and eye 
Despise the human span 

Of durance—that is I; 

The eternal thing in man, 

That heeds no call to die. 


Thomas Hardy 


(in Moments of Vision and 
Miscellaneous Verses, 1917) 
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DNA Metabolism: Replication, 
Recombination, and Repair 





ESSENTIAL QUESTIONS 


DNA is the physical repository of genetic information in the cell and the material of 
heredity that is passed on to progeny. 


How is this genetic information in the form of DNA replicated, how is the infor- 
mation rearranged, and how is its integrity maintained in the face of damage? 


Heredity, which we can define generally as the tendency of an organism to pos- 
sess the characteristics of its parent(s), is clearly evident throughout nature and 
since the dawn of history has served to justify the classification of organisms ac- 
cording to shared similarities. The basis of heredity, however, was a mystery. Early 
in the 20th century, geneticists demonstrated that genes, the elements or units 
carrying and transferring inherited characteristics from parent to offspring, are 
contained within the nuclei of cells in association with the chromosomes. Yet the 
chemical identity of genes remained unknown, and genetics was an abstract sci- 
ence. Even the realization that chromosomes are composed of proteins and nu- 
cleic acids did little to define the molecular nature of the gene because, at the 
time, no one understood either of these substances. 

The material of heredity must have certain properties. It must be very stable so 
that genetic information can be stored in it and transmitted countless times to sub- 
sequent generations. It must be capable of precise copying or replication so that its 
information is not lost or altered. And, although stable, it must also be subject to 
change in order to account, in the short term, for the appearance of mutant forms 
and, in the long term, for evolution. DNA is the material of heredity. 


28l" How Is DNA Replicated? 


Transfer of genetic information from generation to generation requires the faithful 
reproduction of the parental DNA. DNA reproduction produces two identical 
copies of the original DNA in a process termed DNA replication. The mechanism 
for DNA replication is strand separation followed by the copying of each strand. In the 
process, each separated strand acts as a template for the synthesis of a new com- 
plementary strand whose nucleotide sequence is fixed by the base-pairing rules 
Watson and Crick proposed (see Chapter 10). Strand separation is achieved by 
untwisting the double helix (Figure 28.1). Base pairing then dictates the proper 
sequence of nucleotide addition to achieve an accurate replication of each orig- 
inal strand. Thus, each original strand ends up paired with a new complementary 
partner, and two identical double-stranded DNA molecules are formed from one. 
This mode of DNA replication is referred to as semiconservative because one of 
the two original strands is conserved in each of the two progeny molecules. 


DNA Replication Is Bidirectional 


Replication of DNA molecules begins at one or more specific regions called the 
origin(s) of replication and, excepting certain bacteriophage chromosomes and plas- 
mids, proceeds in both directions from this origin (Figure 28.2). For example, repli- 
cation of E. coli DNA begins at oriC, a unique 245-bp chromosomal site that contains 
11 GATC tetranucleotide sequences along its length. From orC, replication ad- 
vances in both directions around the circular chromosome. That is, bidirectional 


replication involves two replication forks that move in opposite directions. Bidirec- 
tional replication predicts that, if radioactively labeled nucleotides are provided as 
substrates for new DNA synthesis, both replication forks will become radioactively 
labeled. The experiment illustrated in Figure 28.2 confirms this prediction. 


Replication Requires Unwinding of the DNA Helix 


Semiconservative replication depends on unwinding the DNA double helix to ex- 
pose single-stranded templates to polymerase action. For a double helix to unwind, 
it must either rotate about its axis (while the ends of its strands are held fixed), or 
positive supercoils must be introduced, one for each turn of the helix unwound (see 
Chapter 11). If the chromosome is circular, as in FE. coli, only the latter alternative is 
possible. Because DNA replication in E. coli proceeds at a rate approaching 1000 
nucleotides per second and there are about 10 bp per helical turn, the chromosome 
would accumulate 100 positive supercoils per second! In effect, the DNA would be- 
come too tightly supercoiled to allow unwinding of the strands. 

DNA gyrase, a Type II topoisomerase, acts to overcome the torsional stress im- 
posed upon unwinding; DNA gyrase introduces negative supercoils at the expense of 
ATP hydrolysis. The unwinding reaction is driven by helicases (see also Chapter 16), 
a class of proteins that catalyze the ATP-dependent unwinding of DNA double he- 
lices. Unlike topoisomerases that alter the linking number of dsDNA through phos- 
phodiester bond breakage and reunion (see Chapter 11), helicases simply disrupt 
the hydrogen bonds that hold the two strands of duplex DNA together. A helicase 
molecule requires a single-stranded region for binding. It then moves along the sin- 
gle strand, unwinding the double-stranded DNA in an ATP-dependent process. SSB 
(single-stranded DNA-binding protein) binds to the unwound strands, preventing 
their re-annealing. At least ten distinct DNA helicases involved in different aspects of 
DNA and RNA metabolism have been found in E. coli alone. DnaB is the DNA heli- 
case acting in E. coli DNA replication. DnaB helicase assembles as a hexameric (œe) 
“doughnut”-shaped protein ring, with DNA passing through its hole. 


DNA Replication Is Semidiscontinuous 


As shown in Figure 28.2, both parental DNA strands are replicated at each advancing 
replication fork. The enzyme that carries out DNA replication is DNA polymerase. 
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A template is something whose edge is shaped 
in a particular way so that it can serve as a guide 
in making a similar object with a corresponding 
contour. 





Old New New Old 
Emerging progeny DNA 


FIGURE 28.1 DNA replication: Strand separation fol- 
lowed by the copying of each strand. 


FIGURE 28.2 Bidirectional replication. (a) Comparison of 
labeling during unidirectional versus bidirectional repli- 
cation. (b) An autoradiogram of E. coli chromosome 
replication in the presence of radioactive thymidine 
confirms bidirectional replication. (Photo courtesy of David 
M. Prescott, University of Colorado.) 
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This enzyme uses single-stranded DNA (ssDNA) as a template and makes a comple- 
mentary strand by polymerizing deoxynucleotides in the order specified by their base 
pairing with bases in the template. DNA polymerases synthesize DNA only in a 5’>3' 
direction, reading the antiparallel template strand in a 3'—>5' sense. A dilemma arises: 
How does DNA polymerase copy the parent strand that runs in the 5'—>3' direction 
at the replication fork? It turns out that replication is semidiscontinuous (Figure 28.3): As 
the DNA helix is unwound during its replication, the 3’-5’ strand (as defined by the 
direction that the replication fork is moving) can be copied continuously by DNA 
polymerase synthesizing in the 5'—>3' direction behind the replication fork. The 
other parental strand is copied only when a sufficient stretch of its sequence has been 
exposed for DNA polymerase to read it in the 3’->5' sense. Thus, one parental strand 
is copied continuously to give a newly synthesized copy, called the leading strand, at 
each replication fork. The other parental strand is copied in an intermittent, or dis- 
continuous, mode to yield a set of fragments 1000 to 2000 nucleotides in length, 
called the Okazaki fragments (Figure 28.3a). These fragments are then joined to 
form an intact lagging strand. Because both strands are synthesized in concert by a 
dimeric DNA polymerase situated at the replication fork, the 5'—>3' parental strand 
must wrap around in trombone fashion so that the unit of dimeric DNA polymerase 
replicating it can move along it in the 3'—>5' direction (Figure 28.3b). Overall, each 
of the two DNA duplexes produced in DNA replication contains one “old” and one 
“new” DNA strand, and half of the new strand was formed by leading strand synthesis 
and the other half by lagging strand synthesis. 


The Lagging Strand Is Formed from Okazaki Fragments 


In 1968, Tuneko and Reiji Okazaki provided biochemical verification of the semi- 
discontinuous pattern of DNA replication just described. The Okazakis exposed a 
rapidly dividing E. coli culture to *H-labeled thymidine for 30 seconds, quickly col- 
lected the cells, and found that half of the label incorporated into nucleic acid ap- 
peared in short ssDNA chains just 1000 to 2000 nucleotides in length. (The other 
half of the radioactivity was recovered in very large DNA molecules.) Subsequent ex- 
periments demonstrated that with time, the newly synthesized short ssDNA Okazaki 
fragments became covalently joined to form longer polynucleotide chains, in ac- 
cord with a semidiscontinuous mode of replication. The generality of this mode of 
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FIGURE 28.3 The semidiscontinuous model for DNA replication. Newly synthesized DNA is shown as red. 

(a) Leading and lagging strand synthesis. (b) Synthesis of both strands carried out by a dimeric DNA poly- 
merase situated at the replication fork. Because DNA polymerase must read the template strand in the 3’-35' 
direction, the 5’-3’ parental strand must wrap around in trombone fashion. 
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replication has been corroborated with electron micrographs of DNA undergoing 
replication in eukaryotic cells. 


28:28 What Are the Properties of DNA Polymerases? 


The enzymes that replicate DNA are called DNA polymerases. All DNA polymerases, 
whether from prokaryotic or eukaryotic sources, share the following properties: 


1. The incoming base is selected within the DNA polymerase active site, as deter- 
mined by Watson—Crick geometric interactions with the corresponding base in 
the template strand. 

. Chain growth is in the 5'—>3' direction and is antiparallel to the template strand. 

3. DNA polymerases cannot initiate DNA synthesis de novo—all require a primer 

oligonucleotide with a free 3’-OH to build upon. 


N 


Despite these commonalities, DNA replication in bacterial cells is simpler than in 
eukaryotes and thus will be considered first. 


E. coli Cells Have Several Different DNA Polymerases 


Table 28.1 compares the properties of the principal DNA polymerases in E. coli. 
These enzymes are nicknamed pol and numbered I through V in order of their 
discovery. DNA polymerases I, II, and V function principally in DNA repair; DNA 
polymerase III is the chief DNA-replicating enzyme of E. coli. Only 40 or so copies of 
this enzyme are present per cell. 


The First DNA Polymerase Discovered Was E. coli DNA Polymerase | 


In 1957, Arthur Kornberg and his colleagues discovered the first DNA polymerase, 
DNA polymerase I. DNA polymerase I catalyzed the synthesis of DNA in vitro if 
provided with all four deoxynucleoside-5’-triphosphates (dATP, dTTP, dCTP, 
dGTP), a template DNA strand to copy, and a primer. A primer is essential because 
DNA polymerases can elongate only preexisting chains; they cannot join two 
deoxyribonucleoside-5'-phosphates together to make the initial phosphodiester 
bond. The primer base pairs with the template DNA, forming a short, double- 
stranded region. This primer must possess a free 3'-OH end to which an incoming 
deoxynucleoside monophosphate is added. One of the four dNTPs is selected as 
substrate, pyrophosphate (PP;) is released, and the dNMP is linked to the 3’-OH of 
the primer chain through formation of a phosphoester bond (Figure 28.4). The 
deoxynucleotide selected as substrate is chosen through its geometric fit with the 
template base to form a Watson—Crick base pair. As DNA polymerase I catalyzes the 
successive addition of deoxynucleotide units to the 3’-end of the primer, the chain 
is elongated in the 5'—3’ direction, forming a polynucleotide sequence that is 
antiparallel and complementary to the template. DNA polymerase I can proceed 














TABLE 28.1 

Property Pol | Pol Il Pol Ill (core)* 
Mass (kD) 103 88 130(a), 27.5(e), 8.6(0) 
Molecules/cell 400 40 
Turnover numbert 20 40 1000 
Polymerization 5'—3' Yes Yes Yes 
Exonuclease 3’—5’ Yes Yes Yes 
Exonuclease 5’ 3’ Yes No No 











*q-,e-,and @-subunits. 
*Nucleotides polymerized at 37°C/second/molecule of enzyme. 
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FIGURE 28.4 The chain elongation reaction catalyzed by 
DNA polymerase. The 3’-OH carries out a nucleophilic 
attack on the a-phosphoryl group of the incoming 
dNTP to form a phosphoester bond, and PP; is released. 
The subsequent hydrolysis of PP; by inorganic pyro- 
phosphatase renders the reaction effectively irreversible. 


FIGURE 28.5 The 3’-95’ exonuclease activity of DNA 
polymerase | removes nucleotides from the 3’-end of 
the growing DNA chain. 








along the template strand, synthesizing a complementary strand of 3 to 200 bases 
before it “falls off” (dissociates from) the template. The degree to which the en- 
zyme remains associated with the template through successive cycles of nucleotide 
addition is referred to as its processivity. As DNA polymerases go, DNA polymerase 
I is a modestly processive enzyme. Arthur Kornberg was awarded the Nobel Prize 
in Physiology or Medicine in 1959 for his discovery of this DNA polymerase. DNA 
polymerase I is the best characterized of these enzymes. 


E. coli DNA Polymerase I Has Three Active Sites on Its Single 
Polypeptide Chain 


In addition to its 5'—>3' polymerase activity, E. coli DNA polymerase I has two other 
catalytic functions: a 3’5’ exonuclease (3’-exonuclease) activity and a 5’ 3’ exonu- 
clease (5’-exonuclease) activity. The three distinct catalytic activities of DNA polym- 
erase I reside in separate active sites in the enzyme. 


E. coli DNA Polymerase | Is Its Own Proofreader and Editor 


The exonuclease activities of E. coli DNA polymerase I are functions that enhance the 
accuracy of DNA replication. The 3’-exonuclease activity removes nucleotides from 
the 3’-end of the growing chain (Figure 28.5), an action that negates the action of the 
polymerase activity. Its purpose, however, is to remove incorrect (mismatched) bases. 







Template Mismatched 


Exonuclease 
hydrolysis site 


DNA 
polymerase I 


28.2 What Are the Properties of DNA Polymerases? 867 


Although the 3’-exonuclease works slowly compared to the polymerase, the poly- 
merase cannot elongate an improperly base-paired primer terminus. Thus, the rela- 
tively slow 3'-exonuclease has time to act and remove the mispaired nucleotide. There- 
fore, the polymerase active site is a proofreader, and the 3'-exonuclease activity is an 
editor. This check on the accuracy of base pairing enhances the overall precision of the 
process. 

The 5’-exonuclease of DNA polymerase I acts upon duplex DNA, degrading it 
from the 5'-end by releasing mononucleotides and oligonucleotides. It can remove 
distorted (mispaired) segments lying in the path of the advancing polymerase. Its 
biological roles depend on the ability of DNA polymerase I to bind at nicks (single- 
stranded breaks) in dsDNA and move in the 5'—>3' direction, removing successive 
nucleotides with its 5'-exonucleolytic activity. (This overall process is known as nick 
translation, because the nick is translated [that is, moved] down the DNA.) This 
5'-exonuclease activity plays an important role in primer removal during DNA repli- 
cation, as we shall soon see. DNA polymerase I is also involved in DNA repair 
processes (see Section 28.8). 


E. coli DNA Polymerase III Holoenzyme Replicates the E. coli 
Chromosome 


In its holoenzyme form, DNA polymerase III is the enzyme responsible for replica- 
tion of the E. coli chromosome. The simplest form of DNA polymerase III showing 
any DNA-synthesizing activity in vitro, “core” DNA polymerase III, is 165 kD in size 
and consists of three polypeptides: æ (130 kD), € (27.5 kD), and 0 (8.6 kD). In vivo, 
core DNA polymerase III functions as part of a multisubunit complex, the DNA 
polymerase III holoenzyme, which is composed of ten different kinds of subunits 
(Table 28.2). The various auxiliary subunits increase both the polymerase activity of 
the core enzyme and its processivity. DNA polymerase IIT holoenzyme synthesizes 
DNA strands at a speed of nearly 1 kb/sec. DNA polymerase III holoenzyme is 
organized in the following way: Two core (ae?) DNA polymerase III units and one 
y-complex are attached to DnaB helicase via two t-subunits to form a structure 
known as DNA polymerase III*. In turn, each core polymerase within DNA po- 
lymerase III* binds to a B-subunit dimer to create DNA polymerase III holoenzyme, 
a 17-subunit ((a@€0) 92BoTsy65' yyy) complex (Figure 28.6). The y-complex is respon- 
sible for assembly of the DNA polymerase HI holoenzyme complex onto DNA. The 
y-complex of the holoenzyme acts as a clamp loader by catalyzing the ATP- 
dependent transfer of a pair of B-subunits to each strand of the DNA template. 
Each -subunit dimer forms a closed ring around a DNA strand and acts as a tight 
clamp that can slide along the DNA (Figure 28.7). Each By-sliding clamp tethers a 


























EEEE subunits of E coli DNA Polymerase II Holoenzyme i 
Subunit Mass (kD) Function 
a 130 Polymerase 
€ 27.5 3'-Exonuclease 
0 8.6 e-subunit stabilization 
T 71 DNA template binding; core enzyme dimerization 
B 41 Sliding clamp, processivity 
y 47.5 Part of the y-complex* 
ô 39 Part of the y-complex* 
ô' 37 Part of the y-complex* 
X 17 Interaction with SSB and the y-complex 
y 15 Interaction with y and the y-complex 











*Subunits 7, y,6,5’, x, and w form the so-called y-complex responsible for adding B-subunits (the sliding clamp) to DNA 
and anchoring the sliding clamp to the two core DNA polymerase III structures. The y-complex is referred to as the clamp 
loader. 
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core polymerase to the template, accounting for the great processivity of the DNA 
polymerase holoenzyme. This complex can replicate an entire strand of the E. coli 
genome (more than 4.6 megabases) without dissociating. Compare this to the pro- 
cessivity of DNA polymerase I, which is only 20! 

The core polymerase synthesizing the lagging strand must release from the DNA 
template when synthesis of an Okazaki fragment is completed and rejoin the template 
at the next RNA primer to begin synthesis of the next Okazaki fragment. The 
T-subunit serves as a “processivity switch” that accomplishes this purpose. The 7-subunit 
is usually “off” and is turned “on” only on the lagging strand and only when synthesis 
of an Okazaki fragment is completed. When activated, 7 ejects the Bo-sliding clamp 
bound to the lagging strand core polymerase. Almost immediately, the lagging strand 
core polymerase is reloaded onto a new sliding clamp at the 3-end of next RNA 
primer, and synthesis of the next Okazaki fragment commences. 


A DNA Polymerase III Holoenzyme Sits at Each Replication Fork 


We now can present a snapshot of the enzymatic apparatus assembled at a replication 
fork (Figure 28.8 and Table 28.3). DNA gyrase (topoisomerase) and DnaB helicase 
unwind the DNA double helix, and the unwound, single-stranded regions of DNA are 


«1 FIGURE 28.7 (a) Ribbon diagram of the B-subunit dimer of the DNA polymerase II holoenzyme on B-DNA, 
viewed down the axis of the DNA. One monomer of the B-subunit dimer is colored blue and the other yellow. 
The centrally located DNA is multicolored. (b) Space-filling model of the B-subunit dimer of the DNA polymerase 
Ill holoenzyme on B-DNA.The hole formed by the B-subunits (diameter = 3.5 nm) is large enough to easily 
accommodate DNA (diameter = 2.5 nm) with no steric repulsion (pdb id = 2POL).The rest of polymerase III 
holoenzyme (‘core” polymerase + y-complex) associates with this sliding clamp to form the replicative poly- 
merase (not shown). 
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FIGURE 28.8 General features of a replication fork. The 
DNA duplex is unwound by the action of DNA gyrase 
and helicase, and the single strands are coated with SSB 


: 3 i A 7 . . (ssDNA-binding protein). Primase periodically primes 
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primer on the lagging strand; the leading strand, which needs priming only once, was replicative polymerase is a "core" polymerase bound to 

primed when replication was initiated. The lagging strand template is looped around, its template strand by a B-subunit sliding clamp. DNA 

and each replicative DNA polymerase moves 5'—3' relative to its strand, copying tem- POIymerase | and DNA ligase act downstream on the 
Bos aes : lagging strand to remove RNA primers, replace them 

plate and synthesizing a new DNA strand. Each replicative polymerase is tethered to with DNA and li gate tha Okazaki fragments. 

the DNA by its -subunit sliding clamp. The DNA polymerase II y-complex periodi- 

cally unclamps and then reclamps -subunits on the lagging strand as the primer for 

each new Okazaki fragment is encountered. Downstream on the lagging strand, DNA 

polymerase I excises the RNA primer and replaces it with DNA, and DNA ligase seals 

the remaining nick. 


DNA Ligase Seals the Nicks Between Okazaki Fragments 


DNA ligase (see Chapter 12) seals nicks in double-stranded DNA where a 3'-OH and 
a 5'-phosphate are juxtaposed. This enzyme is responsible for joining Okazaki frag- 
ments together to make the lagging strand a covalently contiguous polynucleotide 
chain. 


DNA Replication Terminates at the Ter Region 


Located diametrically opposite from oriC on the E. coli circular map is a terminus re- 
gion, the Ter, or t, locus. The oppositely moving replication forks meet here, and repli- 
cation is terminated. The Ter region contains a number of short DNA sequences, with 








TABLE 28.3 Pro I 

Protein Function 

DNA gyrase Unwinding DNA 

SSB Single-stranded DNA binding 

DnaA Initiation factor; origin-binding protein 
DnaB 5'—3' helicase (DNA unwinding) 
DnaC DnaB chaperone; loading DnaB on DNA 
Primase (DnaG) Synthesis of RNA primer 

DNA polymerase HI holoenzyme Elongation (DNA synthesis) 

DNA polymerase I Excises RNA primer, fills in with DNA 
DNA ligase Covalently links Okazaki fragments 

Tus Termination 
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A DEEPER LOOK 


A Mechanism for All Polymerases 


Thomas A. Steitz of Yale University has suggested that biosynthesis 
of nucleic acids proceeds by an enzymatic mechanism that is uni- 
versal among polymerases. His suggestion is based on structural 
studies indicating that DNA polymerases use a “two-metal-ion” 
mechanism to catalyze nucleotide addition during elongation of 
a growing polynucleotide chain (see accompanying figure). The in- 
coming nucleotide has two Mg** ions coordinated to its phosphate 
groups, and these metal ions interact with two aspartate residues 
that are highly conserved in DNA (and RNA) polymerases. These 
residues in phage T7 DNA polymerase are D705 and D882. One 
metal ion, designated A, interacts with the O atom of the free 
3'-OH group on the polynucleotide chain, lowering its affinity for 
its hydrogen. This interaction promotes nucleophilic attack of the 
3'-O on the phosphorus atom in the a-phosphate of the incoming 
nucleotide. The second metal ion (B in the figure) assists depar- 
ture of the product pyrophosphate group from the incoming nu- 
cleotide. Together, the two metal ions stabilize the pentacovalent 
transition state on the a-phosphorus atom. 


Adapted from Steitz, T., 1998. A mechanism for all polymerases. Nature 
391:231-232. (See also Doublié, S., et al., 1998. Crystal structure of bacte- 
riophage T7 DNA replication complex at 2.2 A resolution. Nature 391: 
251-258; and Kiefer, J. R., et al., 1998. Visualizing DNA replication in a 
catalytically active Bacillus DNA polymerase crystal. Nature 391:304-307.) 


Template | 
| Base’ 
Base’ ' ' 
T Base 
7 dNTP 
Base o 
Primer 
O 
-0 O0=C OH 
ion `p z O B 
4 EA 
Cc ~~ ó St ee 
7 es, i 7N 
Ow E 
e- (OT KE \ 
K g LHC) —a PLY 
An D a T A Fe 
0.9.0 Lo) Os 
G I 
Cc 
Z N 
D705 















Replication 
factory 


FIGURE 28.9 A replication factory “fixed” to a cellular sub- 
structure extrudes loops of newly synthesized DNA as 
parental DNA duplex is fed in from the sides. Parental 
DNA strands are green; newly synthesized strands are 
blue; small circles indicate origins of replication. 


a consensus core element 5'-GIGTGTTGT. These Ter sequences act as terminators; 
clusters of three or four Ter sequences are organized into two sets inversely oriented 
with respect to one another. One set blocks the clockwise-moving replication fork, and 
its inverted counterpart blocks the counterclockwise-moving replication fork. Termi- 
nation requires binding of a specific replication termination protein, Tus protein, to 
Ter. Tus protein is a contrahelicase. That is, Tus protein prevents the DNA duplex from 
unwinding by blocking progression of the replication fork and inhibiting the ATP- 
dependent DnaB helicase activity. Final synthesis of both duplexes is completed. 


DNA Polymerases Are Immobilized in Replication Factories 


Most drawings of DNA replication (such as Figure 28.8) suggest that the DNA 
polymerases are tracking along the DNA, like locomotives along train tracks, syn- 
thesizing DNA as they go. Experimental evidence, however, favors the view that the 
DNA polymerases are immobilized, either via attachment to the cell membrane in 
prokaryotic cells or to the nuclear matrix in eukaryotic cells. All the associated pro- 
teins of DNA replication, as well as proteins necessary to hold DNA polymerase at 
its fixed location, constitute replication factories. The DNA is then fed through the 
DNA polymerases within the replication factory, much like tape is fed past the heads 
of a tape player, with all four strands of newly replicated DNA looping out from this 
fixed structure (Figure 28.9). 


28:38 Why Are There So Many DNA Polymerases? 


Cells Have Different Versions of DNA Polymerase, 
Each for a Particular Purpose 


A host of different DNA polymerases have been discovered, and even simple bacte- 
ria such as Escherichia coli have more than one. Based on sequence homology, poly- 
merases can be grouped into seven different families. The families differ in terms of 
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the biological function served by family members. For example, family A includes 
DNA polymerases involved in DNA repair in bacteria; family B polymerases include 
the eukaryotic DNA polymerases predominantly involved in replication of chromo- 
somal DNA; family C has the bacterial chromosomal DNA-replicating enzymes; 
members of families X and Y act in DNA repair pathways; and RT designates the 
DNA polymerases of retroviruses (such as HIV) and the telomerases that renew the 
ends of eukaryotic chromosomes. RT polymerases are novel in that they use RNA as 
the template. 


The Common Architecture of DNA Polymerases 


Despite sequence variation, the various DNA polymerase structures more or less fol- 
low a common architectural pattern that is reminiscent of a right hand, with distinct 
structural domains referred to as fingers, palm, and thumb (Figure 28.10). The ac- 
tive site of the polymerase, where deoxynucleotide addition to the growing chain is 
catalyzed, is located in the crevice within the palm domain that lies between the fin- 
gers and thumb domains. The fingers domain acts in deoxynucleotide recognition 
and binding, and the thumb is responsible for DNA binding, in the following man- 
ner: When the DNA polymerase binds to template-primer duplex DNA, its thumb 
domain closes around the DNA so that the DNA is bound in a groove formed by the 
thumb and palm. A dNTP substrate is then selected by the polymerase, and dNTP 
binding induces a conformational change in the fingers, which now rotate toward 
the polymerase active site in the palm. Catalysis ensues and a dNMP is added to the 
3’-end of the growing primer strand; pyrophosphate is released, and the polymerase 
translocates one base farther along the template strand. In essence, all DNA polym- 
erases are molecular motors that synthesize DNA, using dNTP substrates to add 
dNMP units to the primer strand, as they move along the template strand, reading 
its base sequence. 


28:45 How Is DNA Replicated in Eukaryotic Cells? 


DNA replication in eukaryotic cells shows strong parallels with prokaryotic DNA 
replication, but it is vastly more complex. First, eukaryotic DNA is organized into 
chromosomes which are compartmentalized within the nucleus. Furthermore, 
these chromosomes must be duplicated with high fidelity once (and only once!) 
each cell cycle. For example, in a dividing human cell, a carefully choreographed 
replication of 6 billion bp of DNA distributed among 46 chromosomes occurs. The 


FIGURE 28.10 A structural paradigm for DNA poly- 
merases, bacteriophage RB69 DNA polymerase. Ternary 
complex formed between the RB69 DNA polymerase, 
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FIGURE 28.11 The eukaryotic cell cycle. The stages of mi- 
tosis and cell division define the M phase (M for mitosis). 
G; (G for gap, not growth) is typically the longest part of 
the cell cycle; G, is characterized by rapid growth and 
metabolic activity. Cells that are quiescent, that is, not 
growing and dividing (such as neurons), are said to be in 
Go. The S phase is the time of DNA synthesis. S is fol- 
lowed by G, a relatively short period of growth when 
the cell prepares for cell division. Cell cycle times vary 
from less than 24 hours (rapidly dividing cells such as 
the epithelial cells lining the mouth and gut) to hun- 
dreds of days. 







events associated with cell growth and division in eukaryotic cells fall into a general 
sequence having four distinct phases: M, G;, S, and Go (Figure 28.11). Eukaryotic 
cells have solved the problem of replicating their enormous genomes in the few 
hours allotted to the S phase by initiating DNA replication at multiple origins of 
replication distributed along each chromosome. Depending on the organism and 
cell type, replication origins are DNA regions 0.5 to 2 kbp in size that occur every 
3 to 300 kbp (for example, an average human chromosome has several hundred 
replication origins). Since eukaryotic DNA replication proceeds concomitantly 
throughout the genome, each eukaryotic chromosome must contain many units of 
replication, called replicons. 


The Cell Cycle Controls the Timing of DNA Replication 


Checkpoints, Cyclins,and CDKs Progression through the cell cycle is regulated 
through a series of checkpoints that control whether the cell continues into the 
next phase. These checkpoints are situated to ensure that all the necessary steps 
in each phase of the cycle have been satisfactorily completed before the next 
phase is entered. If conditions for advancement to the next phase are not met, 
the cycle is arrested until they are. Checkpoints depend on cyclins and cyclin- 
dependent protein kinases (CDKs). Cyclin is the name given to a class of proteins 
synthesized at one phase of the cell cycle and degraded at another. Thus, cyclins 
appear and then disappear at specific times during the cell cycle. Cyclins are 
larger than the small CDK protein kinase subunits to which they bind. The vari- 
ous CDKs are inactive unless complexed with their specific cyclin partners. In 
turn, these CDKs control events at each phase of the cycle by targeting specific 
proteins for phosphorylation. Destruction of the phase-specific cyclin at the end 
of the phase inactivates the CDK. 


Initiation of Replication Eukaryotic cells initiate DNA replication at multiple ori- 
gins, and two replication forks arise from each origin. The two replication forks 
then move away from each other in opposite directions. Initiation of replication de- 
pends on the origin recognition complex, or ORC, a protein complex that binds to 
replication origins. Indeed, eukaryotic replication origins are defined as nucleotide 
sequences that bind ORC. Stable maintenance of the eukaryotic genome demands 
that DNA replication occurs only once per cell cycle. This demand is met by divid- 
ing initiation of DNA replication into two steps: (1) the licensing of replication ori- 
gins during late M or early G,, and (2) the activation of replication at the origins 
during S phase through the action of two protein kinases, Cdc7-Dbf4 and S-CDK 
(the S-phase cyclin-dependent protein kinase). 

Licensing involves the highly regulated assembly of prereplication complexes 
(pre-RCs) on origins of replication. Early in G; (just after M), the ORC (a hetero- 
hexameric complex of Orcl-6) serves as a “landing pad” for proteins essential to 
replication control. Binding of these proteins to ORC establishes a pre-RC, but only 
within this window of opportunity during G4. Yeast, a simple eukaryote, provides an 
informative model: ORC binds to origins and recruits Cdc6 (in its phosphorylated 
form, Cdc6p), Cdtl, and the MCM proteins (Figure 28.12). Cdc6 and Cdt1 are the 
replicator activator proteins. Cdc6 is degraded following replication initiation, 
thereby precluding the possibility for errant replication initiation events until after 
mitosis, when Cdc6 accumulates again. MCM proteins are also known as replication 
licensing factors, because they “license,” or permit, DNA replication to occur. The 
MCM proteins assemble as hexameric helicases that render the chromosomes com- 
petent for replication. Two MCM complexes are active within each origin, one for 
each replication fork. The pre-RC therefore consists of Cdc6, Cdtl, the MCM com- 
plexes, and other proteins. 

DNA replication is the defining characteristic of the S phase of the cell cycle. The 
switch from G to S is triggered by phosphorylation events carried out by S-CDK and 
Cdc7-Dbf4. Phosphorylation of the MCM proteins and binding of Cdc45 activates 
the helicase activity of MCM (Figure 28.12). Phosphorylation of Sld2 and Sld3, a pair 
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of proteins that interact with Dbp11, leads to the recruitment of DNA polymerase to 
the replication origins. (The protein acronyms refer to DNA polymerase-binding 
[Dpb] and proteins encoded by genes that give a lethal phenotype when mutated in 
Dpb11 mutant strains of yeast [Sld, or synthetic-lethal with Dpb11]. Collectively, 
these proteins are referred to as the “11-3-2 complex”). The actions of S-CDK and 
Cdc7-Dbf4 trigger bidirectional DNA replication from each origin, with the two di- 
verging MCM complexes serving as helicases. Each helicase unwinds the duplex 
DNA to provide single-stranded templates for the DNA polymerases that follow. 


Proteins of the Prereplication Complex Are AAA+ ATPase Family 
Members 


Cdc6, the Orc proteins, and MCM proteins are members of the AAA+ ATPase 
family, a group of proteins characterized by sequence and structural homology, 
ATPase activity, and a general function as molecular chaperones. The binding of 
both ORC and Cdc6 to chromatin in the process of pre-RC assembly is ATP-depen- 
dent (Figure 28.12). ORC with ATP bound can bind to origins of replication and 
recruit Cdc6 and Cdtl to form a pre-RC in which both ORC and Cdc6 have bound 
ATP. Cdc6 is required for the recruitment of the MCM proteins, specifically, a com- 
plex of MCM2-7 with ATP-dependent helicase activity. To establish the pre-RC, 
MCM2-7 must be stably associated with the origin, and this stability is achieved fol- 
lowing ATP hydrolysis, first by Cdc6 and then by ORC. 


Geminin Provides Another Control Over Replication Initiation 


Geminin is a protein that provides another level of regulatory control over DNA 
replication. Geminin inhibits DNA replication by preventing the incorporation of 
MCM complexes into the pre-RC. Geminin is active during S, Gə, and M phases, but 
its destruction during mitosis permits replication initiation in G;. Geminin binds to 
Cdtl, preventing it from recruiting MCM proteins to the pre-RC. Geminin exists as 
a parallel coiled-coil homodimer. It interacts with Cdtl in two ways: It has an array 
of glutamate residues on its surface that interact with positive charges on Cdtl, and 
an adjacent region on geminin interacts independently with the N-terminal region 
of Cdtl. 


Eukaryotic Cells Contain a Number of Different DNA Polymerases 


At least 19 different DNA polymerases have been described in eukaryotic cells thus 
far. These various polymerases have been assigned Greek letters in the order of their 
discovery (Table 28.4 lists the principal ones). Multiple DNA polymerases participate 
in leading and lagging-strand synthesis, but three—a, 6, and e—share the major bur- 
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FIGURE 28.13 Structure of the human PCNA homo- 
trimer. (a) Ribbon representation of the PCNA trimer 
(pdb id = 1AXQ) with an axial view of a B-form DNA 
duplex in its center. The molecular mass of each PCNA 
monomer is 37 kD. (b) Molecular surface of the PCNA 
trimer-DNA complex. 
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den. DNA polymerase æ has an associated primase subunit and functions in the initi- 
ation of nuclear DNA replication. Given a template, it not only synthesizes an RNA 
primer of about 10 nucleotides, but it then adds 20 to 30 deoxynucleotides to extend 
the chain in the 5’-3’ direction. DNA polymerase 6, a heterotetrameric enzyme, is 
the principal DNA polymerase in eukaryotic DNA replication. It interacts with PCNA 
protein (PCNA stands for proliferating cell nuclear antigen). Through its association 
with PCNA, DNA polymerase 6 carries out highly processive DNA synthesis. PCNA is 
the eukaryotic counterpart of the £E. coli Bsliding clamp; it clamps DNA polymerase 
ô to the DNA. Like By, PCNA encircles the double helix, but in contrast to the prokary- 
otic Bysliding clamp, PCNA is a homotrimer, not a homodimer (Figure 28.13). DNA 
polymerase € also plays a major role in DNA replication. It has an acidic C-terminal 
extension lacking in other DNA polymerases and evidence suggests that this domain 
is a sensor for DNA damage checkpoint control, halting DNA replication until the 
damage is repaired. DNA polymerase y is the DNA-replicating enzyme of mitochon- 
dria; DNA polymerase ß functions in DNA repair. The more recently discovered eu- 
karyotic DNA polymerases (including ¢, n, t, k, and Rev1) are novel in that they are 
more error-prone, resulting in lower fidelity of DNA replication. Nevertheless, they 
have the important ability to function in DNA replication and repair when damaged 
regions of DNA are encountered. 

Other proteins involved in eukaryotic DNA replication include replication 
protein A (RPA), an ssDNA-binding protein that is the eukaryotic counterpart 
of SSB, and replication factor C (RFC). RFC loads the PCNA sliding clamp onto 
replicating DNA, thus acting as the eukaryotic counterpart of the prokaryotic 
y-complex. 


28:5" How Are the Ends of Chromosomes Replicated? 


Telomeres are the structures at the ends of eukaryotic chromosomes. Telomeres 
are short (5 to 8 bp), tandemly repeated, G-rich nucleotide sequences that form 
protective caps 1 to 12 kbp long on the chromosome ends (see Chapter 11). Ver- 


28.5 How Are the Ends of Chromosomes Replicated? 875 





FIGURE 28.14 Telomere replication. (a) In replication of the lagging strand, short RNA primers are added (pink) 
and extended by DNA polymerase. When the RNA primer at the 5’-end of each strand is removed, there is no 
nucleotide sequence to read in the next round of DNA replication. The result is a gap (primer gap) at the 5’-end 
of each strand (only one end of a chromosome is shown in this figure). (b) Asterisks indicate sequences at the 
3’-end that cannot be copied by conventional DNA replication. Synthesis of telomeric DNA by telomerase 
extends the 5’-ends of DNA strands, allowing the strands to be copied by normal DNA replication. 


tebrate telomeres have a TTAGGG consensus sequence. Telomeres are necessary 
for chromosome maintenance and stability. DNA polymerases cannot replicate the 
extreme 5’-ends of chromosomes because these enzymes require a template and a 
primer and replicate only in the 5'-3’ direction. Thus, lagging strand synthesis at 
the 3’-ends of chromosomes is primed by RNA primase to form Okazaki fragments, 
but these RNA primers are subsequently removed, resulting in gaps in the progeny 
5'-terminal strands at each end of the chromosome after each round of replication 
(“primer gap”; see Figure 28.14). 

Telomerase is an RNA-dependent DNA polymerase. Telomerase maintains 
telomere length by restoring telomeres at the 3’-ends of chromosomes. The RNA 
upon which telomerase activity depends is actually part of the enzyme’s structure. 
That is, telomerase is a ribonucleoprotein, and its RNA component contains a 
9- to 30-nucleotide-long region that serves as a template for the synthesis of telom- 
eric repeats at DNA ends. The human telomerase RNA component is 450 nucleo- 
tides long; its template sequence is CUAACCCUAAC (base-pairs with TTAGGG). 
Telomerase uses the 3’-end of the DNA as a primer and adds successive TTAGGG 





HUMAN BIOCHEMISTRY 


Telomeres—A Timely End to Chromosomes? 





Mammalian cells in culture undergo only 50 or so cell divisions be- 
fore they die. Somatic cells are known to lack telomerase activity, 
and thus, they inevitably lose bits of their telomeres with each cell 
division. Telomerase activity is missing because the telomerase- 
reverse transcriptase gene (the TRT gene) is switched off. This fact 
has led to a telomere theory of cell aging, which suggests that cells 
senesce and die when their telomeres are gone. In support of this 


notion, a team of biologists headed by Calvin B. Harley at Geron 
Corporation used recombinant DNA techniques to express the cat- 
alytic subunit of human telomerase in skin cells in culture and ob- 
served that such cells divide 40 times more after cells lacking this 
treatment have become senescent. These results, although contro- 
versial, may have relevance to the aging process. 
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repeats to it, employing its RNA as template over and over again (Figure 28.14, 
see also the figure in the Chapter 11 Human Biochemistry box Telomeres and 
Tumors). 


28:6: How Are RNA Genomes Replicated? 


Many viruses have genomes composed of RNA, not DNA. How then is the informa- 
tion in these RNA genomes replicated? In 1964, Howard Temin noted that in- 
hibitors of DNA synthesis prevented infection of cells in culture by RNA tumor 
viruses such as avian sarcoma virus. On the basis of this observation, Temin pro- 
posed that DNA is an intermediate in the replication of such viruses; that is, an RNA 
tumor virus can use viral RNA as the template for DNA synthesis. 


RNA viral chromosome—-> DNA intermediate——> RNA viral chromosome 


In 1970, Temin and David Baltimore independently discovered a viral enzyme ca- 
pable of mediating such a process, namely, an RNA-directed DNA polymerase or, as 
it is usually called, reverse transcriptase. All RNA tumor viruses contain such an en- 
zyme within their virions (viral particles). RNA viruses that replicate their RNA 
genomes via a DNA intermediate are classified as retroviruses. 

Like other DNA and RNA polymerases, reverse transcriptase synthesizes polynu- 
cleotides in the 5’—3' direction, and like all DNA polymerases, reverse transcriptase 
requires a primer. Interestingly, the primer is a specific tRNA molecule captured by 
the virion from the host cell in which it was produced. The 3’-end of the tRNA is base- 
paired with the viral RNA template at the site where DNA synthesis initiates, and its 
free 3'-OH accepts the initial deoxynucleotide once transcription commences. Re- 
verse transcriptase then transcribes the RNA template into a complementary DNA 
(cDNA) strand to form a double-stranded DNA: RNA hybrid. 


The Enzymatic Activities of Reverse Transcriptases 


Reverse transcriptases possess three enzymatic activities, all of which are essential to 
viral replication: 


1. RNA-directed DNA polymerase activity, for which the enzyme is named (see Figure 
12.10). 

2. RNase H activity. Recall that RNase H is a nuclease that specifically degrades RNA 
chains in DNA:RNA hybrids (see Figure 12.10). The RNase H function of re- 
verse transcriptase is an exonuclease activity that degrades the template genomic 
RNA and also removes the priming tRNA after DNA synthesis is completed. 

3. DNA-directed DNA polymerase activity. This activity replicates the ssDNA remaining af- 
ter RNase H degradation of the viral genome, yielding a DNA duplex. This DNA du- 
plex directs the remainder of the viral infection process or becomes integrated into 


A DEEPER LOOK 


RNA as Genetic Material 





Whereas the genetic material of cells is dsDNA, virtually all plant 
viruses, several bacteriophages, and many animal viruses have 
genomes consisting of RNA. In most cases, this RNA is single 
stranded. Viruses with single-stranded genomes use the single 
strand as a template for synthesis of a complementary strand, 
which can then serve as template in replicating the original strand. 
Retroviruses are an interesting group of eukaryotic viruses with 


single-stranded RNA genomes that replicate through a dsDNA in- 
termediate. Furthermore, the life cycle of retroviruses includes an 
obligatory step in which the dsDNA is inserted into the host cell 
genome in a transposition event. Retroviruses are responsible for 
many diseases, including tumors and other disorders. HIV-1, the 
human immunodeficiency virus that causes AIDS, is a retrovirus. 
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the host chromosome, where it can lie dormant for many years as a provirus. Acti- 
vation of the provirus restores the infectious state. 


HIV reverse transcriptase is of great clinical interest because it is the enzyme for 
replication of the AIDS virus. DNA synthesis by HIV reverse transcriptase is blocked 
by nucleotide analogs such as AZT and 3TC (Figure 28.15). HIV reverse transcrip- 
tase incorporates these analogs into growing DNA chains in place of dTMP (in the 
case of AZT) or dCMP (in the case of 3TC). Once incorporated, these analogs block 
further chain elongation because they lack a 3’-OH where the next incoming dNTP 
can be added. HIV reverse transcriptase is error-prone: It incorporates the wrong 
base at a frequency of 1 per 2000 to 4000 nucleotides polymerized. This high error 
rate during replication of the HIV genome means that the virus is ever changing, a 
feature that makes it difficult to devise an effective vaccine. 


287a How Is the Genetic Information Shuffled 
by Genetic Recombination? 


Genetic recombination is the natural process by which genetic information is re- 
arranged to form new associations. For example, compared to their parents, prog- 
eny may have new combinations of traits because of genetic recombination. At the 
molecular level, genetic recombination is the exchange (or incorporation) of one 
DNA sequence with (or into) another. When recombination involves reaction be- 
tween very similar sequences (homologous sequences) of DNA, the process is called 
homologous recombination. When very different nucleotide sequences recombine, 
it’s nonhomologous recombination. Transposition—the enzymatic insertion of a 
transposon (a mobile segment of DNA, see pages 885-886) into a new location in the 
genome—and nonhomologous recombination (incorporation of a DNA segment 
whose sequence differs greatly from the DNA at the point of insertion) are two types 
of recombination that play a significant evolutionary role. Nonhomologous recom- 
bination occurs at a low frequency in all cells and serves as a powerful genetic force 
that reshapes the genomes of all organisms. Homologous recombination involves 
an exchange of DNA sequences between homologous chromosomes, resulting in 
the arrangement of genes into new combinations. Homologous recombination is 
generally used to fix the DNA so that information is not lost. For example, large le- 
sions in DNA are repaired via recombination of the damaged chromosome with a 
homologous chromosome. 

The process underlying homologous recombination is termed general recombi- 
nation because the enzymatic machinery that mediates the exchange can use essen- 
tially any pair of homologous DNA sequences as substrates. Homologous recombi- 
nation occurs in all organisms and is particularly prevalent during the production of 
gametes (meiosis) in diploid organisms. In higher animals—that is, those with im- 
mune systems—recombination also occurs in the DNA of somatic cells responsible 
for expressing proteins of the immune response, such as the immunoglobulins. This 
somatic recombination rearranges the immunoglobulin genes, dramatically increas- 
ing the potential diversity of immunoglobulins available from a fixed amount of ge- 
netic information. Homologous recombination can also occur in bacteria. Indeed, 
even viral chromosomes undergo recombination. For example, if two mutant viral 
particles simultaneously infect a host cell, a recombination event between the two 
viral genomes can lead to the formation of a virus chromosome that is wild-type. 


General Recombination Requires Breakage and Reunion 
of DNA Strands 


Recombination occurs by the breakage and reunion of DNA strands so that a phys- 
ical exchange of parts takes place. Matthew Meselson and J. J. Weigle demonstrated 
this in 1961 by coinfecting E. coli with two genetically distinct bacteriophage A 


(a) 





(b) 





FIGURE 28.15 The structures of AZT (3'-azido-2',3'- 
dideoxythymidine) and 3TC (2’,3’-dideoxy-3’- 
thiacytidine). These nucleosides are phosphorylated 
in vitro to form deoxynucleoside-5‘-triphosphate sub- 
strate analogs for HIV reverse transcriptase. 
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“| CENGAGENOW” ANIMATED FIGURE 28.16 Meselson and Weigle’s experiment. Density-labeled,"heavy” 
phage, symbolized as ABC phage, was used to coinfect bacteria along with “light” phage, the abc phage. 
The progeny from the infection were collected and subjected to CsCl density gradient centrifugation. 
Parental-type ABC and abc phage were well separated in the gradient, but recombinant phage (ABc, Abc, 
aBc, aBC, and so on) were distributed diffusely between the two parental bands because they contained 
chromosomes constituted from fragments of both “heavy” and “light” DNA. See this figure animated at 
www.cengage.com/login. 


strains, one of which had been density-labeled by growth in !8C- and !°N-containing 
media (Figure 28.16). The phage progeny were recovered and separated by CsCl 
density gradient centrifugation. Phage particles that displayed recombinant geno- 
types were distributed throughout the gradient while parental (nonrecombinant) 
genotypes were found within discrete “heavy” and “light” bands in the density gra- 
dient. The results showed that recombinant phage contained DNA derived in vary- 
ing proportions from both parents. The obvious explanation is that these recombi- 
nant DNAs arose via the breakage and rejoining of DNA molecules. 

A second important observation made during this type of experiment was that 
some of the plaques formed by the phage progeny contained phage of two differ- 
ent genotypes, even though each plaque was caused by a single phage infecting one 
bacterium. Therefore, some infecting phage chromosomes must have contained a 
region of heteroduplex DNA, duplex DNA in which a part of each strand is con- 
tributed by a different parent (Figure 28.17). 


Homologous Recombination Proceeds According 
to the Holliday Model 


In 1964, Robin Holliday proposed a model for homologous recombination that has 
proved to be correct in its essential features (Figure 28.18). The two homologous 
DNA duplexes are first juxtaposed so that their sequences are aligned. This process 
of chromosome pairing is called synapsis (Figure 28.18a). Holliday suggested that 
recombination begins with the introduction of single-stranded nicks in the DNA at 
homologous sites on the two paired chromosomes (Figure 28.18b). The two du- 
plexes partially unwind, and the free, single-stranded end of one duplex begins to 
base-pair with its nearly complementary, single-stranded region along the intact 
strand in the other duplex, and vice versa (Figure 28.18c). This strand invasion is 
followed by ligation of the free ends from different duplexes to create a cross- 


+- +- +- 
X X X X X X 
Semiconservative = 
X y DNA replication Y Y y y 
Z Z Z Z Z Z 


Heteroduplex DNA Progeny phage of 2 different 
(+ and — strands) genotypes 





FIGURE 28.17 The generation of progeny bacteriophage of two different genotypes from a single phage parti- 
cle carrying a heteroduplex DNA region within its chromosome. The heteroduplex DNA is composed of one 
strand that is genotypically XYZ (the + strand), and the other strand that is genotypically XyZ (the — strand). 
That is, the genotype of the two parental strands for gene Y is different (one is Y, the other y). 
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CENGAGENOW” ACTIVE FIGURE 28.18 The Holliday 
model for homologous recombination. The + signs 
and — signs label strands of like polarity. For exam- 
ple, assume that the two strands running 5’3' as 
read left to right are labeled +; and the two strands 
running 3’-95’ as read left to right are labeled —. 
Only strands of like polarity exchange DNA during 
recombination. (See text for detailed description.) 
Test yourself on the concepts in this figure at 
www.cengage.com/login. 
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stranded intermediate known as a Holliday junction (Figure 28.18d). The cross- 
stranded junction can now migrate in either direction (branch migration) by un- 
winding and rewinding of the two duplexes (Figure 28.18e). Branch migration re- 
sults in strand exchange; heteroduplex regions of varying length are possible. In 
order for the joint molecule formed by strand exchange to be resolved into two 
DNA duplex molecules, another pair of nicks must be introduced. Resolution can 
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be represented best if the duplexes are drawn with the chromosome arms bent “up” 
or “down” to give a planar representation (Figure 28.18f). Nicks then take place, ei- 
ther at E and W, that is, in the — strands that were originally nicked (see Figure 
28.18b), or at N and S, that is, in the + strands (the strands not previously nicked). 
Duplex resolution is most easily kept straight by remembering that + strands are 
complementary to — strands and any resultant duplex must have one of each. Nicks 
made in the strands originally nicked lead to DNA duplexes in which one strand of 
each remains intact. Although these duplexes contain heteroduplex regions, they 
are not recombinant for the markers (AZ, az) that flank the heteroduplex region; 
such heteroduplexes are called patch recombinants (Figure 28.18g). Nicks intro- 
duced into the two strands not previously nicked yield DNA molecules that are both 
heteroduplex and recombinant for the markers A/a and Z/z; these heteroduplexes 
are termed splice recombinants (Figure 28.18h). Although this Holliday model ex- 
plains the outcome of recombination, it provides no mechanistic explanation for 
the strand exchange reactions and other molecular details of the process. 


The Enzymes of General Recombination Include RecA, RecBCD, 
RuvA, RuvB, and RuvC 


To illustrate recombination mechanisms, we focus on general recombination as it 
occurs in E. coli. The principal players in the process are the RecBCD enzyme com- 
plex, which initiates recombination; the RecA protein, which binds single-stranded 
DNA, forming a nucleoprotein filament capable of strand invasion and homolo- 
gous pairing; and the RuvA, RuvB, and RuvC proteins, which drive branch migra- 
tion and process the Holliday junction into recombinant products. Eukaryotic 
homologs of these prokaryotic recombination proteins have been identified, indi- 
cating that the fundamental process of general recombination is conserved across 
all organisms. 


The RecBCD Enzyme Complex Unwinds dsDNA and Cleaves 
Its Single Strands 


The RecBCD complex is composed of the proteins RecB (140 kD; 1180 amino acids), 
RecC (130 kD; 1122 amino acids), and RecD (67 kD; 608 amino acids). This multi- 
functional enzyme complex has both helicase and nuclease activity and initiates re- 
combination by attaching to the end of a DNA duplex (or at a double-stranded break 
in the DNA) and using its ATP-dependent helicase function to unwind the dsDNA 
(Figure 28.19a). RecB and RecD are helicases powered by ATP hydrolysis; each con- 
sumes an ATP per base pair of DNA traversed. The RecD motor is faster than the 
RecB motor and leads the way. The greater speed of RecD causes the DNA to loop 
out between RecD and RecB. The RecB subunit contains the nuclease domain. As 
RecBCD progresses along unwinding the duplex, the RecB nuclease activity cleaves 
both of the newly formed single strands (although the strand that provided the 3’-end 
at the RecBCD entry site is cut more frequently than the 5’-terminal strand [Figure 
28.19b]). SSB (and some RecA protein) readily binds to the emerging single strands. 
Sooner or later, RecBCD encounters a particular nucleotide sequence, a so-called Chi 
(or x) site, characterized by the sequence 5’-GCTGGTGG-3’. These y sites are re- 
combinational “hot spots”; 1009 y sites have been identified in the FE. coli genome 
(on average, about one every 4.5 kb of DNA). When a y sequence is encountered by 
a RecBCD complex approaching its 3’-side (the ..G-3'-side), RecBCD cleaves the 
x-bearing DNA strand four to six bases to the 3’ side of y (Figure 28.19c). RecBCD 
flips so that the RecB motor leads the way, and the RecBCD complex no longer ex- 
presses nuclease activity against the 3'-terminal strand, but nuclease activity against 
the 5’-terminal strand increases (Figure 28.19d). 

Resuming its helicase function, RecBCD unwinds the dsDNA, and collectively 
these processes generate an ssDNA tail bearing a y site at its 3’-terminal end. 
This ssDNA may reach several kilobases in length. RecA protein now binds to the 
3'-terminal strand to form a nucleoprotein filament (Figure 28.19e). This filament 
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is active in pairing and strand invasion with a homologous region in another dsDNA 
molecule. 


The RecA Protein Can Bind ssDNA and Then Interact with Duplex DNA 


The RecA protein, or recombinase, is a multifunctional protein that acts in general 
recombination (Figure 28.20a). RecA mediates the ATP-dependent DNA strand 
exchange reaction leading to formation of a Holliday junction (Figure 28.18b-—f). In 
the presence of ATP and ssDNA, RecA forms a right-handed helical filament having 


FIGURE 28.19 Model of RecBCD-dependent initiation of 
recombination. (a) RecBCD binds to a duplex DNA end, 
and its helicase activity begins to unwind the DNA dou- 
ble helix."Rabbit ears” of ssDNA loop out from RecBCD 
because the rate of DNA unwinding exceeds the rate of 
ssDNA release by RecBCD. (b) As it unwinds the DNA, 
SSB (and some RecA) bind to the single-stranded re- 
gions; the RecBCD endonuclease activity randomly 
cleaves the ssDNA, showing a greater tendency to cut 
the 3’-terminal strand rather than the 5’-terminal strand. 
(c) When RecBCD encounters a properly oriented y site, 
the 3’-terminal strand is cleaved just below the 3’-end 
of x. (d) RecBCD now directs the binding of RecA to the 
3’-terminal strand, as RecBCD endonuclease activity now 
acts more often on the 5’-terminal strand. (e) A nucleo- 
protein filament consisting of RecA-coated 3'-strand 
ssDNA is formed. This nucleoprotein filament is capable 
of homologous pairing with a dsDNA and strand inva- 
sion. (Adapted from Figure 2 in Eggleston, A. K, and West, S.C., 
1996. Exchanging partners: recombination in E. coli. Trends in 
Genetics 12:20-25; and Figure 3 in Eggleston, A. K., and West, S. C, 
1997. Recombination initiation: Easy as A,B,C, D ....v? Current 
Biology 7:R745-R749.) 
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FIGURE 28.20 (a) RecA, a 352-residue, 38-kD protein, 
with dATP in the ATP-binding site (pdb id = 2ODN). 
(b) Structure of a fused RecA hexamer with bound ssDNA 
(as poly(dT);s, magenta). Each of the six RecA units is a dif- 
ferent color and each has a bound ADP molecule (marine 
blue). The adenine nucleotide-binding sites lie at the inter- 
faces between RecA units (pdb id = 3CMU). 





SSB Homologous 
RecA DNA duplex 





(a) N-term 





six monomers per turn, with each monomer spanning about three nucleotides of 
DNA. The RecA nucleoprotein filament serves as a scaffold upon which the events of 
recombination take place. This filament has a deep spiral groove large enough to en- 
compass three strands of DNA. Although RecA-bound ssDNA is relatively stretched 
and underwound, with about 18.5 nucleotides per turn, its local conformation re- 
sembles B-DNA (Figure 28.20b). Thus, when dsDNA adds to the complex, the ssDNA 
is poised to search for the sequence homology that leads to strand exchange. 
Procession of strand separation of dsDNA and the re-pairing into hybrid strands 
along the DNA duplex initiates branch migration (Figure 28.21b). Branch migration 
drives the displacement of the homologous DNA strand from the DNA duplex and 
its replacement with the ssDNA strand, a process known as single-strand assimilation 


FIGURE 28.21 Model for homologous recombination as promoted by RecA enzyme. (a) RecA protein (and SSB) aid 
strand invasion of the 3’-ssDNA into a homologous DNA duplex, (b) forming a D-loop. (c) The D-loop strand that 
has been displaced by strand invasion pairs with its complementary strand in the original duplex to form a Holli- 
day junction as strand invasion continues. 
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CENGAGENOW” ACTIVE FIGURE 28.22 Model for the resolution of a Holliday junction in E. coli by the RuvA, 
RuvB, and RuvC proteins. (a) Ribbon diagram of the RuvA tetramer. RuvA monomers have an overall L shape 

pdb id = 1CUK). (b) Model for RuvA/RuvB action (left). The RuvA tetramer fits snugly within the Holliday junction 
point. (center) Oppositely facing RuvB hexameric rings assemble on the heteroduplexes, with the DNA passing 
hrough their centers. These RuvB hexamers act as motors to promote branch migration by driving the passage 
of the DNA duplexes through themselves. (right) Binding of RuvC at the Holliday junction and strand scission by 
its nuclease activity. The locations of the RuvC active sites are indicated by the scissors. (c) Charge distribution on 
he concave surface of an RuvA tetramer. Blue indicates positive charge and red, negative charge. Note the overall 
positive charge on this surface of (RuvA),, with the exception of the four red (negatively charged) pins at its 
center. (d) Structural model for the interaction of (RuvA)4 with the hypothesized square-planar Holliday junction 
center (pdb id = 1C7Y). (Adapted from Figures 1, 2, and 3 in Rafferty, J.B, et al, 1996. Crystal structure of DNA recombination 
protein RuvA and a model for its binding to the Holliday junction. Science 274:415-421.) Test yourself on the concepts in 
this figure at www.cengage.com/login. 











(or single-strand uptake). Strand assimilation does not occur if there is no sequence 
homology between the ssDNA and the invaded DNA duplex. The DNA strand dis- 
placed by the invading 3’-terminal ssDNA is free to anneal with the 5’-terminal 
strand in the original DNA, a step that is also mediated by RecA protein and SSB 
(Figure 28.21c). The result is a Holliday junction, the classic intermediate in genetic 
recombination. 


RuvA, RuvB, and RuvC Proteins Resolve the Holliday Junction 
to Form the Recombination Products 


The Holliday junction is processed into recombination products by RuvA (203 amino 
acids), RuvB (336 amino acids), and RuvC (173 amino acids). Specifically, RuvA and 
RuvB work together as a Holliday junction-specific helicase complex that dissociates 
the RecA filament and catalyzes branch migration. An RuvA tetramer (Figure 28.22a) 
fits precisely within the junction point (Figure 28.22b), which has a square-planar 
geometry, and this RuvA tetramer mediates the assembly of RuvB around opposite 
arms of the DNA junction. The RuvB protein binds to form two oppositely oriented, 
hexameric [(RuvB)«] ring structures encircling the dsDNAs, one on each side of the 
Holliday junction. Rotation of the dsDNAs by the RuvB hexameric rings pulls the 
dsDNAs through (RuvB), and unwinds the DNA strands across the “spool” of RuvA, 
which threads the separated single strands into newly forming hybrid (recombinant) 
duplexes (Figure 28.22b). The RuvA tetramer is a disclike structure, one face of which 
has an overall positive charge (Figure 28.22c), with the exception of four negatively 
charged central pins, each contributed by a pair of residues (Glu® and Asp®) from 
each RuvA monomer. These four pins fit neatly into the hole at the center of the 
Holliday junction. The negatively charged sugar—phosphate backbones of the four 
DNA duplexes of the Holliday junction are threaded along grooves in the positively 
charged RuvA face, with the negatively charged central pins appropriately situated to 
transiently separate the dsDNA molecules into their component single strands 
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A DEEPER LOOK 


The Three R’s of Genomic Manipulation: Replication, Recombination, and Repair 


DNA replication, recombination, and repair have traditionally 
been treated as separate aspects of DNA metabolism. However, 
scientists have come to realize that DNA replication is an essen- 
tial component of both DNA recombination and DNA repair 
processes. Furthermore, recombination mechanisms play an ab- 
solutely vital role in restarting replication forks that become 
halted at breaks or other lesions in the DNA strands. If a double- 
stranded break (DSB) or a nick in just one of the DNA strands 
(called a single-stranded gap, or SSG) is present in the DNA under- 
going replication, the replication fork stalls and the replication 
complex dissociates (replication fork “collapse”). Significantly, the 


whole process of homologous recombination can initiate only at 
SSGs or DSBs, and establishment of homologous recombination 
at such sites can rescue DNA replication. This recombination- 
dependent replication (RDR) has the interesting property of ini- 
tiating DNA replication at sites other than the orC site, and thus 
RDR is an important mechanism for restarting DNA replication if 
the replication fork is disrupted for any reason. As might be ex- 
pected from the close relationships between replication, recom- 
bination, and repair, many of the same proteins are involved in all 
three, and all three must be viewed as essential processes in the 
perpetuation of the genome. 








through repulsive electrostatic interactions with the phosphate backbones of the DNA. 
The separated strands of each parental duplex are then channeled into grooves in the 
RuvA face, where they are led into hydrogen-bonding interactions with bases con- 
tributed by strands of the other parental DNA to form the two daughter hybrid du- 
plexes flowing out from the RuvAB complex (Figure 28.22b). Figure 28.22d illustrates 
a model for the RuvA tetramer with the square-planar Holliday junction. 

Depending on how the strands in the Holliday junction are cleaved and resolved, 
patch or splice recombinant duplexes result (Figure 28.18g and h). RuvC is an endo- 
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“Knockout” Mice: A Method to Investigate the Essentiality of a Gene 


Homologous recombination can be used to replace a gene with an 
inactivated equivalent of itself. Inactivation is accomplished by in- 
serting a foreign gene, such as neo, a gene encoding resistance to 
the drug G418, within one of the exons of a copy of the gene of in- 
terest. Homologous recombination between the neo-bearing trans- 
gene and DNA in wild-type mouse embryonic stem (ES) cells 
replaces the target gene with the inactive transgene (see accom- 
panying figure). ES cells in which homologous recombination has 
occurred will be resistant to G418, and such cells can be selected. 


1 2 3 


These recombinant ES cells can then be injected into early-stage 
mouse embryos, where they have a chance of becoming the 
germline cells of the newborn mouse. If they do, an inactivated 
target gene is then present in the gametes of this mouse. Mating 
between male and female mice with inactive target genes yields a 
generation of homozygous “knockout” mice—mice lacking a func- 
tional copy of the targeted gene. Characterization of these knock- 
out mice reveals which physiological functions the gene directs. 
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nuclease that resolves Holliday junctions into heteroduplex recombinant products 
(RuvC resolvase). An RuvC dimer binds at the Holliday junction and cuts pairs of 
DNA strands of similar polarity (Figure 28.22b); whether a patch or a splice recom- 
binant results depends on which DNA pair is cleaved. 
RuvB hexamers are AAA*-ATPase-type, DNA-driving molecular motors; similar 
motors act during DNA replication to propel strand separation and initiate DNA 
synthesis. Thus, the RuvABC system for processing Holliday junctions may repre- 
sent a general paradigm for DNA manipulation in all cells. 


Recombination-Dependent Replication Restarts DNA Replication 
at Stalled Replication Forks 


It is likely that most replication forks that begin at the E. coli oriC initiation sites (or 
analogous initiation sites in eukaryotes) are derailed by nicks or more extensive DNA 
damage lying downstream. However, DNA replication can be reinitiated (and 
genome replication can be completed) following replication fork restart. Recombi- 
national repair of stalled replication forks requires the action of enzymes from every 
aspect of DNA metabolism: replication, recombination, and repair. The initial steps 
in restoration of a replication fork depend on the recombination proteins RecA and 
RecBCD and the formation of a D-loop (Figure 28.21). The £E. coli protein PriA rec- 
ognizes and binds with high affinity to D-loops. Once bound, PriA coordinates 
resumption of DNA replication by recruiting DnaB helicase to the D-loop and 
reestablishing a replication fork complete with two copies of the replicative DNA 
polymerase. The ability of RecA to mediate recombinational repair of stalled repli- 
cation forks is undoubtedly the reason for the presence of RecA-type proteins in vir- 
tually all organisms. Indeed, the recombination system, a feature common to all 
cells, evolved to carry out this essential repair purpose. 


Transposons Are DNA Sequences That Can Move from Place 
to Place in the Genome 


In 1950, Barbara McClintock reported the results of her studies on an activator gene 
in maize (Zea mays, or as it’s usually called, corn) that was recognizable principally by 
its ability to cause mutations in a second gene. Activator genes were thus an internal 
source of mutation. A most puzzling property was their ability to move relatively freely 


HUMAN BIOCHEMISTRY 





The Breast Cancer Susceptibility Genes BRCA1 and BRCA2 Are Involved 
in DNA Damage Control and DNA Repair 





Mutations in the BRCA] and BRCA2 genes cause increased likeli- 
hood of breast, ovarian, and other cancers. The BRCAI protein 
functions in regulation of the cell cycle in response to DNA damage 
control. Phosphorylation of BRCA1 by DNA damage-response pro- 
teins controls the expression, phosphorylation, and cellular local- 
ization of specific cyclin-CDKs involved in the cell cycle Go/M 
checkpoint and the onset of mitosis. Activation of these cyclin-CDKs 
leads to arrest of the cell cycle in Gg until the damage to DNA is re- 
paired. Mutations in BRCA1 that impair its function allow the cell 
cycle to enter mitosis and DNA damage to accumulate, raising the 
risk of cancer. 

The BRCA2 protein participates in the pathway for DNA repair 
by homologous recombination. The BRCA2 protein (3418 amino 
acids) is a very large protein with 8 conserved sequence motifs of 
about 30 amino acids each, known as the BRC repeats. These re- 


peats act as binding sites for RAD51, the eukaryotic analog of 
RecA. BRCA2 transfers RAD51 to a ssDNA strand coated with RPA 
(the eukaryotic counterpart of SSB), allowing formation of the 
RAD51-ssDNA nucleoprotein filament that is an essential inter- 
mediate in eukaryotic homologous recombination (see Figure 
28.20b). Mutations that impair BRCA2 function prevent DNA re- 
pair by homologous recombination, leading to accumulation of 
DNA damage and a greater likelihood of cancer. 


Source: Yarden, R. I., et al., 2003. BRCA1 regulates the G/M checkpoint 
by activating Chk] kinase upon DNA damage. Nature Genetics 30:285-289; 
Simon, N., Powell, S. M., Willers, H., and Xia, F., 2003. BRCA2 keeps Rad51 
in line: High-fidelity homologous recombination prevents breast and ovar- 
ian cancer? Molecular Cell 10:1262-1263; and Pelligrini, L., et al., 2002. In- 
sights into DNA recombination from the structure of a RAD51-BRCA2 
complex. Nature 420:287-293. 
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CENGAGENOW” ACTIVE FIGURE 28.23 The typical 
transposon has inverted nucleotide-sequence repeats 
at its termini, represented here as the 12-bp sequence 
ACGTACGTACGT (a). It acts at a target sequence (shown 
here as the sequence CATGC) within host DNA by creat- 
ing a staggered cut (b) whose protruding single- 
stranded ends are then ligated to the transposon (c). 
The gaps at the target site are then filled in, and the 
filled-in strands are ligated (d). Transposon insertion 
thus generates direct repeats of the target site in the 
host DNA, and these direct repeats flank the inserted 
transposon. Test yourself on the concepts in this 
figure at www.cengage.com/login. 


about the genome. Scientists had labored to establish that chromosomes consisted of 
genes arrayed in a fixed order, so most geneticists viewed as incredible this idea of 
genes moving around. The recognition that McClintock so richly deserved for her 
explanation of this novel phenomenon had to await verification by molecular biolo- 
gists. In 1983, Barbara McClintock was finally awarded the Nobel Prize in Physiology 
or Medicine. By this time, it was appreciated that many organisms, from bacteria to hu- 
mans, possessed similar “jumping genes” able to move from one site to another in the 
genome. This mobility led to their designation as mobile elements, transposable 
elements, or, simply, transposons. 

Transposons are segments of DNA that are moved enzymatically from place to 
place in the genome (Figure 28.23). That is, their location within the DNA is unsta- 
ble. Transposons range in size from several hundred base pairs to more than 8 kbp. 
Transposons contain a gene encoding an enzyme necessary for insertion into a chro- 
mosome and for the remobilization of the transposon to different locations. These 
movements are termed transposition events. The smallest transposons are called 
insertion sequences, or ISs, signifying their ability to insert apparently at random in 
the genome. Insertion into a new site can cause a mutation if a gene or regulatory re- 
gion at the site is disrupted. Because transposition events can move genes to new 
places or lead to the duplication of existing genes, transposition is a major force in 
evolution. 
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28:8: Can DNA Be Repaired? 


Biological macromolecules are susceptible to chemical alterations that arise from 
environmental damage or errors during synthesis. For RNAs, proteins, or other cel- 
lular molecules, most consequences of such damage are avoided by replacement of 
these molecules through normal turnover (synthesis and degradation). However, 
the integrity of DNA is vital to cell survival and reproduction. Its information con- 
tent must be protected over the life span of the cell and preserved from generation 
to generation. Safeguards include (1) high-fidelity replication systems and (2) re- 
pair systems that correct DNA damage that might alter its information content. 
DNA is the only molecule that, if damaged, is repaired by the cell. Usually, accurate 
repair is possible because the information content of duplex DNA is inherently re- 
dundant; the nucleotide sequence in one strand is directly related to the sequence 
in the other. DNA damage may arise from endogenous processes or from exoge- 
nous agents, such as UV light, ionizing radiation, or mutagenic chemicals. The most 
common forms of endogenous DNA damage arise from chemical reactions (oxi- 
dation, alkylation, or deamination of bases) or loss of bases due to cleavage of 
N- glycosidic bonds. Exogenous agents can damage DNA in a variety of ways, in- 
cluding UV-induced free-radical generation and crosslinking of adjacent pyrim- 
idines, breakage of the polynucleotide backbone by ionizing radiation, and base 
modifications through chemical reactions. Cells have extraordinarily diverse and ef- 
fective systems to repair these lesions in DNA so that the genetic information is not 
lost or altered. The human genome has 150 or so genes associated with DNA repair. 
DNA repair systems include direct reversal damage repair, single-strand damage re- 
pair, double-stranded break (DSB) repair, and translesion DNA synthesis. 

Chemical reactions that reverse the damage, returning the DNA to its proper 
state, are called direct reversal repair systems. Examples include methyltransferases 
to remove methyl groups from chemically modified bases and photolyase, which 
repairs thymine dimers, as discussed in a later section (see Figure 28.27). Single- 
strand damage repair relies on the intact complementary strand to guide repair. Sys- 
tems repairing this sort of DNA damage include mismatch repair (MMR), base 
excision repair (BER), and nucleotide excision repair (NER). Such systems will be 
described further on. 

DSBs are a particular threat to genome stability, because lost sequence informa- 
tion cannot be recovered from the same DNA double helix. Because the threat is so 
severe, cells have several systems to repair DSBs. The simplest way to repair a DSB 
is to rejoin the broken strands through nonhomologous DNA end-joining, or NHEJ 
(Figure 28.24). This repair can occur any time in the cell cycle. A key problem in 
NHE] is to keep the ends near one another so that the fragments can be linked to- 
gether again. A heterodimeric protein, Ku70/80, binds the DNA ends and recruits 
a set of proteins that juxtaposes the damaged ends, repairs them, and religates them 
without the use of a repair template. (Some proteins involved in repairing DSBs also 
function in immunoglobulin gene rearrangements; see Figure 28.39.) The lack of a 
proper template for NHEJ means that it is error-prone. 

DSBs that arise during the S phase of the cell cycle can be repaired through ho- 
mologous recombination (Figure 28.25). The intact sister chromatid of the dam- 
aged DNA duplex guides the process. Processing of the DSB creates single-stranded 
tails that become substrates for RecA-mediated nucleoprotein filament formation 
and homology recognition within the sister chromatid. DNA synthesis propagated 
from the D-loop and migration of the Holliday junction, followed by strand cleav- 
age within the Holliday junction by resolvase, either restores the DNA to its intact 
state (Figure 28.25g, left) or results in strand exchange between the two sister chro- 
matids (Figure 28.25g, right). 

What is effectively a DSB can arise during DNA replication if DNA damage 
causes the replication fork to stall (Figure 28.26). Suppose a lesion in the leading 
strand (circle, Figure 28.26b) causes leading-strand synthesis to stall. Continued 
lagging-strand synthesis results in a single-stranded region on the leading-strand 
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FIGURE 28.24 DSB repair through nonhomologous DNA 
end joining (NHEJ). Ku70/80 binds the ends and recruits 
a set of proteins that juxtaposes the broken ends. Pro- 
cessing of the ends to generate proper substrates for 
DNA ligase IV then occurs, followed by DNA-ligase- 
mediated end joining. (Adapted from Figure 1 in Wyman, C, 
and Kanaar, R., 2006. DNA double-strand break repair: All's well 
that ends well. Annual Review of Genetics 40:363-383.) 
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FIGURE 28.25 DSB repair through homologous DNA recombination. The orange-red pair 
of lines symbolizes the double-stranded DNA with a DSB; the black-blue pair represents the 
sister chromatid. Homologous recombination creates a D-loop (c), and sister chromatid- 
directed DNA replication restores the information content of the damaged duplex (d-f). 
Depending on how the Holliday junctions are resolved, the products (g) are either (left) 
noncrossover or (right) crossover recombinants. (Adapted from Figure 2 in Wyman, C., and Kanaar, 
R. 2006. DNA double-strand break repair: All's well that ends well. Annual Review of Genetics 40:363-383.) 
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FIGURE 28.26 Restarting a stalled replication fork through ho- 
mologous DNA recombination. A lesion in the DNA is symbol- 
ized by a circle; in this case, the lesion is in the leading-strand 
template (a). Leading-strand synthesis halts because of the le- 
sion (b). Lagging-strand synthesis (red) continues, and the 
Okazaki fragments are ligated (c). When the leading strand in- 
vades the new DNA duplex formed by lagging-strand synthesis, 
a D-loop is formed and strand exchange occurs. Using the lag- 
ging strand as a template, synthesis of the leading strand (black) 
resume 
from Fig 
repair: All's well that ends well. Annual Review of Genetics 40:363-383.) 


s (d), and the replication fork is reestablished (e). (Adapted 
ure 3 in Wyman, C., and Kanaar, R., 2006. DNA double-strand break 








template. Leading-strand invasion of the new lagging-strand duplex (mediated by 
RecA) creates a D-loop (Figure 28.26c), and leading-strand synthesis directed by 
the lagging-strand template (Figure 29.26d) reestablishes a competent replication 
fork (Figure 28.26e). The site of the lesion is repaired later, usually by nucleotide 
excision repair (see following section). Homologous recombination-mediated 
restarting of replication forks is believed to be the evolutionary driving force for 
the emergence of homologous recombination. Even if the NHEJ or homologous 
recombination repair systems fail to act, the genome may be preserved if an “error- 
prone” mode of replication allows the lesion to be bypassed. Such translesion DNA 
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Transgenic Animals Are Animals Carrying Foreign Genes 


Experimental advances in gene transfer techniques have made it 
possible to introduce genes into animals by transfection. Transfec- 
tion is defined as the uptake or injection of plasmid DNA into re- 
cipient cells. Animals that have acquired new genetic information as 
a consequence of the introduction of foreign genes are termed 
transgenic. Plasmids carrying the gene of interest are injected into 
the nucleus of an oocyte or fertilized egg, and the egg is then im- 
planted into a receptive female. The technique has been perfected 
for mice (see figure, part a). In a small number of cases—10% or 
so—the mice that develop from the injected eggs carry the trans- 
fected gene integrated into a single chromosomal site. The gene is 
subsequently inherited by the progeny of the transfected animal as 
if it were a normal gene. Expression of the donor gene in the trans- 
genic animals is variable because the gene is randomly integrated 
into the host genome and gene expression is often influenced by 
chromosomal location. Nevertheless, transfection of animals has 
produced some startling results, as in the case of the transfection of 
mice with the gene encoding the rat growth hormone (rGH). The 
transgenic mice grew to nearly twice the normal size (see figure, 
part b). Growth hormone levels in these animals were several hun- 
dred times greater than normal. Similar results were obtained in 
transgenic mice transfected with the human growth hormone 
(hGH) gene. The biotechnology of transfection has been extended 
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to farm animals, and transgenic chickens, cows, pigs, rabbits, sheep, 
and even fish have been produced. 

The first animal cloned from an adult cell, a sheep named 
Dolly, represented a milestone in cloning technology. Subsequent 
accomplishments include incorporation of the human gene en- 
coding blood coagulation factor IX into sheep. Fetal sheep fi- 
broblast cells were transfected with the human factor IX gene, nu- 
clei from the transfected cells were transferred into sheep oocytes 
lacking nuclei, and these transgenic oocytes were placed in the 
uterus of receptive female sheep, which subsequently gave birth to 
transgenic lambs. The introduced factor IX transgene was specifi- 
cally designed so that factor IX protein, a medically useful product 
for the treatment of hemophiliacs, would be expressed in the milk 
of the transgenic sheep. Similar successes in cows, which produce 
much more milk, has brought the potential for commercial pro- 
duction of virtually any protein into the realm of reality. 

Transfection technology also holds promise as a mechanism for 
“gene therapy” by replacing defective genes in animals with func- 
tional genes (see Chapter 12). Problems concerning delivery, in- 
tegration and regulation of the transfected gene, including its ap- 
propriate expression in the right cells at the proper time during 
development and growth of the organism, must be brought under 
control before gene therapy becomes commonplace in humans. 
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mouse. Integration of the plasmid into the mouse genome can be ascertained by Southern blot analysis 
of DNA from the newborn mouse. Expression of the foreign gene can be determined by assaying for the 


gene product, in this case, rat growth hormone. (b) Photograph showing a transgenic mouse with an 
active rat growth hormone gene (left). This transgenic mouse is twice the size of a normal mouse (right). 





synthesis is more of a tolerance mechanism than a repair mechanism because it al- 
lows replication without necessarily repairing the damage. Specialized translesion 
DNA polymerases, such as DNA polymerase IV in E. coli and DNA polymerase 7 
(7 = eta) in humans, substitute for the replicative DNA polymerase in this process. 
Although translesion DNA polymerases tend to be error-prone, they have the ad- 
vantage of allowing DNA replication to continue. 

Human DNA replication has an error rate of about three base-pair mistakes dur- 
ing copying of the 6 billion base pairs in the diploid human genome. The low error 
rate is due to those DNA repair systems that review and edit the newly replicated 
DNA. Furthermore, about 10! bases (mostly purines) are lost per cell per day from 
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FIGURE 28.27 UV irradiation causes dimerization of adja- 
cent thymine bases. A cyclobutyl ring is formed between 
carbons 5 and 6 of the pyrimidine rings. Normal base 
pairing is disrupted by the presence of such dimers. 


spontaneous breakdown in human DNA; repair systems must replace these bases to 
maintain the fidelity of the encoded information. 


Mismatch Repair Corrects Errors Introduced During DNA Replication 


The mismatch repair system corrects errors introduced when DNA is replicated. It 
scans newly synthesized DNA for mispaired bases, excises the mismatched region, and 
then replaces it by DNA polymerase—mediated local replication. The key to such re- 
placement is to know which base of the mismatched pair is correct. 

The E. coli methyl-directed pathway of mismatch repair relies on methylation pat- 
terns in the DNA to determine which strand is the newly synthesized one and which 
one was the parental (template) strand. DNA methylation, often an identifying and 
characteristic feature of a prokaryote’s DNA, occurs just after DNA replication. Dur- 
ing methylation, methyl groups are added to certain bases along the new DNA 
strand. However, a window of opportunity exists between the start of methylation and 
the end of replication, when only the parental strand of a dsDNA is methylated. This 
window in time provides an opportunity for the mismatch repair system to review the 
dsDNA for mismatched bases that arose as a consequence of replication errors. By 
definition, the newly synthesized strand is the one containing the error, and the 
methylated strand is the one having the correct nucleotide sequence. When the 
methyl-directed mismatch repair system encounters a mismatched base pair, it 
searches along the DNA—through thousands of base pairs if necessary—until it finds 
a methylated base. 

The system identifies the strand bearing the methylated base as parental, as- 
sumes its sequence is the correct one, and replaces the entire stretch of nucleotides 
within the new strand from this recognition point to and including the mismatched 
base. Mismatch repair does this by using an endonuclease to cut the new, un- 
methylated strand and an exonuclease to remove the mismatched bases, creating a 
gap in the newly synthesized strand. DNA polymerase III holoenzyme then fills in 
the gap, using the methylated strand as template. Finally, DNA ligase reseals the 
strand. 


Damage to DNA by UV Light or Chemical Modification 
Can Also Be Repaired 


Repair of Pyrimidine Dimers Formed by UV Light UV irradiation promotes the for- 
mation of covalent bonds between adjacent thymine residues in a DNA strand, creat- 
ing a cyclobutyl ring (Figure 28.27). Because the C—C bonds in this ring are shorter 
than the normal 0.34-nm base stacking in B-DNA, the DNA is distorted at this spot 
and is no longer a proper template for either replication or transcription. Photolyase 
(also called photoreactivating enzyme), a flavin- and pterin-dependent enzyme, binds 
at the dimer and uses the energy of visible light to break the cyclobutyl ring, restor- 
ing the pyrimidines to their original form. 
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Excision Repair Replacement of chemically damaged or modified bases occurs via Damaged base 
two fundamental excision repair systems—base excision and nucleotide excision. 
Base excision repair acts on single bases that have been damaged through oxidation or 
other chemical modifications during normal cellular processes. The damaged base is 
removed by DNA glycosylase, which cleaves the glycosidic bond, creating an apurinic 
acid (AP) site where the sugar—phosphate backbone is intact but a purine (apurinic 
site) or a pyrimidine (apyrimidinic site) is missing. An AP endonuclease then cleaves the 
backbone, an exonuclease removes the deoxyribose-P and a number of additional 
residues, and the gap is repaired by DNA polymerase and DNA ligase (Figure 28.28). 
The information of the complementary strand is used to dictate which bases are 
added in refilling the gap. In E. coli, DNA polymerase I binds at the gap and moves 
in the 5'—>3' direction, removing nucleotides with its 5’-exonuclease activity. The 
5'>3' DNA polymerase activity of DNA polymerase I fills in the sequence behind the oe 
5'-exonuclease action. No net synthesis of DNA results, but this action of DNA po- on midinic 
lymerase I “edits out” sections of damaged DNA. Excision is coordinated with 5’3' endonuclease 
polymerase-catalyzed replacement of the damaged nucleotides so that DNA of the 
right sequence is restored. 

Nucleotide excision repair recognizes and repairs larger regions of damaged DNA 
than base excision repair. The nucleotide excision repair system cuts the sugar- 
phosphate backbone of a DNA strand in two places, one on each side of the lesion, 
and removes the region. The region removed in prokaryotic nucleotide excision re- 
pair spans 12 or 13 nucleotides; in eukaryotic excision repair, an oligonucleotide Excision 
stretch 27 to 29 units long is removed. The resultant gap is then filled in using DNA ere 
polymerase (DNA polymerase I in prokaryotes or DNA polymerase 6 or € and PCNA 
plus RFC in eukaryotes), and the sugar—phosphate backbone is covalently closed by ; I | 
DNA ligase. Í 

In mammalian cells, nucleotide excision repair is the main pathway for removal of i fil lal Hat 1 
carcinogenic (cancer-causing) lesions caused by sunlight or other mutagenic agents. i ae i 
Such lesions are recognized by XPA protein, named for xeroderma pigmentosum, an in- Polymerase 
herited human syndrome whose victims suffer serious skin lesions if exposed to sun- \ 
light. At sites recognized by XPA, a multiprotein endonuclease is assembled and the 
damaged strand is cleaved and repaired. 
















































































28:95 What Is the Molecular Basis of Mutation? 


Genes are normally transmitted unchanged from generation to generation, owing to 
the great precision and fidelity with which genes are copied during chromosome du- 
plication. However, on rare occasions, genetically heritable changes (mutations) occur 
and result in altered forms. Most mutated genes function less effectively than the un- 
altered, wild-type allele, but occasionally mutations arise that give the organism a se- 
lective advantage. When this occurs, they may be propagated to many offspring. To- 
gether with recombination, mutation provides for genetic variability within species FIGURE 28.28 Base excision repair. A damaged base 

: : . (m) is excised from the sugar-phosphate backbone by 

and, ultimately, the evolution of new species. i ae 
, : f TE DNA glycosylase, creating an AP site. Then, an apurinic/ 
Mutations change the sequence of bases in DNA, either by the substitution Of  apyrimidinic endonuclease severs the DNA strand, and 

one base pair for another (so-called point mutations) or by the insertion or deletion an excision nuclease removes the AP site and several 


of one or more base pairs (insertions and deletions). nucleotides. DNA polymerase | and DNA ligase then 
repair the gap. 











Point Mutations Arise by Inappropriate Base-Pairing 


Point mutations arise when a base pairs with an inappropriate partner. The two pos- 
sible kinds of point mutations are transitions, in which one purine (or pyrimidine) 
is replaced by another, as in AG (or T—C), and transversions, in which a purine 
is substituted for a pyrimidine, or vice versa. 

Point mutations arise by the pairing of bases with inappropriate partners dur- 
ing DNA replication, by the introduction of base analogs into DNA, or by chemi- 
cal mutagens. Bases may rarely mispair (Figure 28.29), either because of their 
tautomeric properties (see Chapter 10) or because of other influences. Even in 
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FIGURE 28.30 5-Bromouracil usually favors the keto 
tautomer that mimics the base-pairing properties of 
thymine, but it frequently shifts to the enol form, where- 
upon it can base-pair with guanine, causing a T-A to 
CG transition. 
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FIGURE 28.29 Point mutations due to base mispairings. (a) An example based on tautomeric properties. The 
rare imino tautomer of adenine base pairs with cytosine rather than thymine. (1) The normal A-T base pair. 

(2) The A*-C base pair is possible for the adenine tautomer in which a proton has been transferred from the 
6-NH) of adenine to N-1. (3) Pairing of C with the imino tautomer of A (A*) leads to a transition mutation (A-T to 
G-C) appearing in the next generation. (b) A in the syn conformation pairing with G (G is in the usual anti con- 
formation). (c) T and C form a base pair by H-bonding interactions mediated by a water molecule. 


mispairing, the C,’—C,' distances between bases must still be close to that of a 
Watson-Crick base pair (11 nm or so; see Figure 11.6) to maintain the mis- 
matched base pair in the double helix. In tautomerization, for example, an amino 
group (—NHbo), usually an H-bond donor, can tautomerize to an imino form 
(=NH) and become an H-bond acceptor. Or a keto group (C=O), normally an 
H-bond acceptor, can tautomerize to an enol C—OH, an H-bond donor. Proof- 
reading mechanisms operating during DNA replication catch most mispairings. 
The frequency of spontaneous mutation in both £. colt and fruit flies (Drosophila 
melanogaster) is about 107! per base pair per replication. 


Mutations Can Be Induced by Base Analogs 


Base analogs that become incorporated into DNA can induce mutations through 
changes in base-pairing possibilities. Two examples are 5-bromouracil (5-BU) and 
2-aminopurine (2-AP). 5-Bromouracil is a thymine analog and becomes inserted into 
DNA at sites normally occupied by T; its 5-Br group sterically resembles thymine’s 
5-methyl group. However, because 5-BU frequently assumes the enol tautomeric form 
and pairs with G instead of A, a point mutation of the transition type may be induced 
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(Figure 28.30). Less often, 5-BU is inserted into DNA at cytosine sites, not T sites. 
Then, if it base-pairs in its keto form, mimicking T, a C-G to T-A transition ensues. 
The adenine analog, 2-aminopurine (recall that adenine is 6-aminopurine) normally 
behaves like A and base-pairs with T. However, 2-AP can form a single H bond of suf- 
ficient stability with cytosine (Figure 28.31) that occasionally C replaces T in DNA 
replicating in the presence of 2-AP. Hypoxanthine (Figure 28.32) is an adenine ana- 
log that arises in situ in DNA through oxidative deamination of A. Hypoxanthine 
base-pairs with cytosine, creating an A-T to G-C transition. 


Chemical Mutagens React with the Bases in DNA 


Chemical mutagens are agents that chemically modify bases so that their base- 
pairing characteristics are altered. For instance, nitrous acid (HNOg:) causes the ox- 
idative deamination of primary amine groups in adenine and cytosine. Oxidative 
deamination of cytosine yields uracil, which base-pairs the way T does and gives a 
C-G to T-A transition (Figure 28.33a). Hydroxylamine specifically causes C-G to 
T-A transitions because it reacts specifically with cytosine, converting it to a deriva- 
tive that base-pairs with adenine instead of guanine (Figure 28.33c). Alkylating 
agents (Figure 28.33e) are also chemical mutagens. Alkylation of reactive sites on the 
bases to add methyl or ethyl groups alters their H bonding and hence base pairing. 
For example, methylation of O° on guanine (giving O®methylguanine) causes this 
G to mispair with thymine, resulting in a G-C to A-T transition (Figure 28.33d). 
Alkylating agents can also induce point mutations of the transversion type. Alkylation 
of N” of guanine labilizes its N-glycosidic bond, which leads to elimination of the 
purine ring, creating a gap in the base sequence. An enzyme, AP endonuclease, then 
cleaves the sugar—phosphate backbone of the DNA on the 5’-side, and the gap can 
be repaired by enzymatic removal of the 5’-sugar—phosphate and insertion of a new 
nucleotide (see Figure 28.28). A transversion results if a pyrimidine nucleotide is in- 
serted in place of the purine during enzymatic repair of this gap. 


Insertions and Deletions 


The addition or removal of one or more base pairs leads to insertion or deletion mu- 
tations, respectively. Either shifts the triplet reading frame of codons, causing 
frameshift mutations (misincorporation of all subsequent amino acids) in the protein 
encoded by the gene. Such mutations can arise if flat aromatic molecules such as acri- 
dine orange insert themselves between successive bases in one or both strands of the 
double helix. This insertion or, more aptly, intercalation, doubles the distance be- 
tween the bases as measured along the helix axis (see Figure 11.12). This distortion 
of the DNA results in inappropriate insertion or deletion of bases when the DNA is 
replicated. Disruptions that arise from the insertion of a transposon within a gene also 
fall in this category of mutation (see Figure 28.23). 
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Prions Are Proteins That Can Act as Genetic Agents 


DNA is the genetic material in organisms, although some viruses have RNA genomes. 
The idea that proteins could carry genetic information was considered early in the 
history of molecular biology and dismissed for lack of evidence. Prions may be an ex- 
ception to this rule. 

Prion is an acronym derived from the words proteinaceous infectious particle. The 
term prion was coined to distinguish such particles, which are pathogenic and thus 
capable of causing disease, from nucleic acid-containing infectious particles such 
as viruses and virions. Prions are transmissible agents (genetic material?) that are 
apparently composed only of a protein that has adopted an abnormal conforma- 
tion. They produce fatal degenerative diseases of the central nervous system in 
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FIGURE 28.31 (a) 2-Aminopurine normally base-pairs 
with T but (b) may also pair with cytosine through a 
single hydrogen bond. 
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FIGURE 28.32 Oxidative deamination of adenine in DNA 
yields hypoxanthine, which base-pairs with cytosine, 
resulting in an A-T to G-C transition. 
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FIGURE 28.33 Chemical mutagens. (a) HNO; (nitrous acid) Neg =o 
converts cytosine to uracil and adenine to hypoxanthine. 
(b) Nitrosoamines, organic compounds that react to form 
nitrous acid, also lead to the oxidative deamination of A and Alkyl . 
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C. (c) Hydroxylamine (NHOH) reacts with cytosine, convert- O o 
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with T. (e) Alkylating agents include nitrosoamines, nitro- O O 
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mustards. Note that nitrosoamines are mutagenic in two 
ways: They can react to yield HNO,, or they can act as 
alkylating agents. The nitrosoguanidine, N-methyl-N’-nitro- 
N-nitrosoguanidine, is a very potent mutagen used in labora- 
tories to induce mutations in experimental organisms such 
as Drosophila melanogaster. Ethylmethane sulfonate (EMS) 
and dimethyl sulfate are also favorite mutagens among 
geneticists. 
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mammals and are believed to be the agents responsible for the human diseases 
kuru, Creutzfeldt-Jakob disease, Gerstmann-Straussler-Sheinker syndrome, and fatal 
familial insomnia. Prions also cause diseases in animals, including scrapie (in 
sheep), “mad cow disease” (bovine spongiform encephalopathy), and chronic wast- 
ing disease (in elk and mule deer). All attempts to show that the infectivity of these 
diseases is due to a nucleic acid-carrying agent have been unsuccessful. Prion dis- 
eases are novel in that they are genetic and infectious; their occurrence may be spo- 
radic, dominantly inherited, or acquired by infection. 

PrP, the prion protein, comes in various forms, such as PrP‘, the normal cellular 
prion protein, and PrP*‘, the scrapie form of PrP, a conformational variant of PrP‘ 
that is protease resistant, sometimes written as PrP". These two forms are thought 
to differ only in terms of their secondary and tertiary structure. One model suggests 
that PrP‘ is dominated by a-helical elements (Figure 28.34a), whereas PrP* has both 
a-helices and -strands (Figure 28.34b). It has been hypothesized that the presence 
of PrP“ can cause PrP‘ to adopt the PrP* conformation. The various diseases are a 
consequence of the accumulation of the abnormal PrP* form, which accumulates as 
amyloid plaques (amyloid = starchlike), that cause destruction of tissues in the cen- 
tral nervous system. Ironically, recent evidence suggests that PrP? may function as a 
nucleic acid-binding protein. The 1997 Nobel Prize in Physiology or Medicine was 
awarded to Stanley B. Prusiner for his discovery of prions. 
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CENGAGENOW” ANIMATED FIGURE 28.34 Speculative 
models suggest that (a) PrP‘ is mostly a-helical, whereas 
(b) PrP% has both a-helices and B-strands. (Adapted from 
Figure 1 in Prusiner, S. B., 1996. Molecular biology and the patho- 
genesis of prion diseases. Trends in Biochemical Sciences 
21:482-487, See this figure animated at www.cengage 
.com/login. 








Gene Rearrangements and Immunology—ls It Possible to Generate 
Protein Diversity Using Genetic Recombination? 


Animals have evolved a way to exploit genetic recombination in order to generate 
protein diversity. This development was crucial to the evolution of the immune sys- 
tem. For example, the immunoglobulin genes are a highly evolved system for max- 
imizing protein diversity from a finite amount of genetic information. This diversity 
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A DEEPER LOOK 


Inteins—Bizarre Parasitic Genetic Elements Encoding a Protein-Splicing Activity 


Inteins are parasitic genetic elements found within Host gene and intein 
protein-coding regions of genes. These selfish DNA el- 


ements are transcribed and translated along with the DNA J Coding sequence | Intein [Coding sequence | 


flanking host gene sequences. The typical intein pro- 
tein consists of two domains: One domain is capable of 
self-catalyzed protein splicing; the other is an endo- v 
nuclease that mediates the insertion of the intein nu- MRNA 5 | | 3! 
cleotide sequence into host genes. After the full protein l i 

is synthesized, the intein catalyzes excision of itself from Translation 

the host protein and ligation of adjacent host polypep- Vv 
tide regions to form the functional protein that the Protein 
host gene encodes. These adjacent polypeptides are C Neren  [Intein ë Gextein E) 
termed exteins (“external proteins”) to distinguish 

them from the intein (“internal protein”). Inteins haye Protein Splicing: 
been found across all domains of life—archaea, bacte- 


Transcription 








Vv 
ria, and eukaryotes—although thus far only in unicel- N-extein n OF o OH Cepia 
lular organisms. Inteins vary in size from about 130 to E N z= 
600 amino acid residues. The protein splicing function — 

of inteins is found in its N-terminal and its C-terminal O H 


: : O “NH 
regions; the endonuclease function that carries out par- 2 


asitic insertion of the intein sequence into host genes is 1 | 
found in the central part of the intein. Splicing of the 
protein is an intramolecular process that liberates the Cy Cette 
intein sequence and ligates the host protein sequences O O 
(see accompanying figure). EN a) 

Inteins are usually found as inserts in highly conserved 2 H 
host genes that have essential functions, such as genes en- O NHB, 
coding DNA or RNA polymerases, proton-translocating 2 
ATPases, or other vital metabolic enzymes. Their location | 
in such genes means that removal of the intein via dele- 
tion or genetic rearrangement is more difficult. The endo- N-extein A 
nuclease activity of the intein recognizes a 14- to 40-base- | 
pair sequence in a potential host gene and cleaves the OH Q O Cextein 
DNA there. During repair of the double-stranded DNA H-N P 
break, the intein gene is copied into the cleavage site, 2 H 
thereby establishing the parasitic genetic element in the 
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> Transcription and translation of the combined intein-host 
gene flanking sequences leads to synthesis of a fused intein— 
extein protein. The intein splices itself out when (1) the 4 

C-terminal residue of the N-extein is shifted to the O (or S) 

atom of a neighboring intein Ser (or Cys) residue, (2) the 

N-extein C-terminal carbonyl undergoes nucleophilic attack OH O o OH : 

by the O (or S) atom of a Ser (or Cys) residue at the end of N-extein i C-extein 

the C-extein in a transesterification reaction that creates a HoN NH ac. =< 
branched protein intermediate, (3) cyclization of the intein N 

C-terminal asparagine residue excises the intein, and (4) the O H 

two exteins are properly united via a peptide bond when the 

N-extein C-terminus spontaneously shifts to the C-extein 

N-terminus to form an intact host protein. Excised Intein Intact Host Protein 


Adapted from Paulus, H., 2000. Protein splicing and related 
forms of protein autoprocessing. Annual Review of Biochemistry 
69:447-496; and Gogarten, J. P., et al., 2002. Inteins: Structure, 
function, and evolution. Annual Review of Microbiology 56:263-287. 











is essential for gaining immunity to the great variety of infectious organisms and for- 
eign substances that cause disease. 


Cells Active in the Immune Response Are Capable of Gene 
Rearrangement 


Only vertebrates show an immune response. If a foreign substance, called an 
antigen, gains entry to the bloodstream of a vertebrate, the animal responds via a 
protective system called the immune response. The immune response involves pro- 
duction of proteins capable of recognizing and destroying the antigen. This re- 
sponse is mounted by certain white blood cells—the B- and T-cell lymphocytes and 
the macrophages. B cells are so named because they mature in the bone marrow; 
T cells mature in the thymus gland. Each of these cell types is capable of gene re- 
arrangement as a mechanism for producing proteins essential to the immune re- 
sponse. Antibodies, which can recognize and bind antigens, are immunoglobulin 
proteins secreted from B cells. Because antigens can be almost anything, the im- 
mune response must have an incredible repertoire of structural recognition. Thus, 
vertebrates must have the potential to produce immunoglobulins of great diversity 
in order to recognize virtually any antigen. 


Immunoglobulin G Molecules Contain Regions of Variable 
Amino Acid Sequence 


Immunoglobulin G (IgG or y-globulin) is the major class of antibody molecules 
found circulating in the bloodstream. IgG is a very abundant protein, amounting to 
12 mg per mL of serum. It is a 150-kD a2Beo-type tetramer. The a or H (for heavy) 
chain is 50 kD; the 6 or L (for light) chain is 25 kD. A preparation of IgG from 
serum is heterogeneous in terms of the amino acid sequences represented in its L 
and H chains. However, the IgG L and H chains produced from any given B lym- 
phocyte are homogeneous in amino acid sequence. L chains consist of 214 amino 
acid residues and are organized into two roughly equal segments: the V, and C, re- 
gions. The V; designation reflects the fact that L chains isolated from serum IgG 
show variations in amino acid sequence over the first 108 residues, Vi, symbolizing 
this “variable” region of the L polypeptide. The amino acid sequence for residues 
109 to 214 of the L polypeptide is constant, as represented by its designation as the 
“constant light,” or Cı, region. The heavy, or H, chains consist of 446 amino acid 
residues. Like L chains, the amino acid sequence for the first 108 residues of H 
polypeptides is variable, ergo its designation as the Vy region, while residues 109 to 
446 are constant in amino acid sequence. This “constant heavy” region consists of 
three quite equivalent domains of homology designated Cyl, Cy2, and Cy3. Each 
L chain has two intrachain disulfide bonds: one in the V; region and the other in 
the Cr region. The C-terminal amino acid in L chains is cysteine, and it forms an in- 
terchain disulfide bond to a neighboring H chain. Each H chain has four intrachain 
disulfide bonds, one in each of the four regions. Figure 28.35 presents a diagram of 
IgG organization. Within the variable regions of the L and H chains, certain posi- 
tions are hypervariable with regard to amino acid composition. These hypervariable 
residues occur at positions 24 to 34, 50 to 55, and 89 to 96 in the L chains and at 
positions 31 to 35, 50 to 65, 81 to 85, and 91 to 102 in the H chains. The hypervari- 
able regions are also called complementarity-determining regions, or CDRs, because 
it is these regions that form the structural site that is complementary to some part 
of an antigen’s structure, providing the basis for antibody: antigen recognition. 

In the immunoglobulin genes, the arrangement of exons correlates with protein 
structure. In terms of its tertiary structure, the IgG molecule is composed of 12 dis- 
crete collapsed B-barrel domains. Within each domain, alternating -strands are anti- 
parallel to one another, a pattern known by the name Greek key motif: The charac- 
teristic structure of this domain is referred to as the immunoglobulin fold (Figure 
28.36). Each of IgG’s two heavy chains contributes four of these domains and each 
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FIGURE 28.35 Diagram of the organization of the IgG 
molecule. Two identical L chains are joined with two 
identical H chains. Each L chain is held to an H chain 
via an interchain disulfide bond. The variable regions 
(purple) of the four polypeptides lie at the ends of the 
arms of the Y-shaped molecule. These regions are re- 
sponsible for the antigen recognition function of the 
antibody molecules. The actual antigen-binding site is 
constituted from hypervariable residues within the V, 
and Vy regions. For purposes of illustration, some fea- 
tures are shown on only one or the other L chain or H 
chain, but all features are common to both chains. 
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CENGAGENOW”™ ACTIVE FIGURE 28.36 

The characteristic “collapsed B-barrel 
domain” known as the immunoglobulin 
fold. The B-barrel structures for both 

(a) variable and (b) constant regions are 
shown. (c) A schematic diagram of the 

12 collapsed B-barrel domains that make 
up an IgG molecule. CHO indicates the 
carbohydrate addition site; Fap denotes one 
of the two antigen-binding fragments of 
IgG, and F,, the proteolytic fragment con- 
sisting of the pairs of C42 and C43 domains. 
Test yourself on the concepts in this fig- 
ure at www.cengage.com/login. 


of its light chains contributes two. The four variableregion domains (one on each 
chain) are encoded by multiple exons, but the eight constant-region domains are 
each the product of a single exon. All of these constant-region exons are derived from 
a single ancestral exon encoding an immunoglobulin fold. The major variable- 
region exon probably derives from this ancestral exon also. Contemporary immuno- 
globulin genes are a consequence of multiple duplications of the ancestral exon. 
The discovery of variability in amino acid sequence in otherwise identical poly- 
peptide chains was surprising and almost heretical to protein chemists. For geneti- 
cists, it presented a genuine enigma. They noted that mammals, which can make 
millions of different antibodies, don’t have millions of different antibody genes. 
How can the mammalian genome encode the diversity seen in L and H chains? 


The Immunoglobulin Genes Undergo Gene Rearrangement 


The answer to the enigma of immunoglobulin sequence diversity is found in the or- 
ganization of the immunoglobulin genes. The genetic information for an immuno- 
globulin polypeptide chain is scattered among multiple gene segments along a chro- 
mosome in germline cells (sperm and eggs). During vertebrate development and the 
formation of B lymphocytes, these segments are brought together and assembled by 
DNA rearrangement (that is, genetic recombination) into complete genes. DNA re- 
arrangement, or gene reorganization, provides a mechanism for generating a variety 
of protein isoforms from a limited number of genes. DNA rearrangement occurs in 
only a few genes, namely, those encoding the antigen-binding proteins of the immune 
response—the immunoglobulins and the T-cell receptors. The gene segments encod- 
ing the amino-terminal portion of the immunoglobulin polypeptides are also unusu- 
ally susceptible to mutation events. The result is a population of B cells whose antibody- 
encoding genes collectively show great sequence diversity even though a given cell can 
make only a limited set of immunoglobulin chains. Hence, at least one cell among the 
B-cell population will likely be capable of producing an antibody that will specifically 
recognize a particular antigen. 


DNA Rearrangements Assemble an L-Chain Gene by Combining 
Three Separate Genes 


The organization of various immunoglobulin gene segments in the human genome 
is shown in Figure 28.37. L-chain variable-region genes are assembled from two kinds 
of germline genes: V; and Jı (/ stands for joining). In mammals, there are two differ- 
ent families of L-chain genes: the x, or kappa, gene family and the A, or lambda, gene 
family; each family has V and J members. These families are on different chromo- 
somes. Humans have 40 functional V genes and 5 functional J genes for the x light 
chains and 31 V genes and 4 J genes for the A light chain. The V and J genes lie up- 
stream from the single C, gene that encodes the L-chain constant region. Each V, 
gene has its own L, segment for encoding the L-chain leader peptide that targets the 
L chain to the endoplasmic reticulum for IgG assembly and secretion. (This leader 
peptide is cleaved once the L chain reaches the ER lumen.) The A family of L-chain 
genes is organized similarly (Figure 28.37). In different mature B-lymphocyte cells, 
V, and J, genes have joined in different combinations, and along with the C-V, 
gene, form complete L-V, chains with a variety of V, regions. However, any given 
B lymphocyte expresses only one V,-J,, combination. Construction of the mature 
B-lymphocyte L-chain gene has occurred by DNA rearrangements that combine three 
genes (L-V,.a, Jka Cka) to make one polypeptide! 


DNA Rearrangements Assemble an H-Chain Gene by Combining 
Four Separate Genes 


The first 98 amino acids of the 108-residue, H-chain variable region are encoded by 
a Vy gene. Each Vy gene has an accompanying Ly gene that encodes its essential 
leader peptide. It is estimated that there are 200 to 1000 Vy genes and that they can 
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FIGURE 28.37 Organization of human immunoglobulin 
gene segments. Green, orange, blue, or purple colors indi- 
cate the exons of a particular V, or V4 gene. (a) L-chain 
gene assembly: During B-lymphocyte maturation in the 
bone marrow, one of the 40 V genes combines with one 
of the 5 J genes and is joined with a C gene. During the 
recombination process, the intervening DNA between the 
gene segments is deleted (see Figure 28.39). These rear- 
rangements occur by a mostly random process, giving rise 
to many possible light-chain sequences from each gene 
family. (b) H-chain gene assembly: H chains are encoded 
by V,D,J,and C genes. In H-chain gene rearrangements, a 
D gene joins with a J gene and then one of the V genes 
adds to the DJ assembly. (Adapted from Figure 2b and c in 
Nossal, G. J.V., 2003. The double helix and immunology. Nature 
421:440-444,) 
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be subdivided into eight distinct families based on nucleotide sequence homology. 
The members of a particular Vy family are grouped together on the chromosome, 
separated from one another by 10 to 20 bp. In assembling a mature H-chain gene, 
a Vy gene is joined to a D gene (D for diversity), which encodes amino acids 99 to 
113 of the H chain. These amino acids comprise the core of the third CDR in the 
variable region of H chains. The Vy-D gene assemblage is linked in turn to a 
Ju gene, which encodes the remaining part of the variable region of the H chain. 
The Vy, D, and Ju genes are grouped in three separate clusters on the same chro- 
mosome. The four Jy genes lie 7 kb upstream of the eight C genes, the closest of 
which is C,. Any of four C genes may encode the constant region of IgG H chains: 
Cy, Cyoa, Cy2», and Cy3. Each C gene is composed of multiple exons (only C, is 
shown in Figure 28.37, none of the other C genes). Ten to twenty D genes are found 
1 to 80 kb farther upstream. The Vy genes lie even farther upstream. In B lympho- 
cytes, the variable region of an H-chain gene is composed of one each of the Ly-Vy 
genes, a D gene, and a Jy gene joined head to tail. Because the H-chain variable re- 
gion is encoded in three genes and the joinings can occur in various combinations, 
the H chains have a greater potential for diversity than the L-chain variable regions 
that are assembled from just two genes (for example, L,-V, and J,,). In making 
H-chain genes, four genes have been brought together and reorganized by DNA re- 
arrangement to produce a single polypeptide! 


V-J and V-D-J Joining in Light- and Heavy-Chain Gene Assembly 
Is Mediated by the RAG Proteins 


Specific nucleotide sequences adjacent to the various variable-region genes suggest 
a mechanism in which these sequences act as joining signals. All germline V and D 
genes are followed by a consensus CACAGTG heptamer separated from a consen- 
sus ACAAAAACC nonamer by a short, nonconserved 23-bp spacer. Likewise, all 
germline D and J genes are immediately preceded by a consensus GGTTTTTGT 
nonamer separated from a consensus CACTGTG heptamer by a short noncon- 
served 12-bp spacer (Figure 28.38). Note that the consensus elements downstream 
of a gene are complementary to those upstream from the gene with which it re- 
combines. Indeed, it is these complementary consensus sequences that serve as 
recombination signal sequences (RSSs) and determine the site of recombination 
between variable-region genes. Functionally meaningful recombination happens 
only where one has a 12-bp spacer and the other has a 23-bp spacer (Figure 28.38). 
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Lymphoid cell-specific recombination-activating gene proteins 1 and 2 (RAGI and 
RAG2) recognize and bind at these RSSs, presumably through looping out of the 
12- and 23-bp spacers and alignment of the homologous heptamer and nonamer re- 
gions (Figure 28.39). RAGI and RAG2 together function as the V(D)J recombinase. 
RAG1/RAG2 action cleaves and processes the ends of the V and J DNA, producing 
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FIGURE 28.39 Model for V(D)J recombination. A RAG1 : RAG2 complex is assembled on DNA in the region of 
recombination signal sequences (a), and this complex introduces double-stranded breaks in the DNA at the bor- 
ders of protein-coding sequences and the recombination signal sequences (b). The products of RAG1 : RAG2 DNA 
cleavage are novel: The DNA bearing the recombination signal sequences has blunt ends, whereas the coding 
DNA has hairpin ends. That is, the two strands of the V and J coding DNA segments are covalently joined as a result 
of transesterification reactions catalyzed by RAG1 : RAG2.To complete the recombination process, the two RSS 
ends are precisely joined to make a covalently closed circular dsDNA, but the V and J coding ends undergo further 
processing (c). Coding-end processing involves opening of the V and J hairpins and the addition or removal of 
nucleotides. This processing means that joining of the V and J coding ends is imprecise, providing an additional 
means for introducing antibody diversity. Finally, the V and J coding segments are then joined to create a recombi- 
nant immunoglobulin-encoding gene (d). The processing and joining reactions require RAG1 : RAG2, DNA- 
dependent protein kinase (DNA-PK)—Ku70, Ku80, and DNA ligase. (Adapted from Figure 1 in Weaver, D.T. and Alt, W, 
1997. From RAGs to stitches. Nature 388:428-429,) 
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FIGURE 28.38 Consensus elements are located above 
and below germline variable-region genes that recom- 
bine to form genes encoding immunoglobulin chains. 
These consensus elements are complementary and are 
arranged in a heptamer-nonamer, 12- to 23-bp spacer 
pattern. (Adapted from Tonewaga, S., 1983. Somatic generation 
of antibody diversity. Nature 302:575.) 















DNA-PK complex 


what is effectively a double-stranded break (DSB). Proteins involved in NHEJ-type 
repair of DSBs (Ku70/80, DNA-PK, and DNA ligase) bind at the DSB and religate 
the DNA to create a recombinant immunoglobulin gene (Figure 28.39). 


Imprecise Joining of Immunoglobulin Genes Creates New 


Joining of the ends of the immunoglobulin-coding regions during gene reorganiza- 
tion is somewhat imprecise. This imprecision actually leads to even greater antibody di- 
versity because new coding arrangements result. Position 96 in x chains is typically en- 
coded by the first triplet in the J,, element. Most « chains have one of four amino acids 
here, depending on which J, gene was recruited in gene assembly. However, occasion- 
ally only the second and third bases or just the third base of the codon for position 96 
is contributed by the J, gene, with the other one or two nucleotides supplied by the V, 
segment (Figure 28.40). So, the precise point where recombination occurs during 
gene reorganization can vary over several nucleotides, creating even more diversity. 


Antibody Diversity Is Due to Immunoglobulin Gene Rearrangements 


Taking as an example the mouse with perhaps 300 V, genes, 4 J, genes, 200 Vy 
genes, 12 D genes, and 4 Jy genes, the number of possible combinations is given by 
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94 95 96 97 
Val Gln 
Vic Ceipisi@was T/C T TIC GA 
Js A T G|G C AJA G C/|\T TG 
Ser Leu 
Coding Arrangements 
Val His 
Vic CAO C T TiC GA 
Jk A T GIG C AJA G CTT G 
Ser Leu 
Val His 
Vic CO T TiC GA 
Jk A T GIG C AJA G CTT G 
Arg Leu 
Val His 
Vic CO wuinncumia T/C G A 
Js A T GIG C AJA G CTT G 
Leu Leu 











FIGURE 28.40 Recombination between the V, and J, 
genes can vary by several nucleotides, giving rise to 
variations in amino acid sequence and hence diversity 


in immunoglobulin L chains. 


300 X 4 X 200 X 12 X 4. Thus, more than 10” different antibody molecules can be 
created from roughly 500 or so different mouse variable-region genes. Including 
the possibility for V,-J, joinings occurring within codons adds to this diversity, as 
does the high rate of somatic mutation associated with the variable-region genes. 
(Somatic mutations are mutations that arise in diploid cells and are transmitted to 
the progeny of these cells within the organism, but not to the offspring of the or- 
ganism.) Clearly, gene rearrangement is a powerful mechanism for dramatically en- 
hancing the protein-coding potential of genetic information. 


SUMMARY 


28.1 How Is DNA Replicated? 


DNA replication is accomplished 


polymerases cannot initiate DNA synthesis de novo—all require a 


through strand separation and the copying of each strand. Strand sepa- 
ration is achieved by untwisting the double helix. Each separated strand 
acts as a template for the synthesis of a new complementary strand 
whose nucleotide sequence is fixed by Watson—Crick base-pairing rules. 
Base pairing then dictates an accurate replication of the original DNA 
double helix. DNA replication follows a semiconservative mechanism 
where each original strand is copied to yield a complete complementary 
strand and these paired strands, one old and one new, remain together 
as a duplex DNA molecule. Replication begins at specific regions called 
origins of replication and proceeds in both directions. Bidirectional 
replication involves two replication forks, which move in opposite di- 
rections. Helicases unwind the double helix, and DNA gyrases act to 
overcome torsional stress by introducing negative supercoils at the ex- 
pense of ATP hydrolysis. Because DNA polymerases synthesize DNA 
only in a 5-53’ direction, replication is semidiscontinuous: The 3'—>5' 
strand can be copied continuously by DNA polymerase proceeding in 
the 5'>3' direction. The other parental strand is copied only when a 
sufficient stretch of its sequence has been exposed for DNA polymerase 
to move along it in the 5'—>3' mode. Thus, one parental strand is copied 
continuously to form the leading strand, while the other parental strand 
is copied in an intermittent, or discontinuous, mode to yield a set of 
Okazaki fragments that are joined later to give the lagging strand. 


28.2 What Are the Properties of DNA Polymerases? All DNA poly- 
merases share the following properties: (1) The incoming base is se- 
lected within the DNA polymerase active site through base-pairing with 
the corresponding base in the template strand, (2) chain growth is in 
the 5'-3’ direction antiparallel to the template strand, and (3) DNA 


primer with a free 3'-OH to build upon. DNA polymerase III holoen- 
zyme, the enzyme that replicates of the E. coli chromosome, is com- 
posed of ten different kinds of subunits. DNA polymerases are immobi- 
lized in replication factories. 


28.3 Why Are There So Many DNA Polymerases? Both prokaryotic 
and eukaryotic cells have a number of DNA polymerases. These differ- 
ent enzymes can be assigned to families based on sequence similarities. 
The various families of DNA polymerases fill different biological roles; 
the prominent roles include DNA replication, DNA repair, and telo- 
mere maintenance. All DNA polymerases share a common architecture 
resembling a right hand, composed of distinct finger, thumb, and palm 
structural domains, each serving a specific role in the polymerase 
reaction. 


28.4 How Is DNA Replicated in Eukaryotic Cells? Eukaryotic DNA is 
organized into chromosomes within the nucleus. These chromosomes 
must be replicated once (and only once!) each cell cycle. Progression 
through the cell cycle is regulated through checkpoints that control 
whether the cell continues into the next phase. Cyclins and CDKs main- 
tain these checkpoints. Replication licensing factors (MCM proteins) in- 
teract with origins of replication and render chromosomes competent 
for replication. Three DNA polymerases—a, ô, and e—carry out 
genome replication. DNA polymerase a initiates replication through 
synthesis of an RNA. DNA polymerase 6 is the principal DNA polym- 
erase in eukaryotic DNA replication. 


28.5 How Are the Ends of Chromosomes Replicated? Telomeres are 
short, tandemly repeated, G-rich nucleotide sequences that form pro- 


tective caps on the chromosome ends. DNA polymerases cannot repli- 
cate the extreme 5’-ends of chromosomes, but a special polymerase 
called telomerase maintains telomere length. Telomerase is a ribo- 
nucleoprotein, and its RNA component serves as template for telomere 
synthesis. 


28.6 How Are RNA Genomes Replicated? Many viruses have genomes 
composed of RNA, not DNA. DNA may be an intermediate in the repli- 
cation of such viruses; that is, viral RNA serves as the template for DNA 
synthesis. This reaction is catalyzed by reverse transcriptase, an RNA- 
dependent DNA polymerase. 


28.7 How Is the Genetic Information Shuffled by Genetic Recombina- 
tion? Genetic recombination is the exchange (or incorporation) of one 
DNA sequence with (or into) another. Recombination between very simi- 
lar DNA sequences is called homologous recombination. Homologous re- 
combination proceeds according to the Holliday model. The RecBCD en- 
zyme complex unwinds dsDNA and cleaves its single strands. RecA protein 
acts in recombination to catalyze the ATP-dependent DNA strand ex- 
change reaction. Procession of strand separation and re-pairing into hy- 
brid strands along the DNA duplex initiates branch migration, displacing 
the homologous DNA strand from the DNA duplex and replacing it with 
the ssDNA strand. RuvA, RuvB, and RuvC resolve the Holliday junction to 
form the recombination products. DNA replication is an essential compo- 
nent of both DNA recombination and DNA repair processes. Further- 
more, recombination mechanisms can restart replication forks that have 
halted at breaks or other lesions in the DNA strands. 

Transposons are mobile DNA segments ranging in size from several 
hundred base pairs to more than 8 kbp that move enzymatically from 
place to place in the genome. 


28.8 Can DNA Be Repaired? Repair systems correct damage to DNA 
in order to maintain its information content. The most common forms 
of damage are (1) replication errors, (2) deletions or insertions, (3) UV- 
induced alterations, (4) DNA strand breaks, and (5) covalent crosslink- 
ing of strands. Cells have extraordinarily diverse and effective DNA re- 
pair systems to deal with these problems, some of which are also 
involved in DNA replication and recombination. When repair systems 
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fail, the genome may still be preserved if an “error-prone” mode of 
replication allows the lesion to be bypassed. 


28.9 What Is the Molecular Basis of Mutation? Mutations change the 
sequence of bases in DNA, either by the substitution of one base pair for 
another (point mutations) or by the insertion or deletion of one or 
more base pairs. Point mutations arise by the pairing of bases with in- 
appropriate partners during DNA replication, by the introduction of 
base analogs into DNA, or by chemical mutagens. Chemical mutagens 
are agents that chemically modify bases so that their base-pairing char- 
acteristics are altered. 


28.10 Do Proteins Ever Behave as Genetic Agents? DNA is the ge- 
netic material in organisms, although some viruses have RNA genomes. 
The possibility that proteins carry genetic information was a point of 
speculation early in the history of molecular biology but was ultimately 
discounted for lack of evidence. Prions (an acronym for proteinaceous 
infectious particle) may be an exception. Prion particles are devoid of 
nucleic acid, yet they can transmit disease. Prion diseases are novel be- 
cause they may be either inherited (like a genetic agent) or acquired by 
infection. PrP, the prion protein, comes in several forms: PrP*, the nor- 
mal cellular protein, and a conformational variant of PrP° known as 
PrP* or PrP found in association with prion diseases. The propensity 
of the PrP% conformational form to polymerize into cell-destructive ag- 
gregates is thought to be the basis of prion diseases. 


Special Focus: Gene Rearrangements and Immunology—ls It Possible 
to Generate Protein Diversity Using Genetic Recombination? Animals 
have evolved a way to exploit genetic recombination in order to generate 
protein diversity. The immunoglobulin genes are a highly evolved system 
for maximizing protein diversity from a finite amount of genetic infor- 
mation. Cells active in the immune response are capable of gene re- 
arrangements. The antibody diversity found in IgG molecules is a prime 
example of proteins produced via gene rearrangements. IgG L-chain 
genes are created by combining three separate genes, and H-chain genes 
by combining four. V-J and V—D-]J joining in L- and H-chain gene assem- 
bly is mediated by RAG proteins. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. If #N-labeled E. coli DNA has a density of 1.724 g/mL, 'N-labeled 
DNA has a density of 1.710 g/mL, and E. coli cells grown for many 
generations on 'NH,* as a nitrogen source are transferred to media 
containing '°NH,* as the sole N-source, (a) what will be the density 
of the DNA after one generation, assuming replication is semicon- 
servative? (b) Supposing replication took place by a dispersive mech- 
anism, what would be the density of DNA after one generation? 
(c) Design an experiment to distinguish between semiconservative 
and dispersive modes of replication. 

2. (a) What are the respective roles of the 5’-exonuclease and 3’- 
exonuclease activities of DNA polymerase I? (b) What might be a fea- 
ture of an E. coli strain that lacked DNA polymerase I 3'-exonuclease 
activity? 

3. Assuming DNA replication proceeds at a rate of 750 base pairs per 
second, calculate how long it will take to replicate the entire E. coli 
genome. Under optimal conditions, E. coli cells divide every 20 min- 
utes. What is the minimal number of replication forks per E. coli 
chromosome in order to sustain such a rate of cell division? 

4. On the basis of Figure 28.2, draw a simple diagram illustrating 
replication of the circular E. coli chromosome (a) at an early stage, 
(b) when one-third completed, (c) when two-thirds completed, 
and (d) when almost finished, assuming the initiation of replica- 
tion at oriC has occurred only once. Then, draw a diagram showing 
the E. coli chromosome in problem 3 where the E. coli cell is 
dividing every 20 minutes. 


5. It is estimated that there are forty molecules of DNA polymerase III 
per E. coli cell. Is it likely that the growth rate of E. coli is limited by 
DNA polymerase III availability? 

6. Approximately how many Okazaki fragments are synthesized in the 
course of replicating an E. coli chromosome? How many in repli- 
cating an “average” human chromosome? 

7. How do DNA gyrases and helicases differ in their respective func- 
tions and modes of action? 

8. Assuming DNA replication proceeds at a rate of 100 base pairs 
per second in human cells and origins of replication occur every 
300 kbp, how long would it take to replicate the entire diploid 
human genome? How many molecules of DNA polymerase does 
each cell need to carry out this task? 

9. From the information in Figure 28.17, diagram the recombina- 
tional event leading to the formation of a heteroduplex DNA re- 
gion within a bacteriophage chromosome. 

10. Homologous recombination in £E. coli leads to the formation of 
regions of heteroduplex DNA. By definition, such regions con- 
tain mismatched bases. Why doesn’t the mismatch repair system 
of E. coli eliminate these mismatches? 


11. If RecA protein unwinds duplex DNA so that there are about 18.6 bp 
per turn, what is the change in Ad, the helical twist of DNA, com- 
pared to its value in B-DNA? 

12. Diagram a Holliday junction between two duplex DNA molecules 
and show how the action of resolvase might give rise to either patch 
or splice recombinant DNA molecules. 
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13. Show the nucleotide sequence changes that might arise in a dsDNA 
(coding strand segment GCTA) upon mutagenesis with (a) HNO», 
(b) bromouracil, and (c) 2-aminopurine. 

14. Transposons are mutagenic agents. Why? 


15. Give a plausible explanation for the genetic and infectious proper- 
ties of PrP*. 


16. Hexameric helicases, such as DnaB, the MCM proteins, and papil- 
loma virus El helicase (illustrated in Figures 16.23-16.25), unwind 
DNA by passing one strand of the DNA duplex through the central 
pore, using a mechanism based on ATP-dependent binding interac- 
tions with the bases of that strand. The genome of E. coli K12 consists 
of 4,686,137 nucleotides. Assuming that DnaB functions like papil- 
loma virus E1 helicase, from the information given in Chapter 16 on 
ATP-coupled DNA unwinding, calculate how many molecules of ATP 
would be needed to completely unwind the E. coli K12 chromosome. 

17. Asako Furukohri, Myron F. Goodman, and Hisaji Maki wanted to dis- 
cover how the translesion DNA polymerase IV takes over from DNA 
polymerase II at a stalled replication fork (see Journal of Biological 
Chemistry 283:11260-11269, 2008). They showed that DNA poly- 
merase IV could displace DNA polymerase III from a stalled replica- 
tion fork formed in an in vitro system containing DNA, DNA poly- 
merase III, the B-clamp, and SSB. Devise your own experiment to 
show how such displacement might be demonstrated. (Hint: Assume 


that you have protein identification tools that allow you to distinguish 
easily between DNA polymerase II and DNA polymerase IV.) 

18. The eukaryotic translesion DNA polymerases fall into the Y family 
of DNA polymerases. Structural studies reveal that their fingers and 
thumb domains are small and stubby (see Figure 28.10). In addi- 
tion, ¥family polymerase active sites are more open and less con- 
strained where base pairing leads to selection of a dNTP substrate 
for the polymerase reaction. Discuss the relevance of these struc- 
tural differences. Would you expect ¥Yfamily polymerases to have 
3’-exonuclease activity? Explain your answer. 


Preparing for the MCAT Exam 


19. Figure 28.11 depicts the eukaryotic cell cycle. Many cell types “exit” 

the cell cycle and don’t divide for prolonged periods, a state termed 

Go; some, for example neurons, never divide again. 

a. In what stage of the cell cycle do you suppose a cell might be when 
it exits the cell cycle and enters Go? 

b. The cell cycle is controlled by checkpoints, cyclins, and CDKs. De- 
scribe how biochemical events involving cyclins and CDKs might 
control passage of a dividing cell through the cell cycle. 


20. Figure 28.40 gives some examples of recombination in IgG codons 
95 and 96, as specified by the V, and J, genes. List the codon possi- 
bilities and the amino acids encoded if recombination occurred in 
codon 97. Which of these possibilities is less desirable? 


FURTHER READING 


General 

Holliday, R., 1964. A mechanism for gene conversion in fungi. Genetic 
Research 5:282-304. The classic model for the mechanism of DNA 
strand exchange during homologous recombination. 

Kornberg, A., 2005. DNA Replication, 2nd ed., New York: Macmillan. A 
comprehensive detailed account of the enzymology of DNA metab- 
olism, including replication, recombination, repair, and more. 

Lewin, B., 2007. Genes IX. Sudbury, MA: Jones and Bartlett. A contem- 
porary genetics text that seeks to explain heredity in terms of mo- 
lecular structures. 

Meselson, M., and Stahl, F. W., 1958. The replication of DNA in Escherichia 
coli. Proceedings of the National Academy of Sciences U.S.A. 44:671-682. 
The classic paper showing that DNA replication is semiconservative. 

Meselson, M., and Weigle, J. J., 1961. Chromosome breakage accompa- 
nying genetic recombination in bacteriophage. Proceedings of the 
National Academy of Sciences U.S.A. 47:857-869. The experiments 
demonstrating that physical exchange of DNA occurs during re- 
combination. 

Ogawa, T., and Okazaki, T., 1980. Discontinuous DNA replication. An- 
nual Review of Biochemistry 49:421—-457. Okazaki fragments and their 
implications for the mechanism of DNA replication. 

Palmiter, R. D., et al., 1982. Dramatic growth of mice that develop from 
eggs microinjected with metallothionein-growth hormone fusion 
genes. Nature 300:611-615. 


DNA Replication 

Baker, T. A., and Bell, S. P., 1998. Polymerases and the replisome: Ma- 
chines within machines. Cell 92:295-305. 

Bell, S. P., and Dutta, A., 2002. DNA replication in eukaryotic cells. An- 
nual Review of Biochemistry '71:333-374. 

Blow, J. J., and Dutta, A., 2005. Preventing re-replication of chromoso- 
mal DNA. Nature Reviews Molecular Cell Biology 6:476—486. 

Botchan, M., 2007. A switch for S phase. Nature 445:272-274. 

Cvetic, C. A., and Walter, J. C., 2006. Getting a grip on licensing: Mech- 
anism of stable MCM2-7 loading onto replication origins. Cell 21: 
143-148. 

Franklin, M. C., Wang, J., and Steitz, T. A., 2001. Structure of the repli- 
cating complex of a pol a family DNA polymerase. Cell 105:657—-667. 

Frick, D. N., and Richardson, C. C., 2001. DNA primases. Annual Review 
of Biochemistry 70:39-80. 


Goodman, M. F., 2002. Error-prone repair DNA polymerases in prokary- 
otes and eukaryotes. Annual Review of Biochemistry 71:17-50. 

Hubscher, U., Maga, G., and Spadari, S., 2002. Eukaryotic DNA po- 
lymerases. Annual Review of Biochemistry '71:133-163. 

Keck, J. L., 2000. Structure of the RNA polymerase domain of the E. coli 
primase. Science 287:2482-2486. 

Kool, E. T., 2002. Active site tightness and substrate fit in DNA replica- 
tion. Annual Review of Biochemistry 71:191-219. 

Leu, F. P., Georgescu, R., and O’Donnell, M., 2003. Mechanism of the 
E. coli T processivity switch during lagging-strand synthesis. Molecular 
Cell 11:315-327. 

Machida, Y. J., and Dutta, A., 2005. Cellular checkpoint mechanisms 
monitoring proper initiation of DNA replication. Journal of Biologi- 
cal Chemistry 280:6253-6256. 

Marians, K. J., 2008. Understanding how the replisome works. Nature 
Structural and Molecular Biology 15:125-127. 

McHenry, C., 2003. Chromosomal replicases as asymmetric dimers: 
Studies of subunit arrangement and functional consequences. Mo- 
lecular Microbiology 49:1157-1165. 

Pomerantz, R. T., and O’Donnell, M., 2007. Replisome mechanics: In- 
sights into a twin DNA polymerase machine. Trends in Microbiology 
15:156-164. 

Randell, J. C. W., Bowers, J. L., Rodriguez, H. K., and Bell, S. P., 2006. 
Sequential ATP hydrolysis by Cdc6 and ORC directs loading of the 
Mcm2-7 helicase. Molecular Cell 21:29-39. 

Rothwell, P. J., and Waksman, G., 2005. Structure and mechanism of 
DNA polymerases. Advances in Protein Chemistry '71:401-440. 

Steitz, T. A., 1998. A mechanism for all polymerases. Nature 391:231-232. 

Tye, B. K., 1999. MCM proteins in DNA replication. Annual Review of Bio- 
chemistry 68:649-686. 


Protein Rings in DNA Metabolism 

Hingorani, M. M., and O’Donnell, M., 2000. A tale of toroids in DNA 
metabolism. Nature Reviews Molecular Cell Biology 1:22-30. 

Wyman, C., and Botchan, M., 1995. DNA replication: A familiar ring to 
DNA polymerase processivity. Current Biology 5:334-337. 


Telomerase 
Blackburn, E. H., 1992. Telomerases. Annual Review of Biochemistry 61: 
113-129. 


Collins, K., 1999. Ciliate telomerase biochemistry. Annual Review of Bio- 
chemistry 68:187-218. 

Kim, N. W., 1994. Specific association of human telomerase activity with 
immortal cells and cancer. Science 266:2011-2015. 

Nakamura, T. M., et al., 1997. Telomerase catalytic subunit homologs 
from fission yeast and human. Science 277:955-959. 


Prions 

Cohen, F. E., and Prusiner, S. B., 1998. Pathological conformations of 
prion proteins. Annual Review of Biochemistry 67:793-819. 

Prusiner, S. B., 1996. Molecular biology and pathogenesis of prion dis- 
eases. Trends in Biochemical Sciences 21:482-487. 

Prusiner, S. B., 1997. Prion diseases and the BSE crisis. Science 278: 
245-251. 


Recombination 

Alberts, B., 2003. DNA replication and recombination. Nature 421: 
431-435. 

Anderson, D. G., and Kowalczykowski, S. C., 1997. The translocating 
RecBCD enzyme stimulates recombination by directing RecA pro- 
tein onto ssDNA in a x-regulated manner. Cell 90:77-86. 

Baumann, P., and West, S. C., 1998. Role of the human RAD51 protein 
in homologous recombination and double-stranded-break repair. 
Trends in Biochemical Sciences 23:247-252. 

Beernink, H. T. H., and Morrical, S. W., 1999. RMPs: Recombination/ 
replication proteins. Trends in Biochemical Sciences 24:385-389. 

Chen, Z., Yang, H., and Pavletich, N. P., 2008. Mechanism of homolo- 
gous recombination from the RecA-ssDNA/dsDNA structures. Na- 
ture 453:489-494. 

Cox, M. M., 2007. Motoring along with the bacterial RecA protein. Na- 
ture Reviews Molecular Cell Biology 8:127-138. 

Haber, J. E., 1999. DNA recombination: The replication connection. 
Trends in Biochemical Sciences 24:271-275. 

Kowalczykowski, S. C., 2000. Initiation of genetic recombination and 
recombination-dependent replication. Trends in Biochemical Sciences 
25:156-165. 

Krishna, R., Prabu, J. R., Manjunath, G. P., Datta, S., et al., 2007. Snap- 
shots of RecA protein involving movement of the C-domain and dif- 
ferent conformations of the DNA-binding loops: Crystallographic 
and comparative analysis of 11 structures of Mycobacterium smegmatis 
RecA. Journal of Molecular Biology 367:1130-1144. 

Lovett, S. T., 2003. Connecting replication and recombination. Molecu- 
lar Cell 11:554-556. 

Lusetti, S. L., and Cox, M. M., 2002. The bacterial RecA protein and the 
recombinational DNA repair of replication forks. Annual Review of 
Biochemistry '71:71-100. 

Rafferty, J. B., et al., 1996. Crystal structure of DNA recombination pro- 
tein RuvA and a model for its binding to the Holliday junction. Sci- 
ence 274:415-421. 

Roca, A. I., and Cox, M. M., 1997. RecA protein: Structure, function, 
and role in recombinational DNA repair. Progress in Nucleic Acid Re- 
search and Molecular Biology 56:127-223. 

Taylor, A. F., and Smith, G. R., 2003. RecBCD enzyme is a DNA helicase 
with fast and slow motors of opposite polarity. Nature 423:889-893. 
See also Dillingham, M. S., Spies, M., and Kowalczykowski, S. C., 
2003. RecBCD is a bipolar DNA helicase. Nature 423:893-897. 

Wigley, D. B., 2007. RecBCD: The supercar of DNA repair. Cell 131: 
651-653. 


Further Reading 905 


Yamada, K., Ariyoshi, M., and Morikawa, K., 2004. Three-dimensional 
structural views of branch migration and resolution in DNA homolo- 
gous recombination. Current Opinion in Structural Biology 14:130-137. 


Transposons 

Lambowitz, A. M., and Belfort, M., 1993. Introns as mobile genetic ele- 
ments. Annual Review of Biochemistry 62:587-622. 

Stellwagen, A. E., and Craig, N. L., 1998. Mobile DNA elements: Control- 
ling transposition with ATP-dependent molecular switches. Trends in 
Biochemical Sciences 23:486—-490. 


V(D)J Recombination and the Immunoglobulin Genes 

Gellert, M., 2002. V(D)J recombination: RAG proteins, repair factors, 
and regulation. Annual Review of Biochemistry 71:101-132. 

Hiom, K., and Gellert, M., 1997. A stable RAG1-RAG2-DNA complex 
that is active in V(D)J cleavage. Cell 88:65-72. 

Lewis, S. M., and Wu, G. E., 1997. The origins of V(D)J recombination. 
Cell 88:159-162. 

Nossal, G. J. V., 2003. The double helix and immunology. Nature 421: 
440-444. 


Transgenic Animals 

Morgan, R. A., and Anderson, W. F., 1993. Human gene therapy. Annual 
Review of Biochemistry 62:192-217. 

Schnieke, A. E., et al., 1997. Human factor IX transgenic sheep pro- 
duced by transfer of nuclei from transfected fetal fibroblasts. Science 
278:2130-2133. 

Wilmut, I., et al., 1997. Viable offspring derived from fetal and adult 
mammalian cells. Nature 385:810-818. See also Campbell, K. H. S., 
et al., 1996. Sheep cloned by nuclear transfer from a cultured cell 
line. Nature 380:64—-66. 


Repair 

Bartek, J., and Lukas, J., 2003. Damage alert. Nature 421:486—-488. 

Friedberg, E. C., 2003. DNA damage and repair. Nature 421:436-440. 

Friedberg, E. C., Walker, G. C., and Siede, W., 1995. DNA Repair and Mu- 
tagenesis. Washington, DC: ASM Press. 

Marians, K. J., 2000. PriA-directed replication fork restart in Escherichia 

coli. Trends in Biochemical Sciences 25:185-189. 

McCollough, A. K., et al., 1999. Initiation of base excision repair: Gly- 

cosylase mechanisms and structures. Annual Review of Biochemistry 

68:255-285. 

Michel, B., 2000. Replication fork arrest and DNA recombination. 

Trends in Biochemical Sciences 25:173-178. 

Modrich, P., and Lahue, R., 1996. Mismatch repair in replication fi- 

delity, genetic recombination, and cancer biology. Annual Review of 

Biochemistry 65:101-133. 

Mol, C. D., et al., 1999. DNA repair mechanisms for the recognition and 

removal of damaged DNA bases. Annual Review of Biophysics and Bio- 

molecular Structure 28:101—128. 

Morgan, A. R., 1993. Base mismatches and mutagenesis: How important 
is tautomerism? Trends in Biochemical Sciences 18:160-163. 

Parikh, S. S., et al., 1999. Envisioning the molecular choreography of 
DNA base excision repair. Current Opinion in Structural Biology 9:37-47. 

Sancar, A., 1994. Mechanisms of DNA excision repair. Science 266: 
1954-1956. (Science named the extended family of DNA repair en- 
zymes its “Molecules of the Year” in 1994. See the 23 December 
1994 issue of Science for additional readings.) 





Mems hA 
AIIE AAE 
BLot MAD E Bivens 
33 Se Bunche beaten an 
YN cob Wise. 
cine 


sji TEENA 1) (sya an ufsa Rks 
NE E AE A uae méfu X a a ats 
rase eh b aA aran Ey 
15 pL lee ENG 4 Wo EII MIVAN 
anr ane Bsa tales ARIES 
Ue ted: AN ISIC PAW SigA < 
Blak le ovine bau 
mrs 4} ARAT eia bsp |rt Si oa 
» wt 629 Pies | wle a9 PMR MN 
The Rosetta stone, inscribed in 196 B.c. The writing 
on the Rosetta stone is in three forms: hieroglyphs, 
Demotic (the conventional Egyptian script of the 
time), and Greek (the Greeks ruled Egypt in 196 B.c.). 
The Rosetta stone represents the transcription of 
hieroglyphic symbols into two living languages. 
Shown here is part of the interface where hieroglyphs 
and Demotic meet. 


Aad heen MEP 





British Museum, UK/Bridgeman Art Library 





Now that we have all this useful information, it 
would be nice to do something with it. 
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Transcription 
and the Regulation 
of Gene Expression 


ESSENTIAL QUESTION 


Expression of the information encoded in DNA depends on transcription of that infor- 
mation into RNA. 


How are the genes of prokaryotes and eukaryotes transcribed to form RNA 
products that can be translated into proteins? 


All cells contain three major classes of RNA—mRNA, ribosomal RNA (rRNA), 
and transfer RNA (tRNA) and all participate in protein synthesis (see Chapters 
10 and 30). Further, all RNAs are synthesized from DNA templates by DNA- 
dependent RNA polymerases in the process known as transcription. However, 
only mRNAs direct the synthesis of proteins. Protein synthesis occurs via the 
process of translation, wherein the instructions encoded in the sequence of bases 
in mRNA are translated into a specific amino acid sequence by ribosomes, the 
“workbenches” of polypeptide synthesis (see Chapter 30). 

Transcription is tightly regulated in all cells. In prokaryotes, only 3% or so of 
the genes are undergoing transcription at any given time. The metabolic condi- 
tions and the growth status of the cell dictate which gene products are needed at 
any moment. Similarly, differentiated eukaryotic cells express only a small per- 
centage of their genes in fulfilling their biological functions, not the full genetic 
potential encoded in their chromosomes. 


29s1n How Are Genes Transcribed in Prokaryotes? 


In prokaryotes, virtually all RNA is synthesized by a single species of DNA- 
dependent RNA polymerase. The only exception is the short RNA primers 
formed by primase during DNA replication. Like DNA polymerases, RNA po- 
lymerase links ribonucleoside 5'-triphosphates (ATP, GTP, CTP, and UTP, repre- 
sented generically as NTPs) in an order specified by base pairing with a DNA 
template: 


n NTP —> (NMP) „ + n PP; 


The enzyme moves along a DNA strand in the 3’->5’ direction, joining the 
5'-phosphate of an incoming ribonucleotide to the 3’-OH of the previous residue. 
Thus, the RNA chain grows 5'—>3' during transcription, just as DNA chains do dur- 
ing replication. Subsequent hydrolysis of PP; to inorganic phosphate by the pyro- 
phosphatases present in all cells removes the product PP;, thus making the poly- 
merase reaction thermodynamically favorable. 


Prokaryotic RNA Polymerases Use Their Sigma Subunits to Identify 
Sites Where Transcription Begins 


Transcription is initiated in prokaryotes by RNA polymerase holoenzyme, a complex 
multimeric protein (about 400 kD) large enough to be visible in the electron micro- 
scope. Its subunit composition is a »B8B'o. After two a-subunits (35 kD each) dimerize, 
one recruits 6 and the other f’ to form the clawlike core polymerase structure (Fig- 
ure 29.1). The two largest subunits, 8’ (171 kD) and £ (124 kD), perform most of the 
enzymatic functions. The 6-subunit forms most of the upper jaw of the claw and con- 
tains the catalytic Mg?*-binding site; B’ forms the lower jaw. DNA passes through a 
2.7-nm channel between the jaws of the claw. Nucleotide substrates reach the 
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Certain conventions are useful in tracing the course of infor- 
mation transfer from DNA to protein. The strand of duplex 
DNA that is read by RNA polymerase is termed the template 
strand. Thus, the strand that is not read is the nontemplate 
strand. Because the template strand is read by the RNA poly- 
merase moving 3’-5’ along it, the RNA product, called the 
transcript, grows in the 5’—3’ direction (see accompanying 
figure). Note that the nontemplate strand has a nucleotide se- 
quence and direction identical to those of the RNA transcript, 
except that the transcript has U residues in place of T. Portions 
of the RNA transcript will eventually be translated into the 
amino acid sequence of a protein (see Chapter 30) by a 
process in which successive triplets of bases (termed codons), 
read 5'—>3', specify a particular amino acid. Polypeptide chains 
are synthesized in the NC direction, and the 5’-end of 
mRNA encodes the N-terminus of the protein. 

By convention, when the order of nucleotides in DNA is 
shown as a single strand, it is the 5'—>3' sequence of nucleo- 
tides in the nontemplate strand that is presented. Conse- 
quently, if convention is followed, DNA sequences are written 
in terms that correspond directly to mRNA sequences, which 
correspond in turn to the amino acid sequences of proteins as 
read beginning with the N-terminus. 
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catalytic center through a secondary channel entering on the back side of the en- 
zyme. Binding of the o-subunit to B’ allows the RNA polymerase to recognize differ- 
ent DNA sequences that act as promoters. A number of related proteins, the sigma 
(@) factors, can serve as the o-subunit. Promoters are nucleotide sequences that 
identify the location of transcription start sites, where transcription begins. Both 6 and 
B’ contribute to formation of the catalytic site for RNA synthesis. Dissociation of the 
o-subunit from the holoenzyme leaves the core polymerase (a8'), which can tran- 
scribe DNA into RNA but is unable to recognize promoters and initiate transcription. 
Bacteriophage T7 expresses a simpler (monomeric) RNA polymerase (Figure 29.2) 
that shares the functional characteristics of prokaryotic RNA polymerases. 


FIGURE 29.1 Structure of the Thermus thermophilus core 
RNA polymerase a28’ (pdb id = 2051).The template 
DNA strand is shown in green, the nontemplate DNA 
strand in blue, and the RNA transcript in hot pink. The 
two a chains are orange, the B chain is cyan, and the 

B’ chain is yellow. The active-site Mg?* is shown as a 
red sphere. 
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FIGURE 29.2 Bacteriophage T7 RNA polymerase (pdb id 
= 1MSW) in the act of transcription. T7 RNA polymerase 
is a 99-kD monomeric protein. The DNA is shown enter- 
ing the enzyme from the upper right. The template 
strand is green, the nontemplate strand is blue, the RNA 
transcript is hot pink. 





The Process of Transcription Has Four Stages 


Transcription can be divided into four stages: (1) binding of RNA polymerase 
holoenzyme at promoter sites, (2) initiation of polymerization, (3) chain elonga- 
tion, and (4) chain termination. 


Binding of RNA Polymerase to Template DNA The process of transcription begins 
when the o-subunit of RNA polymerase recognizes a promoter sequence (Figure 
29.3), and RNA polymerase holoenzyme and the promoter form a closed promoter 
complex (Figure 29.3, Step 2). This stage in RNA polymerase: DNA interaction is re- 
ferred to as the closed promoter complex because the dsDNA has not yet been 
“opened” (unwound) so that the RNA polymerase can read the base sequence of the 
DNA template strand and transcribe it into a complementary RNA sequence. 

Once the closed promoter complex is established, the RNA polymerase holoen- 
zyme unwinds about 14 base pairs of DNA (base pairs located at positions —12 to +2, 
relative to the transcription start site; see later discussion), forming the very stable open 
promoter complex (Figure 29.3, Step 3). Promoter sequences can be identified in vitro 
by DNA footprinting: RNA polymerase holoenzyme is bound to a putative promoter 
sequence in a DNA duplex, and the DNA: protein complex is treated with DNase I. 
DNase I cleaves the DNA at sites not protected by bound protein, and the set of DNA 
fragments left after DNase I digestion reveals the promoter (by definition, the pro- 
moter is the RNA polymerase holoenzyme binding site). 

RNA polymerase binding typically protects a nucleotide sequence spanning the 
region from —70 to +20, where the +1 position is defined as the transcription start 
site: that base in the nontemplate DNA strand that is identical with the first base in 
the RNA transcript. The next base, +2, specifies the second base in the transcript. 
Nontemplate strand bases in the 5’, or “minus,” direction from the transcript start site 
are numbered —1, —2, and so on. (Note that there is no zero.) Nontemplate nu- 
cleotides in the “minus” direction are said to lie upstream of the transcription start 
site, whereas nucleotides in the 3’, or “plus,” direction are downstream of the tran- 
scription start site. The transcript start site on the template strand is usually a pyrimi- 
dine, so most transcripts begin with a purine. RNA polymerase binding protects 90 bp 
of DNA, equivalent to a distance of 30 nm along B-DNA. Because RNA polymerase is 
only 16 nm in its longest dimension, the DNA must be wrapped around the enzyme. 


Properties of Prokaryotic Promoters Promoters recognized by the principal ø factor, 
a”, serve as the paradigm for prokaryotic promoters. These promoters vary in size 
from 20 to 200 bp but typically consist of a 40-bp region located on the 5’-side 


A consensus sequence can be defined as the of the transcription start site. Within the promoter are two consensus sequence 
bases that appear with highest frequency ateach: elements. These two elements are the Pribnow box! near —10, whose consensus se- 
position when a series of sequences believed to quence is the hexameric TATAAT, and a sequence in the —35 region containing the 
have common function is compared. hexameric consensus TTGACA (Figure 29.4). The Pribnow box and the —35 region 


are separated by about 17 bp of nonconserved sequence. RNA polymerase holo- 


'Named for David Pribnow, who, along with David Hogness, first recognized the importance of this se- 
quence element in transcription. 
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galP2 ATTTATTCC TCGCATCTTTGT ATGGTTATTTCATACCAT 
lac ACCCCAG ATGCTTCCGGCTCG 
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rrnD1 CAAAAAAAT AATTGGGATCCC GCGCCTCCGTTGAGACGA 
rrnE 1 CAATTTTTC CGGAGAACTCCC GCGCCTCCATCGACACGG 
iRNATY CAACGTAAC CGCGTCATTTGA GCGCCCCGCTTCCCGATA 
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Initiation 
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FIGURE 29.4 The nucleotide sequences of representative E.coli promoters. (In accordance with convention, 
these sequences are those of the nontemplate strand where RNA polymerase binds.) Consensus sequences 
for the —35 region, the Pribnow box, and the initiation site are shown at the bottom. The numbers represent 
the percent occurrence of the indicated base. (Note: The —35 region is only roughly 35 nucleotides from the 
transcription start site; the Pribnow box [the —10 region] likewise is located at approximately position — 10.) 
In this figure, sequences are aligned relative to the Pribnow box. 
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DNA Footprinting—Identifying the Nucleotide Sequence in DNA Where 
a Protein Binds 


DNA footprinting is a widely used technique to DNA:protein complex Naked DNA 
identify the nucleotide sequence within DNA 
where a specific protein binds, such as the 
promoter sequence(s) bound by RNA polymerase 
holoenzyme. In this technique, the protein is in- 
cubated with a labeled (*) DNA fragment con- 
taining the nucleotide sequence where the pro- 
tein is believed to bind. (The DNA fragment is 
labeled at only one end.) Then, a DNA cleaving 
agent, such as DNase I, is added to the solution 
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I. When these DNase I digests are analyzed by gel 
electrophoresis, a difference is found between the 


set of labeled fragments from the DNA: protein Gel 
complex and the set from naked DNA. The ab- electrophoresis 
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sence of certain fragments in the digest of the 
DNA: protein complex reveals the location of the 
protein-binding site on the DNA (see accompany- 
ing figure). 








Adapted from Rhodes, D., and Fairall, L., 1997. Analysis 
of sequence-specific DNA-binding proteins. In Protein 
Function: A Practical Approach, Creighton, T. E., ed., 
Oxford: IRL Press at Oxford University Press. 
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enzyme uses its o-subunit to bind to the conserved sequences, and the more closely 
the —35 region sequence corresponds to its consensus sequence, the greater is the 
efficiency of transcription of the gene. The highly expressed rrn genes in E. coli that 
encode ribosomal RNA (rRNA) have a third sequence element in their promoters, 
the upstream element (UP element), located about 20 bp immediately upstream of 
the —35 region. (Transcription from the rrn genes accounts for more than 60% of 
total RNA synthesis in rapidly growing E. coli cells.) Whereas the o-subunit recog- 
nizes the —10 and —35 elements, the C-terminal domains (CTD) of the a-subunits 
of RNA polymerase recognize and bind the UP element. 

In order for transcription to begin, the DNA duplex must be “opened” so that 
RNA polymerase has access to single-stranded template. The efficiency of initiation 
is inversely proportional to the melting temperature, Tm, in the Pribnow box, sug- 
gesting that the A: T-rich nature of this region is aptly suited for easy “melting” of the 
DNA duplex and creation of the open promoter complex (see Figure 29.3). Nega- 
tive supercoiling facilitates transcription initiation by favoring DNA unwinding. 

The RNA polymerase o-subunit is directly involved in melting the dsDNA. Inter- 
action of the o-subunit with the nontemplate strand maintains the open complex 
formed between RNA polymerase and promoter DNA, with the o-subunit acting as a 
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sequence-specific single-stranded DNA-binding protein. Association of the o-subunit 
with the nontemplate strand stabilizes the open promoter complex and leaves the 
bases along the template strand available to the catalytic site of the RNA polymerase. 


Initiation of Polymerization RNA polymerase has two binding sites for NTPs: the 
initiation site and the elongation site. The first nucleotide binds at the initiation 
site, base-pairing with the +1 base exposed within the open promoter complex (see 
Figure 29.3, Step 4). The second incoming nucleotide binds at the elongation site, 
base-pairing with the +2 base. The ribonucleotides are then united when the 3’-O 
of the first nucleotide makes a phosphoester bond with the a-phosphorus atom of 
the second nucleotide, and PP; is eliminated. Note that the 5'-end of the transcript 
starts out with a triphosphate attached to it. Movement of RNA polymerase along 
the template strand (translocation) to the next base prepares the RNA polymerase to 
add the next nucleotide (see Figure 29.3, Step 5). Once an oligonucleotide 9 to 
12 residues long has been formed, the o-subunit dissociates from RNA polymerase, 
signaling the completion of initiation (see Figure 29.3, Step 6). The core RNA po- 
lymerase is highly processive and goes on to synthesize the remainder of the mRNA. 
As the core RNA polymerase progresses, advancing the 3'-end of the RNA chain, 
the DNA duplex is unwound just ahead of it. About 12 base pairs of the growing 
RNA remain base-paired to the DNA template at any time, with the RNA strand 
becoming displaced as the DNA duplex rewinds behind the advancing RNA 
polymerase. 


Chain Elongation Elongation of the RNA transcript is catalyzed by the core po- 
lymerase, because once a short oligonucleotide chain has been synthesized, the 
o-subunit dissociates. The accuracy of transcription is such that about once every 
104 nucleotides, an error is made and the wrong base is inserted. Because many 
transcripts are made per gene and most transcripts are smaller than 10 kb, this 
error rate is acceptable. 

Two possibilities can be envisioned for the course of the new RNA chain. In one, 
the RNA chain is wrapped around the DNA as the RNA polymerase follows the tem- 
plate strand around the axis of the DNA duplex, but this possibility seems unlikely 
due to its potential for tangling the nucleic acid strands (Figure 29.5a). In reality, 
transcription involves supercoiling of the DNA, so positive supercoils are created 
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FIGURE 29.5 Supercoiling versus transcription. (a) If the RNA polymerase followed the template strand around 
the axis of the DNA duplex, no supercoiling of the DNA would occur but the RNA chain would be wrapped 
around the double helix once every 10 bp. This possibility seems unlikely because it would be difficult to un- 
tangle the transcript from the DNA duplex. (b) Alternatively, gyrases and topoisomerases could remove the 
torsional stresses induced by transcription. 
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FIGURE 29.6 The termination site for the E. coli trp mRNA terminus 
operon (the trp operon encodes the enzymes of trypto- re in 
phan biosynthesis). The inverted repeats give rise to a 
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ahead of the transcription bubble and negative supercoils are created behind it 
(Figure 29.5b). To prevent torsional stress from inhibiting transcription, gyrases in- 
troduce negative supercoils (and thereby remove positive supercoils) ahead of RNA 
polymerase, and topoisomerases remove negative supercoils behind the DNA seg- 
ment undergoing transcription (Figure 29.5b). 


Chain Termination Two types of transcription termination mechanisms operate in 
bacteria: one that is dependent on a specific protein called rho termination factor 
(for the Greek symbol, p) and another, intrinsic termination, that is not. In intrin- 
sic termination, termination is determined by specific sequences in the DNA called 
termination sites. These sites are not indicated by particular bases showing where 
transcription halts. Instead, these sites consist of three structural features whose 
base-pairing possibilities lead to termination: 


1. Inverted repeats, which are typically G:C-rich, so a stable stem-loop structure can 
form in the transcript via intrachain base-pairing (Figure 29.6) 

2. A nonrepeating segment that punctuates the inverted repeats 

3. Arun of 6 to 8 As in the DNA template, coding for Us in the transcript 


Termination then occurs as follows: A G:C-rich, stem-loop structure, or “hairpin,” 
forms in the transcript. The hairpin apparently causes the RNA polymerase to 
pause, whereupon the A: U base pairs between the transcript and the DNA template 
strand are displaced through formation of somewhat more stable A:T base pairs be- 
tween the template and nontemplate strands of the DNA. The result is spontaneous 
dissociation of the nascent transcript from DNA. 

The alternative mechanism of termination—factor-dependent termination—is 
less common and mechanistically more complex. Rho factor is an ATP-dependent 
helicase (hexamer of 50-kD subunits) that catalyzes the unwinding of RNA: DNA 
hybrid duplexes (or RNA:RNA duplexes). The rho factor recognizes and binds to 
C-rich regions in the RNA transcript. These regions must be unoccupied by trans- 
lating ribosomes for rho factor to bind. Once bound, rho factor advances in the 
5’3' direction until it reaches the transcription bubble (Figure 29.7). There it cat- 
alyzes the unwinding of the transcript and template, releasing the nascent RNA 
chain. It is likely that the RNA polymerase stalls in a G:C-rich termination region, al- 
lowing rho factor to overtake it. 


29:28 How Is Transcription Regulated in Prokaryotes? 


In bacteria, genes encoding the enzymes of a particular metabolic pathway are often 
grouped adjacent to one another in a cluster on the chromosome. This pattern of or- 
ganization allows all of the genes in the group to be expressed in a coordinated fash- 
ion through transcription into a single polycistronic mRNA encoding all the enzymes 
of the metabolic pathway.? A regulatory sequence lying adjacent to the DNA being 


2A polycistronic mRNA is a single RNA transcript that encodes more than one polypeptide. Cistron is a 
genetic term for a DNA region representing a protein: Cistron and gene are essentially equivalent terms. 
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FIGURE 29.7 The rho factor mechanism of tran- 
scription termination. Rho factor (a) attaches to 
a recognition site on mRNA and (b) moves 
along it behind RNA polymerase. (c) When RNA 
polymerase pauses at the termination site, rho 
factor unwinds the DNA: RNA hybrid in the 
transcription bubble, (d) releasing the nascent 
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FIGURE 29.8 The general organization of operons. Operons consist of transcriptional control regions and a set 
of related structural genes, all organized in a contiguous linear array along the chromosome. The transcrip- 
tional control regions are the promoter and the operator, which lie next to, or overlap, each other, upstream 
from the structural genes they control. Operators may lie at various positions relative to the promoter, either 
upstream or downstream. Expression of the operon is determined by access of RNA polymerase to the pro- 
moter, and occupancy of the operator by regulatory proteins influences this access. Induction activates tran- 
scription from the promoter; repression prevents it. 


transcribed determines whether transcription takes place. This sequence is termed the 
operator (Figure 29.8). The operator is located next to a promoter. Interaction of a 
regulatory protein with the operator controls transcription of the gene cluster by con- 
trolling access of RNA polymerase to the promoter.” Such co-expressed gene clusters, 
together with the operator and promoter sequences that control their transcription, 
are called operons. 


Transcription of Operons Is Controlled by Induction and Repression 


In prokaryotes, gene expression is often responsive to small molecules serving as sig- 
nals of the nutritional or environmental conditions confronting the cell. Increased 
synthesis of enzymes in response to the presence of a particular substrate is termed 


3Although this is the paradigm for prokaryotic gene regulation, it must be emphasized that many 
prokaryotic genes do not contain operators and are regulated in ways that do not involve protein : op- 
erator interactions. 
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FIGURE 29.9 The structure of lactose, a B-galactoside. 
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FIGURE 29.10 The structure of IPTG (isopropyl 
B-thiogalactoside). 


induction. For example, lactose (Figure 29.9) can serve as both carbon and energy 
source for E. coli. Metabolism of lactose depends on hydrolysis into its component sug- 
ars, glucose and galactose, by the enzyme P-galactosidase. In the absence of lactose, 
E. coli cells contain very little -galactosidase (less than 5 molecules per cell). How- 
ever, lactose availability induces the synthesis of 6-galactosidase by activating transcrip- 
tion of the lac operon. One of the genes in the lac operon, lacZ, is the structural gene 
for B-galactosidase. When its synthesis is fully induced, 6-galactosidase can amount to 
almost 10% of the total soluble protein in E. coli. When lactose is removed from the 
culture, synthesis of -galactosidase halts. 

The alternative to induction—namely decreased synthesis of enzymes in response to 
a specific metabolite—is termed repression. For example, the enzymes of tryptophan 
biosynthesis in £E. coli are encoded in the trp operon. If sufficient Trp is available to the 
growing bacterial culture, the trp operon is not transcribed, so the Trp biosynthetic 
enzymes are not made; that is, their synthesis is repressed. Repression of the trp operon 
in the presence of Trp is an eminently logical control mechanism: If the end product 
of the pathway is present, why waste cellular resources making unneeded enzymes? 

Induction and repression are two faces of the same phenomenon. In induction, 
a substrate activates enzyme synthesis. Substrates capable of activating synthesis of 
the enzymes that metabolize them are called co-inducers, or, often, simply inducers. 
Some substrate analogs can induce enzyme synthesis even though the enzymes are 
incapable of metabolizing them. These analogs are called gratuitous inducers. A 
number of thiogalactosides, such as IPTG (isopropyl -thiogalactoside, Figure 
29.10), are excellent gratuitous inducers of -galactosidase activity in E. coli. In re- 
pression, a metabolite, typically an end product, depresses synthesis of its own 
biosynthetic enzymes. Such metabolites are called co-repressors. 


The /ac Operon Serves as a Paradigm of Operons 


In 1961, Francois Jacob and Jacques Monod proposed the operon hypothesis to ac- 
count for the coordinate regulation of related metabolic enzymes. The operon was 
considered to be the unit of gene expression, consisting of two classes of genes: the 
structural genes for the enzymes and regulatory genes that control expression of the 
structural genes. The two kinds of genes could be distinguished by mutation. Muta- 
tions in a structural gene would abolish one particular enzymatic activity, but muta- 
tions in a regulatory gene would affect all of the different enzymes under its control. 
Mutations of both kinds were known in E. coli for lactose metabolism. Bacteria with 
mutations in either the lacZ gene or the lacY gene (Figure 29.11) could no longer 
metabolize lactose—the lacZ mutants (lacZ~ strains) because B-galactosidase activity 
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FIGURE 29.11 The /ac operon.The operon consists of two transcription units. In one unit, there are three structural 
genes, lacZ, lacy, and lacA, under control of the promoter, Piac, and the operator O. In the other unit, there is a 





regulator gene, /acl, with its own promoter, Piaci. [acl encodes a 360-residue, 38.6-kD polypeptide that forms a 
tetrameric lac repressor protein. lacZ encodes B-galactosidase, a tetrameric enzyme of 116-kD subunits. lacY is the 
B-galactoside permease structural gene, a 46.5-kD integral membrane protein active in B-galactoside transport 
into the cell. The remaining structural gene encodes a 22.7-kD polypeptide, but the metabolic role of this protein in 
vivo remains uncertain. 


29.2 How ls Transcription Regulated in Prokaryotes? 915 


was absent, the lacY mutants because lactose was no longer transported into the cell. 
Other mutations defined another gene, the /acI gene. lac! mutants were different be- 
cause they both expressed $-galactosidase activity and immediately transported lac- 
tose, without prior exposure to an inducer. That is, a single mutation led to the expres- 
sion of lactose metabolic functions independently of inducer. Expression of genes 
independently of regulation is termed constitutive expression. Thus, acl had the 
properties of a regulatory gene. The lac operon includes the regulatory gene Jacl; its 
promoter p; and three structural genes, lacZ, lacY, and lacA, with their own promoter 
Puc and operator O (Figure 29.11). 


lac Repressor Is a Negative Regulator of the lac Operon 


The structural genes of the lac operon are controlled by negative regulation. 
That is, they are transcribed to give an mRNA unless turned off by the Jacl gene 
product. This gene product is the lac repressor, a tetrameric protein (Figure 
29.12). (Note that the language can be misleading: Inducers and co-repressors are 
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FIGURE 29.12 The mode of action of lac repressor. The 
structure of the lac repressor tetramer with bound IPTG 
(purple) is also shown (pdb id = 1LBH). 
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FIGURE 29.13 The nucleotide sequence of the lac oper- 
ator. This sequence comprises 36 bp showing nearly 
palindromic symmetry. The inverted repeats that consti- 
tute this approximate twofold symmetry are shaded in 
rose. The bases are numbered relative to the +1 start 
site for transcription. The G:C base pair at position +11 
represents the axis of symmetry. In vitro studies show 
that bound lac repressor protects a 26-bp region from 
—5 to +21 against nuclease digestion. Bases that inter- 
act with bound lac repressor are indicated below the 
operator. Note the symmetry of protection at +1 
through +4TTAA to +18 through +21 AATT. 
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*Values for repressor: DNA binding are given as association 
constants, Ka, for the formation of DNA: repressor complex 
from DNA and repressor. 

tSpecificity is defined as the ratio (K, for repressor binding to 
operator DNA)/(K, for repressor binding to random DNA). 





DNA TGTTGTGTGGAATTGTGAG CI@GATAACAATTTCACACA 
ACANGAGAG CITAACA CIC GMC TATTGTTAAAGTGTGT 





Axis of symmetry 





AAT TT 
Protected bases TTAA TTGTT 
| | | | | l | | 
-10 -5 +1 +5 +10 +15 +20 +25 


Lo mRNA transcription 


small molecules/metabolites; repressors are proteins.) The lac repressor has an 
N-terminal DNA-binding domain; the rest of the protein functions in inducer 
binding and tetramer formation. In the absence of an inducer, lac repressor 
blocks lac gene expression by binding to the operator DNA site upstream from 
the lac structural genes. The lac operator is a palindromic DNA sequence (Figure 
29.13). Palindromes, or “inverted repeats,” provide a twofold, or dyad, symmetry, 
a structural feature common at sites in DNA where proteins specifically bind. 
Despite the presence of lac repressor, RNA polymerase can still initiate tran- 
scription at the promoter (puc), but lac repressor blocks elongation of transcrip- 
tion, so initiation is aborted. In /acI mutants, the lac repressor is absent or defec- 
tive in binding to operator DNA, lac gene transcription is not blocked, and the 
lac operon is constitutively expressed in these mutants. Note that /ac/ is normally 
expressed constitutively from its promoter, so lac repressor protein is always avail- 
able to fill its regulatory role. About ten molecules of lac repressor are present in 
an E. coli cell. 

Derepression of the lac operon occurs when appropriate 6-galactosides occupy 
the inducer site on lac repressor, causing a conformational change in the protein 
that lowers the repressor’s affinity for operator DNA. As a tetramer, lac repressor has 
four inducer binding sites, and its response to inducer shows cooperative allosteric 
effects. Thus, as a consequence of the “inducer”-induced conformational change, 
the inducer: lac repressor complex dissociates from the DNA, and RNA polymerase 
transcribes the structural genes (see Figure 29.12). Induction reverses rapidly: lac 
mRNA has a half-life of only 3 minutes, and once the inducer is used up through 
metabolism by the enzymes, free lac repressor reassociates with the operator DNA, 
transcription of the operon is halted, and any residual lac mRNA is degraded. 

In the absence of inducer, lac repressor binds nonspecifically to duplex DNA 
with an association constant, Ka, of 2 X 10° M7! (Table 29.1) and to the lac 
operator DNA sequence with much higher affinity, Ka = 2 X 10'° M7!. Thus, lac 
repressor binds 10’ times better to lac operator DNA than to any random DNA se- 
quence. IPTG binds to lac repressor with an association constant of about 10° M71. 
The IPTG: lac repressor complex binds to operator DNA with an association con- 
stant, Ka = 2 X 10!°M~!. Although this affinity is high, it is 3 orders of magnitude 
less than the affinity of inducer-free repressor for lac operator. There is no differ- 
ence in the affinity of free lac repressor and lacrepressor with IPTG bound for non- 
operator DNA. The lac repressor apparently acts by binding to DNA and sliding 
along it, testing sequences in a one-dimensional search until it finds the lac opera- 
tor. The lac repressor then binds there with high affinity until inducer causes this 
affinity to drop by 3 orders of magnitude (Table 29.1). 


CAP Is a Positive Regulator of the Jac Operon 


Transcription by RNA polymerase from some promoters proceeds with low efficiency 
unless assisted by an accessory protein that acts as a positive regulator. One such protein 
is CAP, or catabolite activator protein. Its name derives from the phenomenon of 
catabolite repression in E. coli. Catabolite repression is a global control that coordi- 
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Quantitative Evaluation of lac Repressor: DNA Interactions 


The affinity of lac repressor for random DNA ensures that essentially 
all repressor is DNA bound. Assume that E. coli DNA has a single spe- 
cific lac operator site for repressor binding and 4.64 X 10° base 


If the number of nonspecific binding sites is 4.64 X 10°, there are 
(4.64 X 10°) /(6.022 X 10%) = 0.77 X 107!” moles of binding sites 
contained in the volume of a bacterial cell (roughly 107" liters). 


Therefore, [DNA] = (0.77 X 107!”)/(107!°) = 0.77 X 1072 M. Since 
K, = 2 X 10° M`! (Table 29.1), 


pairs and any nucleotide sequence even one base out of phase with 
the operator constitutes a nonspecific binding site. Thus, there are 
4.64 X 10° nonspecific sites for repressor binding. 

The binding of repressor to DNA is given by the association 
constant, Ka: 


[repressor] 1 1 
[repressor:DNA] — (2 X 10°) (0.77 X 107°) ~ (1.54 X 104) 





So, the ratio of free repressor to DNA-bound repressor is 6.5 X 
10%. Less than 0.01% of repressor is not bound to DNA! The behavior 
of lac repressor is characteristic of DNA-binding proteins. These 
proteins bind with low affinity to random DNA sequences, but with 
much higher affinity to their unique target sites (Table 29.1). 


[repressor : DNA] 
a [repressor] [DNA] 





where [repressor:DNA] is the concentration of repressor: DNA 
complex, [repressor] is the concentration of free repressor, and 
[DNA] is the concentration of nonspecific binding sites. Rearrang- 
ing gives the following: 


[repressor] il 
[repressor:DNA] ~ Ka[DNA] 











nates gene expression with the total physiological state of the cell: As long as glucose 
is available, E. coli catabolizes it in preference to any other energy source, such as lac- 
tose or galactose. Catabolite repression ensures that the operons necessary for me- 
tabolism of these alternative energy sources, that is, the lac and gal operons, remain 
repressed until the supply of glucose is exhausted. Catabolite repression overrides the 
influence of any inducers that might be present. 

Catabolite repression is maintained until the E. coli cells become starved of glu- 
cose. Glucose starvation leads to activation of adenylyl cyclase, and the cells begin to 
make cAMP. (In contrast, glucose uptake is accompanied by deactivation of adenylyl 
cyclase.) The action of CAP as a positive regulator is cAMP-dependent. cAMP is a 
small-molecule inducer for CAP, and cAMP binding enhances CAP’s affinity for 
DNA. CAP, also referred to as CRP (for cAMP receptor protein), is a dimer of iden- 
tical 22.5-kD polypeptides. The N-terminal domains bind cAMP; the C-terminal do- 
mains constitute the DNA-binding site. Two molecules of cAMP are bound per 
dimer. The CAP—(cAMP)», complex binds to specific target sites near the promoters 
of operons (Figure 29.14). Binding of CAP—(cAMP)»s to DNA causes the DNA to 
bend more than 80° (Figure 29.15). This CAP-induced DNA bending near the pro- 
moter assists RNA polymerase holoenzyme binding and closed promoter complex 
formation. Contacts made between the CAP—(cAMP)» complex and the a-subunit of 
RNA polymerase holoenzyme activate transcription. 
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Negative and Positive Control Systems Are Fundamentally Different k J 
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Negative and positive control systems operate in fundamentally different ways (al- 
though in some instances both govern the expression of the same gene). Genes un- 
der negative control are transcribed unless they are turned off by the presence of a 
repressor protein. Often, transcription activation is merely the release from nega- 
tive control. In contrast, genes under positive control are expressed only if an active 
regulator protein is present. The lac operon illustrates these differences. The action 
of lac repressor is negative. It binds to operator DNA and blocks transcription; ex- 
pression of the operon occurs only when this negative control is lifted through the 
release of the repressor. In contrast, regulation of the lac operon by CAP is positive: 
Transcription of the operon by RNA polymerase is stimulated by CAP’s action as a 
positive regulator. 


FIGURE 29.14 The mechanism of catabolite repression 
and CAP action. Glucose instigates catabolite repression 
by lowering cAMP levels. cAMP is necessary for CAP 
binding near promoters of operons whose gene prod- 
ucts are involved in the metabolism of alternative en- 
ergy sources such as lactose, galactose, and arabinose. 
The binding sites for the CAP—(cAMP), complex are 
consensus DNA sequences containing the conserved 
pentamer TGTGA and a less well conserved inverted re- 
peat, TCANA (where N is any nucleotide). 
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FIGURE 29.15 Binding of CAP-(cAMP), induces a severe 
bend in DNA about the center of dyad symmetry at the 
CAP-binding site. The CAP dimer with two molecules of 
cAMP bound interacts with 27 to 30 base pairs of du- 
plex DNA. Two a-helices of the CAP dimer insert into 
the major groove of the DNA at the dyad-symmetric 
CAP-binding site. The two cAMP molecules bound by 
the CAP dimer are indicated in yellow. Binding of 
CAP-(cAMP), to its specific DNA site involves H bonding 
and ionic interactions between protein functional 
groups and DNA phosphates, as well as H-bonding in- 
teractions in the DNA major groove between amino 
acid side chains of CAP and DNA base pairs (pdb id = 
1CGP). 





Operons can also be classified as inducible, repressible, or both, depending on how 
they respond to the small molecules that mediate their expression. Repressible oper- 
ons are expressed only in the absence of their co-repressors. Inducible operons are 
transcribed only in the presence of small-molecule co-inducers (Figure 29.16). 


The araBAD Operon Is Both Positively and Negatively 
Controlled by AraC 


E. coli can use the plant pentose L-arabinose as sole source of carbon and energy. Ara- 
binose is metabolized via conversion to D-xylulose-5-P (a pentose phosphate pathway 
intermediate and transketolase substrate [see Chapter 22]) by three enzymes en- 
coded in the araBAD operon. Transcription of this operon is regulated by both 
catabolite repression and arabinose-mediated induction. CAP functions in catabolite 
repression; arabinose induction is achieved via the product of the araC gene, which 
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(a) The araBAD operon CAP RNA 
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lies next to the araBAD operon on the E. coli chromosome. The araC gene product, 
the protein AraC,‘ is a 292-residue protein consisting of an N-terminal domain 
(residues 1 to 170) that binds arabinose and acts as a dimerization motif and a 
C-terminal (residues 178 to 292) DNA-binding domain. Regulation of araBAD by 
AraC is novel in that it acts both negatively and positively. The ara operon has three 
binding sites for AraC: araO,, located at nucleotides —106 to —144 relative to the 
araBAD transcription start site; araO (spanning positions —265 to —294); and aral, 
the araBAD promoter. The aral site consists of two “half-sites”; arah (nucleotides 
—56 to —78) and arah (—35 to —51). (The araO, site contributes minimally to ara 
operon regulation.) 

The details of araBAD regulation are as follows: When AraC protein levels are low, 
the araC gene is transcribed from its promoter p, (adjacent to araO,) by RNA poly- 
merase (Figure 29.17). araC is transcribed in the direction away from araBAD. When 
cAMP levels are low and arabinose is absent, an AraC protein dimer binds to two sites, 
araO and the arah half-site, forming a DNA loop between them and restricting tran- 
scription of araBAD (Figure 29.17). In the presence of L-arabinose, the monomer of 
AraC bound to the araQ, site is released from that site; it then associates with the un- 
occupied aral half-site, arab. L-Arabinose thus behaves as an allosteric effector that al- 
ters the conformation of AraC. In the arabinose-liganded conformation, the AraC 
dimer interacts with CAP—(cAMP), to activate transcription by RNA polymerase. Thus, 
AraC protein is both a repressor and an activator. 


‘Proteins are often named for the genes encoding them. By convention, the name of the protein is 
capitalized but not italicized. 
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FIGURE 29.17 Regulation of the araBAD operon by the 
combined action of CAP and AraC protein. 
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Positive control of the araBAD operon occurs in the presence of L-arabinose and 
cAMP. Arabinose binding by AraC protein causes the release of araO:, opening of 
the DNA loop, and association of AraC with arah. CAP—(cAMP)» binds at a site be- 
tween araQ, and aral, and together the AraC-(arabinose) 9 and CAP—(cAMP)». com- 
plexes influence RNA polymerase through protein: protein interactions to create 
an active transcription initiation complex. Supercoiling-induced DNA looping may 
promote protein: protein interactions between DNA-binding proteins by bringing 
them into juxtaposition. 


The trp Operon Is Regulated Through a Co-Repressor—Mediated 
Negative Control Circuit 


The trp operon of E. coli (and S. typhimurium) encodes the five polypeptides, trpE 
through irpA (Figure 29.18), that assemble into the three enzymes catalyzing tryp- 
tophan synthesis from chorismate (see Chapter 25). Expression of the trp operon is 
under the control of Trp repressor, a dimer of 108-residue polypeptide chains. 
When tryptophan is plentiful, Trp repressor binds two molecules of tryptophan and 
associates with the trp operator that is located within the trp promoter. Trp repres- 
sor binding excludes RNA polymerase from the promoter, preventing transcription 
of the trp operon. When Trp becomes limiting, repression is lifted because Trp re- 
pressor lacking bound Trp (Trp apo-repressor) has a lowered affinity for the trp pro- 
moter. Thus, the behavior of Trp repressor corresponds to a co-repressor—mediated, 
negative control circuit (see Figure 29.16). Trp repressor not only is encoded by the 
trpR operon but also regulates expression of the trpR operon. This is an example of 
autogenous regulation (autoregulation): regulation of gene expression by the prod- 
uct of the gene. 


Attenuation Is a Prokaryotic Mechanism for Post-Transcriptional 
Regulation of Gene Expression 


In addition to repression, expression of the trp operon is controlled by transcription 

attenuation. Unlike the mechanisms discussed thus far, attenuation regulates tran- 

scription after it has begun. Charles Yanofsky, the discoverer of this phenomenon, has 
FIGURE 29.18 The trp operon of E. coli. 
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Operon Amino acid Sequence 

his Met— Thr —Arg—Val— Gln —Phe — Lys — His —His —His— His — His—His—His—Pro—Asp 

ilu Met—Thr —Ala—Leu — Leu — Arg—Val—Ile— Ser —Leu—Val — Val —Ile — Ser—Val —Val—Val—Ile —Ile —Ile —Pro— Pro —Cys—Gly—Ala—Ala— Leu—Gly—Arg— Gly—Lys—Ala 
leu Met—Ser—His—Ile—Val—Arg—Phe—Thr — Gly—Leu —Leu—Leu—Leu— Asn—Ala—Phe—Ile —Val—Arg—Gly—Arg—Pro —Val—Gly—Gly —Ile —Gln—His 

pheA Met— Lys —His—Ile — Pro —-Phe —Phe— Phe —Ala —Phe — Phe— Phe —Thr—Phe — Pro 

thr Met—Lys— Arg —Ile Ser- —Thr—Thr — Ile Thr —Thr—Thr— le Thi Ne Thr —Thr —Gln—Asn —Gly —Ala —Gly 

trp Met—Lys—Ala—Ile — Phe —Val— Leu — Lys —Gly —Trp — Trp — Arg —Thr —Ser 





FIGURE 29.19 Amino acid sequences of leader peptides in 
by attenuation. Color indicates amino acids synthesized in 





various amino acid biosynthetic operons regulated 


he pathway catalyzed by the operon’s gene prod- 


ucts. (The ilv operon encodes enzymes of isoleucine, leucine, and valine biosynthesis.) 


defined attenuation as any regulatory mechanism that manipulates transcription termination 
or transcription pausing to regulate gene transcription downstream. In prokaryotes, tran- 
scription and translation (see Chapters 10 and 30) are coupled, and the translating 
ribosome is affected by the formation and persistence of secondary structures in the 
mRNA. In many operons encoding enzymes of amino acid biosynthesis, a transcribed 
150- to 300-bp leader region is positioned between the promoter and the first major 
structural gene. These regions encode a short leader peptide containing multiple 
codons for the pertinent amino acid. For example, the leader peptide of the leu 
operon has four leucine codons, the trp operon has two tandem tryptophan codons, 
and so forth (Figure 29.19). Translation of these codons depends on an adequate sup- 
ply of the relevant aminoacyl-tRNA, which in turn rests on the availability of the 
amino acid. When tryptophan is scarce, the entire trp operon from trpL to irpA is tran- 
scribed to give a polycistronic mRNA. But, as [Trp] increases, more and more of the 
trp transcripts consist of only a 140-nucleotide fragment corresponding to the 5'-end 
of trpL. Tryptophan availability is causing premature termination of trp transcription, 
that is, transcription attenuation. Although attenuation occurs when tryptophan is 
abundant, attenuation is blocked when levels of tryptophan are low and little trypto- 
phanyl-tRNA is available. The secondary structure of the 160-bp leader region tran- 
script is the principal control element in transcription attenuation (Figure 29.20). 
This RNA segment includes the coding region for the 14-residue leader peptide. 
Three critical base-paired hairpins can form in this RNA: the 1:2 pause structure, the 
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FIGURE 29.20 Alternative secondary struc- 
tures for the leader region (troL mRNA) of 


the trp operon transcript. 
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FIGURE 29.21 The mechanism of attenuation in the trp 
operon. 


3:4 terminator, and the 2:3 antiterminator. Obviously, the 1:2 pause, 3:4 terminator, 
and the 2:3 antiterminator represent mutually exclusive alternatives. A significant fea- 
ture of this coding region is the tandem UGG tryptophan codons. 

Transcription of the trp operon by RNA polymerase begins and progresses un- 
til position 92 is reached, whereupon the 1:2 hairpin is formed, causing RNA 
polymerase to pause in its elongation cycle. While RNA polymerase is paused, a 
ribosome begins to translate the leader region of the transcript. Translation by 
the ribosome releases the paused RNA polymerase and transcription continues, 
with RNA polymerase and the ribosome moving in unison. As long as tryptophan 
is plentiful enough that tryptophanyl-tRNA"? is not limiting, the ribosome is not 
delayed at the two tryptophan codons and follows closely behind RNA poly- 
merase, translating the message soon after it is transcribed. The presence of the 
ribosome atop segment 2 blocks formation of the 2:3 antiterminator hairpin, al- 
lowing the alternative 3:4 terminator hairpin to form (Figure 29.21). Stable hair- 
pin structures followed by a run of Us are features typical of rho-independent 
transcription termination signals, so the RNA polymerase perceives this hairpin 
as a transcription stop signal and transcription is terminated at this point. On the 
other hand, a paucity of tryptophan and hence low availability of tryptophanyl- 
tRNA"? causes the ribosome to stall on segment 1. This leaves segment 2 free to 
pair with segment 3 and to form the 2:3 antiterminator hairpin in the transcript. 
Because this hairpin precludes formation of the 3:4 terminator, termination is 
prevented and the entire operon is transcribed. Thus, transcription attenuation 
is determined by the availability of tyrptophanyl-tRNA®? and its transitory influ- 
ence over the formation of alternative secondary structures in the mRNA. 


DNA: Protein Interactions and Protein: Protein Interactions 
Are Essential to Transcription Regulation 


Quite a variety of control mechanisms regulate transcription in prokaryotes. Several 
organizing principles materialize. First, DNA: protein interactions are a central fea- 
ture in transcriptional control, and the DNA sites where regulatory proteins bind 
commonly display at least partial dyad symmetry or inverted repeats. Furthermore, 
DNA-binding proteins themselves are generally even-numbered oligomers (for ex- 
ample, dimers, tetramers) that have an innate twofold rotational symmetry. Second, 
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protein: protein interactions are an essential component of transcriptional activation. 
We see this latter feature in the activation of RNA polymerase by CAP-(cAMP)o, for 
example. Third, the regulator proteins receive cues that signal the status of the envi- 
ronment (for example, Trp, lactose, cAMP) and act to communicate this information 
to the genome, typically via the medium of conformational changes and DNA: pro- 
tein interactions. 


Proteins That Activate Transcription Work Through 
Protein : Protein Contacts with RNA Polymerase 


Although transcriptional control is governed by a variety of mechanisms, an underly- 
ing principle of transcriptional activation has emerged. Transcriptional activation can 
take place when a transcriptional activator protein [such as CAP—(cAMP)>»] bound to 
DNA makes protein: protein contacts with RNA polymerase, and the degree of tran- 
scriptional activation is proportional to the strength of the protein:protein interac- 
tion. Generally speaking, a nucleotide sequence that provides a binding site for a 
DNA-binding protein can serve as an activator site if the DNA-binding protein bound 
there can interact with promoter-bound RNA polymerase. These interactions can 
involve either the a-, B-, B'-, or o-subunits of RNA polymerase. Moreover, if the 
DNA-bound transcriptional activator makes contacts with two different components 
of RNA polymerase, a synergistic effect takes place such that transcription is markedly 
elevated. Thus, transcriptional activation at specific genes relies on the presence of 
one or more activator sites where one or more transcriptional activator proteins can 
bind and make contacts with RNA polymerase bound at the promoter of the gene. In- 
deed, transcriptional activators may facilitate the recruitment and binding of RNA 
polymerase to the promoter. This general principle applies to transcriptional activa- 
tion in both prokaryotic and eukaryotic cells. In eukaryotes, transcriptional activators 
typically have discrete domains of protein structure dedicated to DNA binding (DB 
domains) and transcriptional activation (TA domains). 








DNA Looping Allows Multiple DNA-Binding Proteins 
to Interact with One Another 


Because transcription must respond to a variety of regulatory signals, multiple pro- 
teins are essential for appropriate regulation of gene expression. These regulatory 
proteins are the sensors of cellular circumstances, and they communicate this infor- 
mation to the genome by binding at specific nucleotide sequences. However, DNA is 
virtually a one-dimensional polymer, and there is little space for a lot of proteins to 
bind at (or even near) a transcription initiation site. DNA looping permits additional 
proteins to convene at the initiation site and to exert their influence on creating and 
activating an RNA polymerase initiation complex (Figure 29.22). The number of par- 
ticipants in transcriptional regulation is greatly expanded by DNA looping. 
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FIGURE 29.22 Formation of a DNA loop deliv- 
ers DNA-bound transcriptional activator to 
RNA polymerase positioned at the promoter. 
Protein: protein interactions between the 
transcriptional activator and RNA polymerase 
activate transcription. 
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a-Amanitin 
FIGURE 29.23 The structure of a-amanitin, one of a 


series of toxic Compounds known as amatoxins that are 
found in the mushroom Amanita phalloides. 


29:3 How Are Genes Transcribed in Eukaryotes? 


Although the mechanism of transcription in prokaryotes and eukaryotes is funda- 
mentally similar, transcription is substantially more complicated in eukaryotes. The 
significant difference is that the DNA of eukaryotes is wrapped around histones to 
form nucleosomes, and the nucleosomes are further organized into chromatin 
(see Chapter 11). Nucleosomes repress gene expression. Nucleosomes control gene ex- 
pression by controlling access of the transcriptional apparatus to genes. Two classes 
of transcriptional co-regulators are necessary to overcome nucleosome repression: 
(1) enzymes that covalently modify the nucleosome histone proteins and thereby 
loosen histone: DNA interactions and (2) ATP-dependent chromatin-remodeling 
complexes. However, gene activation depends not only on relief from nucleosome 
repression but also on interaction of RNA polymerase with the promoter. Only 
those genes activated by specific positive regulatory mechanisms are transcribed. A 
general understanding of transcription in eukaryotes rests on the following topics: 


e The three classes of RNA polymerase in eukaryotes: RNA polymerases I, I, 
and III 
e The structure and function of RNA polymerase I, the mRNA-synthesizing RNA 
polymerase 
e Transcription regulation in eukaryotes, including: 
— General features of gene regulatory sequences: promoters, enhancers, and re- 
sponse elements 
— Transcription initiation by RNA polymerase II 
The general transcription factors (GTFs) 
Alleviating the repression due to nucleosomes 
Histone acetyl transferases (HATs) 
Chromatin-remodeling complexes 
e A general model for eukaryotic gene activation, based on the preceding 


We turn now to a review of these various features of eukaryotic transcription. 


Eukaryotes Have Three Classes of RNA Polymerases 


Eukaryotic cells have three classes of RNA polymerase, each of which synthesizes a 
different class of RNA. All three enzymes are found in the nucleus. RNA polymerase 
Tis localized to the nucleolus and transcribes the major ribosomal RNA genes. RNA 
polymerase II transcribes protein-encoding genes, and thus it is responsible for the 
synthesis of mRNA. RNA polymerase III transcribes tRNA genes, the ribosomal RNA 
genes encoding 5S rRNA, and a variety of other small RNAs, including several in- 
volved in mRNA processing and protein transport. 

All three RNA polymerase types are large, complex multimeric proteins (500 to 
700 kD), consisting of ten or more types of subunits. Although the three differ in over- 
all subunit composition, they have several smaller subunits in common. Furthermore, 
all possess two large subunits (each 140 kD or greater) having sequence similarity to 
the large B- and B'-subunits of E. coli RNA polymerase, indicating that the funda- 
mental catalytic site of RNA polymerase is conserved among its various forms. 

In addition to their different functions, the three classes of RNA polymerase can be 
distinguished by their sensitivity to a@-amanitin (Figure 29.23), a bicyclic octapeptide 
produced by the poisonous mushroom Amanita phalloides (the “destroying angel” 
mushroom). a-Amanitin blocks RNA chain elongation. Although RNA polymerase I is 
resistant to this compound, RNA polymerase II is very sensitive and RNA polymerase 
III is less sensitive. 

The existence of three classes of RNA polymerases acting on three distinct sets 
of genes implies that at least three categories of promoters exist to maintain this 
specificity. Eukaryotic promoters are very different from prokaryotic promoters. 
All three eukaryotic RNA polymerases interact with their promoters via so-called 
transcription factors—DNA-binding proteins that recognize and accurately initiate 
transcription at specific promoter sequences. For RNA polymerase I, its templates 
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are the rRNA genes. Ribosomal RNA genes are present in multiple copies. Optimal 
expression of these genes requires the first 150 nucleotides in the immediate 
5'-upstream region. 

RNA polymerase III interacts with transcription factors TFTA, TFIIIB, and 
TFIHIC. Interestingly, TFIIIA and/or TFIIIC bind to specific recognition sequences 
that in some instances are located within the coding regions of the genes, not in the 
5'-untranscribed region upstream from the transcription start site. TFIIIB associates 
with TFIIA or TFHIC already bound to the DNA. RNA polymerase III then binds 
to TFIIB to establish an initiation complex. 


RNA Polymerase II Transcribes Protein-Coding Genes 


As the enzyme responsible for the regulated synthesis of mRNA, RNA polymerase II 
has aroused greater interest than RNA polymerases I and III. RNA polymerase II must 
be capable of transcribing a great diversity of genes, yet it must carry out its function 
at any moment only on those genes whose products are appropriate to the needs of 
the cell in its ever-changing metabolism and growth. The RNA polymerase II from 
yeast (Saccharomyces cerevisiae) has been extensively characterized, and its structure has 
been solved by x-ray crystallography (Figure 29.24). Strong homology between yeast 
and human RNA polymerase II subunits suggests that the yeast RNA polymerase II is 
an excellent model for human RNA polymerase II. The yeast RNA polymerase II 
consists of 12 different polypeptides, designated RPB1 through RPB12 and ranging 
in size from 192 to 8 kD (Table 29.2). RPB3, RPB4, and RPB7 are unique to RNA 
polymerase II, whereas RPB5, RPB6, RPB8, and RPB10 are common to all three 
eukaryotic RNA polymerases. 

RNA polymerases adopt a clawlike structure, grasping the DNA duplex as shown 
in Figure 29.24 for yeast RNA polymerase II. The DNA strands are unwound and 
separated, and the template strand enters the active site, where template-directed 
NTP substrate selection and NMP addition to the growing RNA transcript occur. 
NTP substrates access the active site through a channel in the floor. The DNA:RNA 
hybrid duplex exits at a right angle from the active site. As the hybrid duplex 
emerges from the top of the protein structure, the RNA transcript is separated from 
the DNA template by RPBI residue Phe®?, which acts as a wedge, splitting the po- 
sition —10 RNA:DNA base pair (the base pair located 10 bases from the active site). 
The template strand is now free to re-anneal with the nontemplate strand to 
re-establish the dsDNA structure. 
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FIGURE 29.24 Structure of the yeast RNA polymerase 
(pdb id = 1Y77).The template DNA strand is shown in 
green, the nontemplate DNA strand in blue. The RNA 
transcript (hot pink) is emerging at the bottom of the 
structure. RPB1, the largest polypeptide chain, is shown 
in orange, its C-terminal domain (CTD) is to the upper 
right. The active-site Mg?* is shown as a red sphere. The 
atoms of a ribonucleotide substrate analog (GMPCP) are 
shown as dark blue spheres. 
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Subunit Side (kD) Features Prokaryotic Homolog 
RPB1} 192 YSPTSPS CTD B' 
RPB2 139 NTP binding B 
RPB3 35 Core assembly a 
RPB4 25 Promoter recognition o 
RPB5 25 In polymerases I, II, and II 

RPB6 18 In polymerases I, II, and II 

RPB7 19 Unique to polymerase II 

RPB8 17 In polymerases I, II, and II 

RPB9 14 Nonessential 

RPB10 8 In polymerases I, II, and II 

RPB11 14 

RPB12 8 In polymerases I, II, and II 











*A very similar RNA polymerase II can be isolated from human cells. 

+tRPB stands for RNA polymerase B; RNA polymerases |, Il, and III are sometimes called RPA, RPB, and RPC. 

Adapted from Myer, V. E., and Young, R. A., 1998. RNA polymerase II holoenzymes and subcomplexes. The Journal of Biologi- 
cal Chemistry 273:27757-27760. 


The RPB1 subunit has an unusual structural feature not found in prokaryotes: Its 
C-terminal domain (CTD) contains 27 repeats of the amino acid sequence YSPTSPS. 
(The analogous subunit in RNA polymerase II enzymes of other eukaryotes has this 
heptapeptide tandemly repeated as many as 52 times.) Note that the side chains of 
5 of the 7 residues in this repeat have —OH groups, endowing the CTD with con- 
siderable hydrophilicity and multiple sites for phosphorylation. A number of CTD 
kinases have been described, targeting different residues at different stages of the 
transcription process. The CTD domain may project more than 50 nm from the sur- 
face of RNA polymerase II. 

The CTD is essential to RNA polymerase II function. Only RNA polymerase I 
whose CTD is not phosphorylated can initiate transcription. However, transcription 
elongation proceeds only after protein phosphorylation within the CTD, suggesting 
that phosphorylation triggers the conversion of an initiation complex into an elon- 
gation complex. Such a mechanism would allow protein phosphorylation to regu- 
late gene expression. Following termination of transcription, a phosphatase recycles 
RNA polymerase II to its unphosphorylated form. The CTD also plays a prominent 
role in orchestrating subsequent events in the transcription process. A multitude of 
additional proteins are essential to the formation of a translatable mRNA from the 
primary RNA polymerase II transcript; these proteins (described later in this chap- 
ter) include 5’-capping enzymes, splicing factors, and 3’-polyadenylylation com- 
plexes. Recruitment of these proteins is dependent upon phosphorylation of Ser 
residues at positions 2 and 5 in the CTD heptapeptide repeat. Phosphorylation of 
these Ser residues is also a prerequisite for interactions between the CTD and the 
histone methyltransferases capable of remodeling nucleosomes into a transcrip- 
tionally permissive state. 


The Regulation of Gene Expression Is More Complex in Eukaryotes 


Not only metabolic activity and cell division but also complex patterns of embryonic 
development and cell differentiation must be coordinated through the regulation of 
gene expression. All this coordinated regulation takes place in cells where the relative 
quantity (and diversity) of DNA is very great: A typical mammalian cell has 1500 times 
as much DNA as an EF. coli cell. The structural genes of eukaryotes are rarely organized 
in clusters akin to operons. Each eukaryotic gene typically possesses a discrete set of 
regulatory sequences appropriate to the requirements for regulating its transcription. 
Certain of these sequences provide sites of interaction for general transcription factors, 
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whereas others endow the gene with great specificity in expression by providing targets 
for specific transcription factors. 


Gene Regulatory Sequences in Eukaryotes Include Promoters, 
Enhancers, and Response Elements 


RNA polymerase IT promoters commonly consist of two separate sequence features: 
the core element, near the transcription start site, where general transcription factors 
(GTFs) bind, and more distantly located regulatory elements, known variously as 
enhancers or silencers. These regulatory sequences are recognized by specific 
DNA-binding proteins that activate transcription above basal levels (enhancers bind 
transcriptional activators) or repress transcription (silencers bind repressors). The site of 
transcription initiation, called initiator (Inr), has the consensus sequence (Py) 9CA(Py) 
(two pyrimidines, then CA, then five pyrimidines) located between positions —3 and 
+5, where +1 is the transcription start site. The core region often consists of a TATA 
box (a TATAAA consensus element) indicating the transcription start site; the TATA 
motif is usually located at position —25 (Figure 29.25). Genes that lack a TATA often 
have a downstream promoter element (DPE) centered on the —30 region. Other reg- 
ulatory elements include short nucleotide sequences (sometimes called response 
elements) found near the promoter (the promoter-proximal region) that can bind certain 
specific transcription factors, such as proteins that trigger expression of a related set of 
genes in response to some physiological signal (hormone) or challenge (temperature 
shock). 


Promoters The promoters of eukaryotic genes encoding proteins can be quite com- 
plex and variable, but they typically contain modules of short conserved sequences, 
such as the TATA box, the CAAT box, and the GC box. Sets of such modules embed- 
ded in the upstream region collectively define the promoter. The presence of a CAAT 
box, usually located around —80 relative to the transcription start site, signifies a strong 
promoter. One or more copies of the sequence GGGCGG or its complement (referred 
to as the GC box) have been found upstream from the transcription start sites of 
“housekeeping genes.” Housekeeping genes encode proteins commonly present in all 
cells and essential to normal function; such genes are typically transcribed at more or 
less steady levels. Figure 29.26 depicts the promoter regions of several representative 
eukaryotic genes. Table 29.3 lists transcription factors that bind to respective modules. 
These transcription factors typically behave as positive regulatory proteins essential to 
transcriptional activation by RNA polymerase II at these promoters. 


Enhancers Eukaryotic genes have, in addition to promoters, regulatory sequences 
known as enhancers. Enhancers (also called upstream activation sequences, or UASs) 
assist initiation. Enhancers differ from promoters in two fundamental ways. First, the 
location of enhancers relative to the transcription start site is not fixed. Enhancers may 


Inr 
Adenovirus GGGGOPATAAAAGGGGGTGGGGGCGCGTTCGTCCTCACTC 
late 


Chicken GAGGCHATATATTCCCCAGGGCTCAGCCAGTGTCTGTACA 
ovalbumin 


Mouse -globin GAGCAMAMAMIGGET GAGGTAGGATCAGTTGCTCCTCRCATTT 
major 
Rabbit TTGCCO@MMAMMAcGCAGAGCAGGGCAGCTGCTGCTGCTARCACT 
B-globin A 

+1 


Transcription 
start site 


FIGURE 29.25 The Inr and TATA box in selected eukary- 
otic genes. The consensus sequence of a number of 
such promoters is presented in the lower part of the fig- 
ure, the numbers giving the percent occurrence of vari- 
ous bases at the positions indicated. 
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FIGURE 29.26 Promoter regions of several representa- 
tive eukaryotic genes. (a) The SV40 early genes, the his- 
tone H2B gene, and the thymidine kinase gene. Note 
that these promoters contain different combinations of 
the various modules. In (b), the function of the modules 
within the thymidine kinase gene is shown. 
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be several thousand nucleotides away from the promoter, and they act to enhance tran- 
scription initiation even if positioned downstream from the gene. Second, enhancer se- 
quences are bidirectionalin that they function in either orientation. That is, enhancers 
can be removed and then reinserted in the reverse sequence orientation without im- 
pairing their function. Like promoters, enhancers represent modules of consensus se- 
quence. Enhancers are “promiscuous,” because they stimulate transcription from any 
promoter that happens to be in their vicinity. Nevertheless, enhancer function is depen- 
dent on recognition by a specific transcription factor. A specific transcription factor bound at 
an enhancer element stimulates transcription by interacting with RNA polymerase II 
at a nearby promoter. 























Response Elements Promoter modules in genes responsive to common regulation 
are termed response elements. Examples include the heat shock element (HSE), the 
glucocorticoid response element (GRE), and the metal response element (MRE). 
These various elements are found in the promoter regions of genes whose transcrip- 








TABLE 29.3 | 

Sequence Consensus 

Module Sequence DNA Bound Factor 

TATA box TATAAAA ~10 bp TBP 

CAAT box GGCCAATCT ~22 bp CTF/NFI1 

GC box GGGCGG ~20 bp SP1 

Octamer ATTTGCAT ~20 bp Oct-1 or Oct-2 
KB GGGACTTTCC ~10 bp NF«B or H2 TF1 
ATF GTGACGT ~20 bp ATF 











Adapted from Lewin, B., 1994. Genes V. Cambridge, MA: Cell Press. 
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Response Elements That Identify Genes Coordinately Regulated in Response to Particular Physiological Challenges 
Physiological Response 
Challenge Element Consensus Sequence DNA Bound Factor Size (kD) 
Heat shock HSE CNNGAANNTCCNNG 27 bp HSTF 93 
Glucocorticoid GRE TGGTACAAATGTTCT 20 bp Receptor 94 
Cadmium MRE CGNCCCGGNCNC 
Phorbol ester TRE TGACTCA 22 bp AP1 39 
Serum SRE CCATATTAGG 20 bp SRF 52 








Adapted from Lewin, B., 1994. Genes V. Cambridge, MA: Cell Press. 


tion is activated in response to a sudden increase in temperature (heat shock), glu- 
cocorticoid hormones, or toxic heavy metals, respectively (Table 29.4). HSE se- 
quences are recognized by a specific transcription factor, HSTF (for heat shock 
transcription factor). HSEs are located about 15 bp upstream from the transcription 
start site of a variety of genes whose expression is dramatically enhanced in response 
to elevated temperature. Similarly, the response to steroid hormones depends on the 
presence of a GRE positioned 250 bp upstream of the transcription start point. Acti- 
vation of the steroid receptor (a specific transcription factor) at a GRE occurs when 
certain steroids bind to the steroid receptor. 

Many genes are subject to multiple regulatory influences. Regulation of such 
genes is achieved through the presence of an array of different regulatory ele- 
ments. The metallothionein gene is a good example (Figure 29.27). Metallo- 
thionein is a metal-binding protein that protects cells against metal toxicity by 
binding excess amounts of heavy metals and removing them from the cell. This 
protein is always present at low levels, but its concentration increases in response 
to heavy metal ions such as cadmium or in response to glucocorticoid hormones. 
The metallothionein gene promoter consists of two general promoter elements, 
namely, a TATA box and a GC box, two basal-level enhancers, four MREs, and one 
GRE. These elements function independently of one another; any one is able to 
activate transcription of the gene. 


Transcription Initiation by RNA Polymerase II Requires TBP 
and the GTFs 


A eukaryotic transcription initiation complex consists of RNA polymerase II, five 
general transcription factors (GTFs), and a 20-subunit complex called Mediator (or 
Srb/Med). The CTD of RNA polymerase II anchors Mediator to the polymerase. 
Mediator allows RNA polymerase II to communicate with transcriptional activators 
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CENGAGENOW” ANIMATED FIGURE 29.27 The metallothionein gene possesses several constitutive elements in 
its promoter (the TATA and GC boxes) as well as specific response elements such as MREs and a GRE. The BLEs are 
elements involved in basal level expression (constitutive expression). TRE is a tumor response element activated 
in the presence of tumor-promoting phorbol esters such as TPA (tetradecanoyl phorbol acetate). See this figure 
animated at www.cengage.com/login. 
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Number of 
Factor Subunits Function 
TFIID 
TBP 1 Core promoter recognition (TATA); TFIIB recruitment 
TAFs 14 Core promoter recognition (non-TATA elements); positive and negative regulatory functions; HAT 
(histone acetyltransferase) activity 
TFIIA 3 Stabilization of TBP binding; stabilization of TAF-DNA interactions 
TFIIB 1 RNA polymerase U-TFIUF recruitment; start-site selection by RNA polymerase II 
TFUF 2 Promoter targeting of polymerase I; destabilization of nonspecific RNA polymerase II-DNA interactions 
RNA pol II 12 Enzymatic synthesis of RNA; TFIE recruitment 
TFUE 2 TFIIH recruitment; modulation of TFIIH helicase, ATPase, and kinase activities; promoter melting 
TFA 9 Promoter melting using helicase activity; promoter clearance via CTD phosphorylation (2 subunits of 
TFHII are a cyclin: CDK pair) 








Adapted from Table 1 in Roeder, R. G., 1996. The role of general initiation factors in transcription by RNA polymerase II. Trends in Biochemical Sciences 21:327-335; and Reese, J. C., 2003. Basal 
transcription factors. Current Opinion in Genetics and Development 13:114-118. 


bound at sites distal from the promoter. There are six GTFs (Table 29.5), five of which 
are required for transcription: TFIIB, TFIID, TFITE, TFIIF, and TFIIH. The sixth, 
TFIIA, stimulates transcription by stabilizing the interaction of TFIID with the TATA 
box. TFIID consists of TBP (TATA-binding protein), which directly recognizes the 
TATA box within the core promoter, and a set of TBP-associated factors (TAFs or 
TAFys) .* Some TAFs resemble histones and may form a histone octamerlike structure 
that facilitates TFIID:DNA interactions. The TBP-TAFy complexes serve as a bridge 
between the promoter and RNA polymerase II. Some are capable of recognizing core 
promoters lacking a TATA box. TBP binds to the core promoter through contacts 
made with the minor groove of the DNA, distorting and bending the DNA so that DNA 
sequences upstream and downstream of the TATA box come into closer proximity 
(Figure 29.28a). Once TBP/TFIID is bound at the core promoter, a complex contain- 
ing RNA polymerase II and the remaining GTFs convenes at this site, establishing a 
competent transcription preinitiation complex (Figure 29.28b). An open complex then 
forms, and transcription begins. 


The Role of Mediator in Transcription Activation and Repression 


Transcription activation requires Mediator. Mediator serves as a bridge between gene- 
specific transcription co-activators bound to enhancers and the RNA polymerase 
II/GTF transcription machinery bound at the promoter. Once DNA becomes accessi- 
ble through the action of chromatin remodeling complexes (discussed in the following 
sections), a transcription co-activator binds to an enhancer and recruits Mediator to the 
gene. Mediator then establishes the bridge by promoting the binding of GIFs and RNA 
polymerase II at the promoter. Mediator has been described as the ultimate regulator 
of transcription since it integrates and communicates information from enhancers and 
transcription co-activators to RNA polymerase at the promoter. 

The Mediator complex is about 1 megadalton in mass, with a core structure com- 
posed of about 20 distinct subunits (in yeast; about 30 in humans), the MED proteins. 
Mediator is a somewhat crescent-shaped structure, with a head, middle, and tail. Eight 
MED proteins each are found in the head and middle regions, and four make up the 
tail. The tail section recognizes and binds the transcription co-activator. Both the 
head region and the middle region of Mediator bind RNA polymerase II and interact 
with its CTD, with the middle region also associating with general transcription factor 
TFITE (Figure 29.29a). 

Two of the Mediator subunits are a cyclin-CDK pair (CycC/CDK8), and they act 
to phosphorylate S5 in the CTD YSPTSPS heptapeptide repeat. Mediator is a global 


°For many genes, another transcription factor called SAGA (which also contains TAFys) can serve in- 
stead of TFIID to initiate transcription. 
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FIGURE 29.28 Transcription initiation. (a) Model of the TATA-binding 
protein (TBP) in complex with a DNA TATA sequence (pdb id = 1YTB). 
The saddle-shaped TBP (gold) is unusual in that it binds in the minor 
groove of DNA, sitting on the DNA like a saddle on a horse. TBP 
binding pries open the minor groove, creating a 100° bend in the 
DNA axis and unwinding the DNA within the TATA sequence. The 
other subunits of the TFIID complex (see Table 29.5) sit on TBP like a 
“cowboy on a saddle.” All known eukaryotic genes (those lacking a 
TATA box as well as those transcribed by RNA polymerase | or Ill) rely 
on TBP. (b) Formation of a preinitiation complex at a TATA-containing Preinitiation 
promoter. TFIID bound to the TATA motif recruits RNA polymerase II complex 
and the other GIFs to form the preinitiation complex. Melting of the 

DNA duplex around /nr generates the open complex and transcrip- 

tion ensues. 





regulator essential to transcription of virtually every RNA polymerase II-dependent 
gene. Mediator is even required for basal transcription of these genes. Mediator 
also displays HAT activity, which may aid in exposing promoters and subsequent 
binding of GTFs and RNA polymerase II. Taken together, the results indicate that 
Mediator provides a scaffold for assembly of the pre-initiation complex (PIC), the 


(a) enhancer DNA (b) silencer 






Mediator 





promoter 
promoter 


FIGURE 29.29 Simple models of Mediator in the regulation of eukaryotic gene transcription. (a) Mediator as a 
transcription activator. Mediator regions are highlighted in color: green for the tail, yellow for the middle, and 

red for the head. RNA polymerase II and the GTFs are blue. The transcription co-activator is orange. DNA is 
shown as a black line. (b) Mediator as a repressor. If Mediator interacts with a co-repressor (colored aqua) bound 
to a silencer and then binds the repressive subcomplex (MED12 : MED13: CycC/CDk8), shown in pink here, it fails 
to recruit RNA polymerase I| and the GTFs to the promoter, and expression from the promoter is repressed. 
(Adapted from Figures 1 and 2 in Björklund, S. and Gustafson, C. M., 2005.The yeast Mediator complex and its regulation. Trends 
in Biochemical Sciences 30:240-244.) 
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complex of RNA polymerase II, GTFs, and associated proteins that assemble at the 
promoter just prior to transcription initiation. Mediator apparently acts as the gate- 
way by which RNA polymerase II gains access to the promoter. As a protein kinase, 
Mediator also acts in phosphorylation of the unphosphorylated RNA polymerase 
CTD, thereby prompting its transition into the elongation phase of transcription. 


Mediator as a Repressor of Transcription 


Mediator apparently acts in repression of transcription as well as activation. How can 
Mediator serve two opposing regulatory functions? Perhaps the explanation lies 
in the ability of a Mediator subcomplex (the MED12 and MED13 proteins, plus 
CycC/CDK8) to interact with co-repressor bound to a silencer (see Figure 29.29). 
This Mediator subcomplex is a repressive module that stabilizes Mediator in a con- 
formation that cannot recruit RNA polymerase II to the promoter (Figure 29.29b). 


Chromatin-Remodeling Complexes and Histone-Modifying 
Enzymes Alleviate the Repression Due to Nucleosomes 


The central structural unit of nucleosomes, the histone “core octamer” (see Figure 
11.26), is constructed from the eight histone-fold protein domains of the eight various 
histone monomers comprising the octamer. Successive histone octamers are linked 
via histone H1, which is not part of the octamer. Each histone monomer in the 
core octamer has an unstructured N-terminal tail that extends outside the core oc- 
tamer. Interactions between histone tails contributed by core histones in adjacent 
nucleosomes are an important influence in establishing the higher orders of 
chromatin organization. Activation of eukaryotic transcription is dependent on two 
sets of circumstances: (1) relief from the repression imposed by chromatin struc- 
ture and (2) interaction of RNA polymerase II with the promoter and transcription 
regulatory proteins. Relief from repression requires factors that can reorganize the 
chromatin and then alter the nucleosomes so that promoters become accessible 
to the transcriptional machinery. Two sets of factors are important: chromatin- 
remodeling complexes that mediate ATP-dependent conformational (noncova- 
lent) changes in nucleosome structure and histone-modifying enzymes that intro- 
duce covalent modifications into the N-terminal tails of the histone core octamer. 
Chromatin remodeling and histone modification are closely linked processes. 


Chromatin-Remodeling Complexes Are Nucleic Acid—Stimulated 
Multisubunit ATPases 


Chromatin-remodeling complexes are huge (1 megadalton) assemblies containing 
ATP-dependent enzymes that loosen the DNA: protein interactions in nucleosomes 
by sliding, ejecting, inserting, or otherwise restructuring core octamers. In the 
process, about 50 bp of DNA are “peeled” away from the edge of the nucleosome, 
creating a “bulge” that allows RNA polymerase I, GTFs, and other transcription fac- 
tors to access the DNA. Chromatin-remodeling complexes contain proteins of the 
SNF2 family of DEAD/H-box-containing, nucleic acid—stimulated ATPases. (DEAD 
signifies the Asp-Glu-Ala-Asp tetrapeptide signature sequence of this protein family; 
in some of these proteins, a histidine residue (H) replaces the second D in the box.) 
SNF2 family members include SWI2 types with bromodomains that bind acetylated 
lysines, ISWI types that have separate domains for histone tail and linker DNA bind- 
ing, CHD types with chromodomains that bind methylated lysines, and INO80 types 
with DNA-binding domains. 

During the elongation phase of transcription, RNA polymerase II must pass by 
each nucleosome, and such passage is believed to result in the loss and regain of an 
H2A/H2B dimer from the core octamer. FACT (for facilitates chromatin transcrip- 
tion) is a heterodimer that acts on some nucleosomes to catalyze H2A/H2B removal 
and readdition; its activity is markedly increased by ubiquitination (see Chapter 31) 
of H2B Lys!”°, 
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Covalent Modification of Histones 


Chromatin is also remodeled through the action of enzymes that covalently modify 
side chains on histones within the core octamer. These modifications either dimin- 
ish DNA: histone associations through disruption of electrostatic interactions or in- 
troduce substitutions that can recruit binding of new protein participants through 
protein-protein interactions. 

Initial events in transcriptional activation include acetyl-CoA-dependent acetylation 
of €-amino groups on lysine residues in histone tails by histone acetyltransferases 
(HATs) (Figure 29.30). The histone transacetylases responsible are essential compo- 
nents of several megadalton-size complexes known to be required for transcription co- 
activation (co-activation in the sense that they are required along with RNA polymerase 
II and other components of the transcriptional apparatus). Examples of such com- 
plexes include the TFIID (some of whose TAFys have HAT activity), the SAGA com- 
plex (which also contains TAFys), and the ADA complex. N-Acetylation suppresses the 
positive charge in histone tails, diminishing their interaction with the negatively 
charged DNA. 

Phosphorylation of Ser residues and methylation of Lys residues in histone tails also 
contribute to transcription regulation (Figure 29.30). Attachment of small proteins to 
histone C-terminal lysine residues through ubiquitination and sumoylation (see Chap- 
ter 31) are two additional forms of covalent modification found in nucleosomes. Col- 
lectively, these modifications create binding sites for proteins that modulate chromatin 
structure, such as the chromatin-remodeling complexes with bromodomains that in- 
teract specifically with acetylated lysine residues and chromodomains that bind to 
methylated lysine residues. A “histone code” has emerged. 


Covalent Modification of Histones Forms the Basis of the Histone Code 


A code based on histone-tail covalent modifications determines gene expression 
through selective recruitment of proteins. Proteins that cause chromatin com- 
paction (heterochromatin formation) lead to repression; proteins giving easier ac- 
cess to DNA through relaxation of histone: DNA interactions favor the possibility of 
gene expression. 





E JOR) 


FIGURE 29.30 A schematic diagram of the nucleosome 
illustrating the various covalent modifications on the 
N-terminal tails of histones. acK = acetylated lysine 
residue; meK = methylated lysine residue; mek = methy- 
lated arginine residue; PS = phosphorylated serine 
residue. The numbers indicate the positions of the amino 
acids in the amino acid sequences. Note the prevalence 
of modifiable sites, particularly acetylatable lysines, on the 
N-terminal tails of histones H2B, H3, and H4. 
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The prominent forms of histone covalent modification are lysine acetylation, ly- 
sine methylation, serine phosphorylation, lysine ubiquitination, and lysine sumoy- 
lation. The lysine residue at position 9 (K9) in the histone H3 amino acid sequence 
is methylated in heterochromatin, the compacted, repressed state of chromatin. In 
contrast, lysine 4 (K4) of histone H3 typically is methylated in chromatin where 
gene expression is active. Different proteins are recruited to these two methylated 
forms of histone H3. Methylated K9 recruits heterochromatin protein 1 (HP1), 
which binds via its chromodomain. On the other hand, methylated K4 binds CHD1, 
a chromatin remodeling protein with two chromodomains. (CHD1 is an acronym 
for chromodomain, helicase, DNA-binding.) Ubiquitination of Lys!?? in the 
C-terminal tail of H2B favors methylation (and thus transcription activation), 
whereas ubiquitination of Lys!’ in the C-terminal tail of H2A favors repression. 
Sumoylation of Lys residues tends to repress transcription; apparently, sumolyation 
antagonizes acetylation. 


Methylation and Phosphorylation Act as a Binary Switch 
in the Histone Code 


As cells enter mitosis, the chromatin becomes condensed and histone H3 is not only 
methylated at K9 but also phosphorylated at the adjacent serine residue, S10. 
S10 phosphorylation triggers the dissociation of HP1 from the heterochromatin. 
Thus, phosphorylation of the residue neighboring K9 trumps HP1 binding. Similarly, 
phosphorylation of the threonine residue (T3) neighboring K4 in the histone H3 tail 
evicts CHD1 from its site on the methylated K4. Apparently, lysine methylation is the 
“on” position for the binary switch that recruits specific proteins to histone tails, and 
phosphorylation at a neighboring residue turns the switch to the “off” position by 
ejecting the bound proteins. There are at least 16 instances of serines or threonines 
immediately flanking lysine residues in the four histones that constitute the histone 
core octamer of the nucleosome. The methylation-phosphorylation binary switch may 
be a general phenomenon in the regulation of chromatin dynamics. 


Chromatin Deacetylation Leads to Transcription Repression 


Deacetylation of histones is a biologically relevant matter, and enzyme complexes that 
carry out such reactions have been characterized. Known as histone deacetylase com- 
plexes, or HDACs, they catalyze the removal of acetyl groups from lysine residues 
along the histone tails, restoring the chromatin to a repressed state. Beyond these ef- 
fects on transcription, histone modifications determine whether significant cellular 
events involving DNA allocation through mitosis and meiosis may occur. 


Nucleosome Alteration and Interaction of RNA Polymerase II 
with the Promoter Are Essential Features in Eukaryotic Gene Activation 


Gene activation (the initiation of transcription) can thus be viewed as a process 
requiring two principal steps: (1) alterations in nucleosomes (and thus, chromatin) 
that relieve the general repressed state imposed by chromatin structure, followed by 
(2) the interaction of RNA polymerase II and the GTFs with the promoter. 
Transcription activators (proteins that bind to enhancers and response elements) ini- 
tiate the process by recruiting chromatin-altering proteins (the chromatin-remodeling 
complexes and histone-modifying enzymes described previously). Once these alterations 
have occurred, promoter DNA is accessible to TBP: TFIID, the other GTFs, and RNA 
polymerase II. Transcription activation, however, requires communication between 
RNA polymerase II and the transcription activator for transcription to take place. Me- 
diator (or Srb/Med) fulfills this function. Mediator interacts with both the transcrip- 
tion activator and the CTD of RNA polymerase II. This Mediator bridge provides an 
essential interface for communication between enhancers and promoters, triggering 
RNA polymerase II to begin transcription. A general model for transcription initia- 
tion is shown in Figure 29.31. Once transcription begins, Mediator is replaced by 
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Deacetylase 


another complex called Elongator. Elongator has HAT subunits whose activity remod- 
els downstream nucleosomes as RNA polymerase II progresses along the chromatin- 
associated DNA. 


The interactions described thus far emphasize regulation of gene expression at 
the level of RNA polymerase II recruitment to promoters. However, whole-genome 
analyses show that, for many genes, RNA polymerase II is already situated at pro- 
moters and appears to be paused there, awaiting signals that will activate the elon- 
gation phase of transcription. Thus, the expression of many genes may be regulated 
at the level of transcription elongation. 

Beyond these considerations, various mechanisms regulate gene expression 
through events that take place subsequent to transcription. Post-transcriptional 
gene regulation mediated by microRNAs, such as RNAi (see Chapter 12) and gene 
silencing (see Chapter 10), as well as alternative RNA splicing and nucleotide 
changes introduced through RNA editing (as described in this chapter), are mech- 
anisms targeting transcripts. Post-translational modifications of proteins also play a 
major role in the regulation of gene expression, as assessed at the level of biologi- 
cal activity (see Chapter 30). 


A SINE of the Times 


An interesting twist on transcription regulation comes from the discovery that cer- 
tain noncoding RNAs (ncRNAs) act as transcription factors through direct binding 
to RNA polymerase II. For example, ncRNA B2 in mouse and Alu RNA in humans 
are RNAs encoded within short interdispersed elements (SINEs). SINEs are abun- 
dant within animal DNA and were once considered “junk” DNA because they lack 
protein-coding properties. Ala RNA or ncRNA B2 blocks promoter-bound RNA 
polymerase II by interfering with transcription initiation. 


29:4: How Do Gene Regulatory Proteins Recognize 
Specific DNA Sequences? 


Proteins that recognize nucleic acids do so by the basic rule of macromolecular recog- 
nition. That is, such proteins present a three-dimensional shape or contour that is 
structurally and chemically complementary to the surface of a DNA sequence. When 
the two molecules come into contact, the numerous atomic interactions that underlie 
recognition and binding can take place. Nucleotide sequence-specific recognition 
by the protein involves a set of atomic contacts with the bases and the sugar- 
phosphate backbone. Hydrogen bonding is critical for recognition, with amino acid 
side chains providing most of the critical contacts with DNA. Protein contacts with the 
bases of DNA usually occur within the major groove (but not always). Protein contacts 
with the DNA backbone involve both H bonds and salt bridges with electronegative 
oxygen atoms of the phosphodiester linkages. Structural studies on regulatory proteins 
that bind to specific DNA sequences have revealed that roughly 80% of such proteins 
can be assigned to one of three principal classes based on their possession of one of 
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FIGURE 29.31 A model for the transcriptional regulation 
of eukaryotic genes. The DNA is a green ribbon 
wrapped around disclike nucleosomes. A specific tran- 
scription factor (TF, pink) is bound to a regulatory ele- 
ment (either an enhancer or silencer). RNA polymerase II 
and its associated GTFs (blue) are bound at the pro- 
moter. The N-terminal tails of histones are shown as 
wavy lines (blue) emanating from the nucleosome 
discs. A specific transcription factor that is a transcrip- 
tion activator stimulates transcription through interac- 
tion with a co-activator whose HAT activity renders the 
DNA more accessible and through interactions with the 
Mediator complex associated with RNA polymerase ll. 

A specific transcription factor that is a repressor inter- 
acts with a co-repressor that has HDAC activity that 
deacetylates histones, restructuring the nucleosomes 
into a repressed state. (From Figure 1 in Kornberg, R. D., 1999. 
Eukaryotic transcription control. Trends in Biochemical Sciences 
24:M46-M49.) 
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HUMAN BIOCHEMISTRY 


Storage of Long-Term Memory Depends on Gene Expression Activated 


by CREB-Type Transcription Factors 


Learning can be defined as the process whereby new information is 
acquired and memory as the process by which this information is re- 
tained. Short-term memory (which lasts minutes or hours) requires 
only the covalent modification of preexisting proteins, but long-term 
memory (which lasts days, weeks, or a lifetime) depends on gene ex- 
pression, protein synthesis, and the establishment of new neuronal 
connections. 

The macromolecular synthesis underlying long-term memory 
storage requires cAMP-response element-binding (CREB) protein- 
related transcription factors and the activation of cAMP-dependent 


phosphorylate CREB protein-related transcription factors that 
activate transcription of cAMP-inducible genes. These genes are 
characterized by the presence of CRE (cAMP response element) 
consensus sequences containing the 8-bp TGACGTCA palindrome. 
CREB transcription factors are bZ/P-type proteins (see later discus- 
sion). These exciting findings opened a new arena in molecular 
biology, the molecular biology of cognition. Eric Kandel was awarded 
the 2000 Nobel Prize in Physiology or Medicine for, among other 
things, his discovery of the role of CREB-type transcription factors in 
long-term memory storage. 


gene expression. Serotonin (5-hydroxytryptamine, or 5-HT, a hor- 
mone implicated in learning and memory) acting on neurons pro- 


motes cAMP synthesis, which in turn stimulates protein kinase Ato Cognition is the act or process of knowing; the acquisition of knowledge. 











three kinds of small, distinctive structural motifs: the helix-turn-helix (or HTH), the 
zinc finger (or Zn-finger), and the leucine zipper-basic region (or bZIP). The latter two 
motifs are found only in DNA-binding proteins from eukaryotic organisms. 

In addition to their DNA-binding domains, these proteins commonly possess 
other structural domains that function in protein: protein recognitions essential to 
oligomerization (for example, dimer formation), DNA looping, transcriptional ac- 
tivation, and signal reception (for example, effector binding). 


a-Helices Fit Snugly into the Major Groove of B-DNA 


A recurring structural feature in DNA-binding proteins is the presence of 
a-helical segments that fit directly into the major groove of B-form DNA. The diam- 
eter of an a-helix (including its side chains) is about 1.2 nm. The dimensions of the 
major groove in B-DNA are 1.2 nm wide by 0.6 to 0.8 nm deep. Thus, one side of an 
a-helix can fit snugly into the major groove. Although examples of 6-sheet DNA 
recognition elements in proteins are known, the a-helix and B-form DNA are the 
predominant structures involved in protein:DNA interactions. Significantly, pro- 
teins can recognize specific sites in “normal” B-DNA; the DNA need not assume any 
unusual, alternative conformation (such as Z-DNA). 


Proteins with the Helix-Turn-Helix Motif Use One Helix 
to Recognize DNA 


The HTH motif is a protein structural domain consisting of two successive a-helices 
separated by a sharp 6-turn (Figure 29.32). Within this domain, the a-helix situated 
more toward the C-terminal end of the protein, the so-called helix 3, is the DNA 
recognition helix; it fits nicely into the major groove, with several of its side chains 
touching DNA base pairs. Helix 2, the helix at the beginning of the HTH motif, cre- 
ates a stable structural domain through hydrophobic interactions with helix 3 that 
locks helix 3 into its DNA interface. Proteins with HTH motifs bind to DNA as 
dimers. In the dimer, the two helix 3 cylinders are antiparallel to each other, such 
that their N—>C orientations match the inverted relationship of nucleotide sequence 
in the dyad-symmetric DNA-binding site. An example is Antp. Anip is a member of 
a family of eukaryotic proteins involved in the regulation of early embryonic devel- 
opment that have in common an amino acid sequence element known as the 
homeobox’ domain. The homeobox is a DNA motif that encodes a related 60—amino 
acid sequence (the homeobox domain) found among proteins of virtually every 





FIGURE 29.32 An HTH motif protein: Antp monomer 
bound to DNA. Helix 3 (yellow) is locked into the major 
groove of the DNA by helix 2 (magenta) (pdb id = 9ANT). 


°Homeo derives from homeotic genes, a set of genes originally discovered in the fruit fly Drosophila 
melanogaster through their involvement in the specification of body parts during development. 
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eukaryote, from yeast to man. Embedded within the homeobox domain is an HTH 
motif. Homeobox domain proteins act as sequence-specific transcription factors. 
Typically, the homeobox portion comprises only 10% or so of the protein’s mass, 
with the remainder of the protein serving in protein: protein interactions essential 
to transcription regulation. Other DNA-binding proteins with HTH motifs are lacre- 
pressor, trp repressor, and the C-terminal domain of CAP. 


How Does the Recognition Helix Recognize Its Specific DNA-Binding Site? The 
edges of base pairs in dsDNA present a pattern of hydrogen-bond donor and ac- 
ceptor groups within the major and minor grooves, but only the pattern displayed 
on the major-groove side is distinctive for each of the four base pairs A:T, T:A, C:G, 
and G:C. (You can get an idea of this by inspecting the structures of the base pairs 
in Figure 11.6.) Thus, the base-pair edges in the major groove act as a recognition 
matrix identifiable through H bonding with a specific protein, so it is not necessary 
to melt the base pairs to read the base sequence. Although formation of such 
H bonds is very important in DNA: protein recognition, other interactions also play 
a significant role. For example, the C-5-methyl groups unique to thymine residues 
are nonpolar “knobs” projecting into the major groove. 


Proteins Also Recognize DNA via “Indirect Readout” Indirect readout is the term 
for the ability of a protein to indirectly recognize a particular nucleotide sequence by 
recognizing local conformational variations resulting from the effects that base se- 
quence has on DNA structure. Superficially, the B-form structure of DNA appears to 
be a uniform cylinder. Nevertheless, the conformation of DNA over a short distance 
along its circumference varies subtly according to local base sequence. That is, base 
sequences generate unique contours that proteins can recognize. Because these con- 
tours arise from the base sequence, the DNA-binding protein “indirectly reads out” 
the base sequence through interactions with the DNA backbone. In the E. coli Trp re- 
pressor: trp operator DNA complex, the Trp repressor engages in 30 specific hydro- 
gen bonds to the DNA: 28 involve phosphate groups in the backbone; only 2 are to 
bases. Thus, some sequence-specific DNA-binding proteins are able to recognize an 
overall DNA conformation caused by the specific DNA sequence. 


Some Proteins Bind to DNA via Zn-Finger Motifs 


There are many classes of Zn-finger motifs. The prototype Zn-finger is a structural fea- 
ture formed by a pair of Cys residues separated by 2 residues, then a run of 12 amino 
acids, and finally a pair of His residues separated by 3 residues (Cys-x9-Cys-x19-His-xs- 
His). This motif may be repeated as many as 13 times over the primary structure of a 
Zn-finger protein. Each repeat coordinates a zinc ion via its 2 Cys and 2 His residues 
(Figure 29.33). The 12 or so residues separating the Cys and His coordination sites 


(a) (b) 








FIGURE 29.33 The Zn-finger motif of the C,H, 
type showing (a) the coordination of Cys and His 
residues to Zn and (b) the secondary structure. 
(c) Structure of a classic C,H, zinc finger protein 
(zif268) with three zinc fingers bound to DNA 
(pdb id = 1ZAA). 
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Chelation is from the Greek word chele, meaning 
“claw”; it refers to the binding of a metal ion to 
two or more nonmetallic atoms in the same 
molecule. 






Basic region 
(DNA contact 
surface) 

Leucine zipper 
(dimerization motif) 


FIGURE 29.34 Model for a dimeric bZIP protein. Two 
bZIP polypeptides dimerize to form a Y-shaped mole- 
cule. The stem of the Y is the Leu zipper, and it holds 
the two polypeptides together. Each arm of the Y is the 
basic region from one polypeptide. Each arm is com- 
posed of two a-helical segments: BR-A and BR-B (basic 
regions A and B). 


are looped out and form a distinct DNA interaction module, the so-called Zn-finger. 
When Zn-finger proteins associate with DNA, each Zn-finger binds in the major 
groove and interacts with about five nucleotides, adjacent fingers interacting with 
contiguous stretches of DNA. Many DNA-binding proteins with this motif have been 
identified. In all cases, the finger motif is repeated at least two times, with at least a 
7- to 8-amino acid linker between Cys/Cys and His/His sites. Proteins with this gen- 
eral pattern are assigned to the CH; class of Zn-finger proteins to distinguish them 
from proteins bearing another kind of Zn-finger, the C, type, which includes the C4 
and C, Zn-finger proteins. The C, proteins have a variable number of Cys residues 
available for Zn chelation. For example, the vertebrate steroid receptors have two sets 
of Cys residues, one with four conserved cysteines (C4) and the other with five (C5). 


Some DNA-Binding Proteins Use a Basic Region-Leucine 
Zipper (bZIP) Motif 


bZIP is a structural motif characterizing the third major class of sequence-specific, 
DNA-binding proteins. This motif was first recognized by Steve McKnight in C/EBP, a 
heat-stable, DNA-binding protein isolated from rat liver nuclei that binds to both 
CCAAT promoter elements and certain enhancer core elements.’ The DNA-binding 
domain of C/EBP was localized to the C-terminal region of the protein. This region 
shows a notable absence of Pro residues, suggesting it might be arrayed in an a-helix. 
Within this region are two clusters of basic residues: A and B. Further along is a 
28-residue sequence. When this latter region is displayed end-to-end down the axis of 
a hypothetical a-helix, beginning at Leu*!®, an amphipathic cylinder is generated, sim- 
ilar to the one shown in Figure 6.22. One side of this amphipathic helix consists prin- 
cipally of hydrophobic residues (particularly leucines), whereas the other side has an 
array of negatively and positively charged side chains (Asp, Glu, Arg, and Lys), as well 
as many uncharged polar side chains (glutamines, threonines, and serines). 


The Zipper Motif of bZIP Proteins Operates Through Intersubunit 
Interaction of Leucine Side Chains 


The leucine zipper motif arises from the periodic repetition of leucine residues within 
this helical region. The periodicity causes the Leu side chains to protrude from the 
same side of the helical cylinder, where they can enter into hydrophobic interactions 
with a similar set of Leu side chains extending from a matching helix in a second 
polypeptide. These hydrophobic interactions establish a stable noncovalent linkage, 
fostering dimerization of the two polypeptides (as shown in Figure 29.34). The 
leucine zipper is not a DNA-binding domain. Instead, it functions in protein dimer- 
ization. Leucine zippers have been found in other mammalian transcriptional regu- 
latory proteins, including Myc, Fos, and Jun. 


The Basic Region of bZ/P Proteins Provides the DNA-Binding Motif 


The actual DNA contact surface of bZIP proteins is contributed by a 16-residue seg- 
ment that ends exactly 7 residues before the first Leu residue of the Leu zipper. 
This DNA contact region is rich in basic residues and hence is referred to as the 
basic region. Two bZIP polypeptides join via a Leu zipper to form a ¥shaped mol- 
ecule in which the stem of the Y corresponds to a coiled pair of a-helices held by 
the leucine zipper. The arms of the Y are the respective basic regions of each 
polypeptide; they act as a linked set of DNA contact surfaces (Figure 29.34). The 
dimer interacts with a DNA target site by situating the fork of the Y at the center 
of the dyad-symmetric DNA sequence. The two arms of the Y can then track along 
the major groove of the DNA in opposite directions, reading the specific recogni- 
tion sequence (Figure 29.35). An interesting aspect of bZIP proteins is that the two 
polypeptides need not be identical (Figure 29.35). Heterodimers can form, pro- 
vided both polypeptides possess a leucine zipper region. An important conse- 


"The acronym C/EBP designates this protein as a “CCAAT and enhancer-binding protein.” 
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quence of heterodimer formation is that the DNA target site need not be a palin- 
dromic sequence. The respective basic regions of the two different bZIP polypep- 
tides (for example, Fos and Jun) can track along the major groove reading two dif- 
ferent base sequences. Heterodimer formation expands enormously the DNA 
recognition and regulatory possibilities of this set of proteins. 


29:55 How Are Eukaryotic Transcripts Processed 
and Delivered to the Ribosomes for Translation? 


Transcription and translation are concomitant processes in prokaryotes, but in 
eukaryotes, the two processes are spatially separated (see Chapter 10). Transcrip- 
tion occurs on DNA in the nucleus, and translation occurs on ribosomes in the cytoplasm. 
Consequently, transcripts must be transported from the nucleus to the cytosol to 
be translated. On the way, these transcripts undergo processing: alterations that 
convert the newly synthesized RNAs, or primary transcripts, into mature messen- 
ger RNAs. Also, unlike prokaryotes, in which many mRNAs encode more than 
one polypeptide (that is, they are polycistronic), eukaryotic mRNAs encode only 
one polypeptide (that is, they are exclusively monocistronic). 


Eukaryotic Genes Are Split Genes 


Most genes in higher eukaryotes are split into coding regions, called exons,’ and 
noncoding regions, called introns (Figure 29.36; see also Figure 10.20). Introns are 
the intervening nucleotide sequences that are removed from the primary tran- 
script when it is processed into a mature RNA. Gene expression in eukaryotes en- 
tails not only transcription but also the processing of primary transcripts to yield the 
mature RNA molecules we classify as mRNAs, tRNAs, rRNAs, and so forth. 


The Organization of Exons and Introns in Split Genes Is Both 
Diverse and Conserved 


Split genes occur in an incredible variety of interruptions and sizes. The yeast actin 
gene is a simple example, having only a single 309-bp intron that separates the nu- 
cleotides encoding the first 3 amino acids from those encoding the remaining 350 or 
so amino acids in the protein. The chicken ovalbumin gene is composed of 8 exons 


SAlthough the term exon is commonly used to refer to the protein-coding regions of an interrupted or 
split gene, a more precise definition would specify exons as sequences that are represented in mature 
RNA molecules. This definition encompasses not only protein-coding genes but also the genes for var- 
ious RNAs (such as tRNAs or rRNAs) from which intervening sequences must be excised in order to 
generate the mature gene product. 





FIGURE 29.35 Model for the heterodimeric bZIP tran- 
scription factor c-Fos: c-Jun bound to a DNA oligomer 
containing the AP-1 consensus target sequence 
TGACTCA (pdb id = 1FOS). 
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FIGURE 29.36 The organization of split eukaryotic genes. 
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FIGURE 29.37 The organization of the mammalian 
DHFR gene in three representative species. Note that 
the exons are much shorter than the introns. Note also 
that the exon pattern is more highly conserved than 
the intron pattern. 
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and 7 introns. The two vitellogenin genes of the African clawed toad Xenopus laevis are 
both spread over more than 21 kbp of DNA; their primary transcripts consist of just 
6 kb of message that is punctuated by 33 introns. The chicken pro a-2 collagen gene 
has a length of about 40 kbp; the coding regions constitute only 5 kb distributed over 
51 exons within the primary transcript. The exons are quite small, ranging from 45 to 
249 bases in size. 

Clearly, the mechanism by which introns are removed and multiple exons are 
spliced together to generate a continuous, translatable mRNA must be both precise 
and complex. If one base too many or too few is excised during splicing, the coding 
sequence in the mRNA will be disrupted. The mammalian DHFR (dihydrofolate 
reductase) gene is split into 6 exons spread over more than 31 kbp of DNA. The 
6 exons are spliced together to give a 6-kb mRNA (Figure 29.37). Note that, in three 
different mammalian species, the size and position of the exons are essentially the 
same but that the lengths of the corresponding introns vary considerably. Indeed, 
the lengths of introns in vertebrate genes range from a minimum of about 60 bases 
to more than 10,000 bp. Many introns have nonsense codons in all three reading 
frames and thus are untranslatable. Introns are found in the genes of mitochondria 
and chloroplasts as well as in nuclear genes. Although introns have been observed in 
archaea and even bacteriophage T4, none are known in the genomes of bacteria. 


Post-Transcriptional Processing of Messenger RNA Precursors Involves 
Capping, Methylation, Polyadenylylation, and Splicing 


Capping and Methylation of Eukaryotic mRNAs The protein-coding genes of 
eukaryotes are transcribed by RNA polymerase II to form primary transcripts or 
pre-mRNAs that serve as precursors to mRNA. As a population, these RNA 
molecules are very large and their nucleotide sequences are very heterogeneous 
because they represent the transcripts of many different genes, hence the des- 
ignation heterogeneous nuclear RNA, or hnRNA. Shortly after transcription of 
hnRNA is initiated, the 5'-end of the growing transcript is capped by addition of 
a guanylyl residue. This reaction is catalyzed by the nuclear enzyme guanylyl 
transferase using GTP as substrate (Figure 29.38). The cap structure is methyl- 
ated at the 7-position of the G residue. Additional methylations may occur at the 
2'-O positions of the two nucleosides following the 7-methyl-G cap and at the 
6-amino group of a first base adenine (Figure 29.39). 


3’-Polyadenylylation of Eukaryotic mRNAs Transcription by RNA polymerase II 
typically continues past the 3’-end of the mature messenger RNA. Primary tran- 
scripts show heterogeneity in sequence at their 3’-ends, indicating that the precise 
point where termination occurs is nonspecific. However, termination does not nor- 
mally occur until RNA polymerase II has transcribed past a consensus AAUAAA se- 
quence known as the polyadenylylation signal. 

Most eukaryotic mRNAs have 100 to 200 adenine residues attached at their 
3’-end, the poly(A) tail. [Histone mRNAs are the only common mRNAs that lack 
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FIGURE 29.38 The capping of eukaryotic pre-mRNAs. Guanylyl transferase catalyzes the addition of a guanylyl 
residue (G,) derived from GTP to the 5’-end of the growing transcript, which has a 5’-triphosphate group already 
there. In the process, pyrophosphate (pp) is liberated from GTP and the terminal phosphate (p) is removed from 
the transcript.Gppp + pppApNpNpNp... > GpppApNpNpNp... + pp + p (Ais often the initial nucleotide in the 
primary transcript). 





FIGURE 29.39 Methylation of several specific sites located at the 5’-end of eukaryotic pre-mRNASs is 
an essential step in mRNA maturation. A cap bearing only a single —CH; on the guanyl is termed 
cap 0. This methylation occurs in all eukaryotic mRNAs. If a methyl is also added to the 2’-O position 
of the first nucleotide after the cap,a cap 1 structure is generated. This is the predominant cap form 
in all multicellular eukaryotes. Some species add a third —CH; to the 2’-O position of the second 
nucleotide after the cap, giving a cap 2 structure. Also, if the first base after the cap is an adenine, it 
may be methylated on its 6-NH). In addition, approximately 0.1% of the adenine bases throughout : 
the mRNA of higher eukaryotes carry methylation on their 6-NH; groups. etc. 





poly(A) tails.] These A residues are not encoded in the DNA but are added post- 
transcriptionally by the enzyme poly(A) polymerase, using ATP as a substrate. The 
consensus AAUAAA is not itself the poly(A) addition site; instead it defines the po- 
sition where poly(A) addition occurs (Figure 29.40). The consensus AAUAAA is 
found 10 to 35 nucleotides upstream from where the nascent primary transcript is 
cleaved by an endonuclease to generate a new 3’-OH end. This end is where the 
poly(A) tail is added. The processing events of mRNA capping, poly(A) addition, 
and splicing of the primary transcript create the mature mRNA. Interestingly, both 
the guanylyl transferase that adds the 5'-cap structure and the enzymes that process 
the 3’-end of the transcript and add the poly (A) tail are anchored to RNA poly- 
merase II via interactions with its RPB1 CTD. 


Nuclear Pre-mRNA Splicing 


Within the nucleus, hnRNA forms ribonucleoprotein particles (RNPs) through as- 
sociation with a characteristic set of nuclear proteins. These proteins interact with 
the nascent RNA chain as it is synthesized, maintaining the hnRNA in an untangled, 
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FIGURE 29.41 Consensus sequences at the splice sites 
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accessible conformation. The substrate for splicing, that is, intron excision and 
exon ligation, is the capped primary transcript emerging from the RNA polymerase 
II transcriptional apparatus, in the form of an RNP complex. Splicing occurs ex- 
clusively in the nucleus. The mature mRNA that results is then exported to the 
cytoplasm to be translated. Splicing requires precise cleavage at the 5'- and 3’-ends 
of introns and the accurate joining of the two ends. Consensus sequences define the 
exon/intron junctions in eukaryotic mRNA precursors, as indicated from an analy- 
sis of the splice sites in vertebrate genes (Figure 29.41). Note that the sequences GU 
and AG are found at the 5’- and 3’-ends, respectively, of introns in pre-mRNAs from 
higher eukaryotes. In addition to the splice junctions, a conserved sequence within 
the intron, the branch site, is also essential to pre-mRNA splicing. The site lies 18 to 
40 nucleotides upstream from the 3’-splice site and is represented in higher eu- 
karyotes by the consensus sequence YNYRAY, where Y is any pyrimidine, R is any 
purine, and N is any nucleotide. 


The Splicing Reaction Proceeds via Formation of a Lariat Intermediate 


The mechanism for splicing nuclear mRNA precursors is shown in Figure 29.42. 
A covalently closed loop of RNA, the lariat, is formed by attachment of the 
5'-phosphate group of the intron’s invariant 5'-G to the 2’-OH at the invariant 
branch site A to form a 2'-5' phosphodiester bond. Note that lariat formation creates 
an unusual branched nucleic acid. The lariat structure is excised when the 3’-OH of 
the consensus G at the 3’-end of the 5’ exon (Exon 1, Figure 29.42) covalently joins 
with the 5’-phosphate at the 5'-end of the 3’ exon (Exon 2). The reactions that 
occur are transesterification reactions where an OH group reacts with a phospho- 
ester bond, displacing an —OH to form a new phosphoester link. Because the reac- 
tions lead to no net change in the number of phosphodiester linkages, no energy in- 
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FIGURE 29.42 Splicing of mRNA precur- 
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put (for example, as ATP) is needed. The lariat product is unstable; the 2’-5' phos- 
phodiester branch is quickly cleaved to give a linear excised intron that is rapidly de- 
graded in the nucleus. 


Splicing Depends on snRNPs 


The hnRNA (pre-mRNA) substrate is not the only RNP complex involved in the splic- 
ing process. Splicing also depends on a unique set of small nuclear ribonucleoprotein 
particles, so-called snRNPs (pronounced “snurps”). In higher eukaryotes, each 
snRNP consists of a small RNA molecule 100 to 200 nucleotides long and a set of 
about 10 different proteins. Some of the different proteins form a “core” set common 
to all snRNPs, whereas others are unique to a specific snRNP. The major snRNP 
species are very abundant, present at greater than 100,000 copies per nucleus. The 
RNAs of snRNPs are typically rich in uridine, hence the classification of particular 
snRNPs as U1, U2, and so on. The prominent snRNPs are given in Table 29.6. U1 
snRNA folds into a secondary structure that leaves the 11 nucleotides at its 5’-end 
single-stranded. The 5'-end of U1 snRNA is complementary to the consensus sequence 
at the 5’-splice junction of the pre-mRNA (Figure 29.43), as is a region at the 5’-end of 
U6 snRNA. U2 snRNA is complementary to the consensus branch site sequence. 


snRNPs Form the Spliceosome 


Splicing occurs when the various snRNPs come together with the pre-mRNA to 
form a multicomponent complex called the spliceosome. The spliceosome is a 
large complex, roughly equivalent to a ribosome in size, and its assembly requires 
ATP. Assembly of the spliceosome begins with the binding of Ul snRNP at the 
5'-splice site of the pre-mRNA (Figure 29.44). Each subsequent step in spliceosome 
assembly requires ATP-dependent RNA rearrangements catalyzed by spliceosomal 
DEAD-box ATPases/helicases. The branch-point sequence (UACUAAC in yeast) 
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FIGURE 29.43 Mammalian U1 snRNA can be arranged in 
a secondary structure where its 5’-end is single-stranded 
and can base-pair with the consensus 5’-splice site of 
the intron. (Adapted from Figure 1 in Rosbash, M., and Seraphin, 
B., 1991.Who's on first? The U1 snRNP-5’ splice site interaction 
and splicing. Trends in Biochemical Sciences 16:187.) 




















FIGURE 29.44 Events in spliceosome assembly. U1 
snRNP binds at the 5’-splice site, followed by the associ- 
ation of U2 snRNP with the UACUAA*C branch-point 
sequence. The triple U4/U6-U5 snRNP complex replaces 
U1 at the 5’-splice site and directs the juxtaposition of 
the branch-point sequence with the 5’-splice site, 
whereupon U4 snRNP is released. Lariat formation oc- 
curs, freeing the 3’-end of the 5’-exon to join with the 
5’-end of the 3’-exon, followed by exon ligation. U2, U5, 
and U6 snRNPs dissociate from the lariat following exon 
ligation. 
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Intron 3' exon 


binds U2 snRNP, and then the triple snRNP complex of U4/U6-U5 replaces U1 at 
the 5'-splice site. The substitution of base-pairing interactions between U1 and the 
pre-mRNA 5’-splice site by base-pairing between U6 and the 5’-splice site is just one 
of the many RNA rearrangements that accompany the splicing reaction. Base- 
pairings between U6 and U2 RNA bring the 5’-splice site and the branch point RNA 
sequences into proximity. Interactions between U2 and U6 lead to release of U4 
snRNP. The spliceosome is now activated for catalysis: A transesterification reaction 
involving the 2’-O of the invariant A residue in the branch-point sequence displaces 
the 5’-exon from the intron, creating the lariat intermediate. The free 3’-O of the 
5'-exon now triggers a second transesterification reaction through attack on the 
P atom at the 3’-exon splice site. This second reaction joins the two exons and re- 
leases the intron as a lariat structure. In addition to the snRNPs, a number of pro- 
teins with RNA-annealing functions as well as proteins with ATP-dependent RNA- 
unwinding activity participate in spliceosome function. The spliceosome is thus a 
dynamic structure that uses the pre-mRNA as a template for assembly, carries out 
its transesterification reactions, and then disassembles when the splicing reaction 
is over. 


Alternative RNA Splicing Creates Protein Isoforms 


In one mode of splicing, every intron is removed and every exon is incorporated 
into the mature RNA without exception. This type of splicing, termed constitutive 
splicing, results in a single form of mature mRNA from the primary transcript. 
However, many eukaryotic genes can give rise to multiple forms of mature RNA 
transcripts. The mechanisms for production of multiple transcripts from a single 
gene include use of different promoters, selection of different polyadenylation 
sites, alternative splicing of the primary transcript, or even a combination of the 
three. 

Different transcripts from a single gene make possible a set of related polypep- 
tides, or protein isoforms, each with slightly altered functional capability. Such vari- 
ation serves as a useful mechanism for increasing the apparent coding capacity of 
the genome. Furthermore, alternative splicing offers another level at which regula- 
tion of gene expression can operate. For example, mRNAs unique to particular 
cells, tissues, or developmental stages could be formed from a single gene by choos- 
ing different 5’- or 3’-splice sites or by omitting entire exons. Translation of these 
mature mRNAs produces cell-specific protein isoforms that display properties tai- 
lored to the needs of the particular cell. Such regulated expression of distinct pro- 
tein isoforms is a fundamental characteristic of eukaryotic cell differentiation and 
development. 
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Fast Skeletal Muscle Troponin T Isoforms Are an Example 
of Alternative Splicing 


In addition to many other instances, alternative splicing is a prevalent mechanism 
for generating protein isoforms from the genes encoding muscle proteins (see 
Chapter 16), allowing distinctive isoforms aptly suited to the function of each mus- 
cle. An impressive manifestation of alternative splicing is seen in the expression 
possibilities for the rat fast skeletal muscle troponin T gene (Figure 29.45). This 
gene consists of 18 exons, 11 of which are found in all mature mRNAs (exons 1 
through 3, 9 through 15, and 18) and thus are constitutive. Five exons, those num- 
bered 4 through 8, are combinatorial in that they may be individually included or 
excluded, in any combination, in the mature mRNA. Two exons, 16 and 17, are 
mutually exclusive: One or the other is always present, but never both. Sixty-four 
different mature mRNAs can be generated from the primary transcript of this gene 
by alternative splicing. Because each exon represents a cassette of genetic infor- 
mation encoding a segment of protein, alternative splicing is a versatile way to in- 
troduce functional variation within a common protein theme. 


RNA Editing: Another Mechanism That Increases the Diversity 
of Genomic Information 


RNA editing is a process that changes one or more nucleotides in an RNA transcript 
by deaminating a base, either AI (adenine to inosine, through deamination at the 
6-position in a purine ring) or C—U (cytosine to uracil, through deamination at 
the 4-position in a pyrimidine ring). These changes alter the coding possibilities 
in a transcript, because I will pair with G (not U as A does) and U will pair with A 
(not G as C does). RNA editing has the potential to increase protein diversity by 
(1) altering amino acid coding possibilities, (2) introducing premature stop codons, 
or (3) changing splice sites in a transcript. If RNA splicing is cutting-and-pasting, 
then these single-base changes are aptly termed RNA editing. 

A-to-I editing is carried out by adenosine deaminases that act on RNA (the ADAR 
family of RNA-editing enzymes). ADARs act only on double-stranded regions of 
RNA. Typically, such regions form when an exon region containing an A to be 
edited base-pairs with a complementary base sequence in an intron known as the 
editing site complementary sequence, or ECS. ADARs are abundant in the nervous system 
of animals. A prominent example of RNA editing occurs in transcripts encoding 
mammalian glutamate receptors (GluRs; see Chapter 32). Deamination of the 
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FIGURE 29.45 Organization of the fast skeletal muscle troponin T gene and the 64 possible mRNAs that can be 
generated from it. Exons are constitutive (yellow), combinatorial (green), or mutually exclusive (blue or orange). 
Exon 1 is composed of 5’-untranslated (UT) sequences, and Exon 18 includes the polyadenylation site (AATAAA) 
and 3’-UT sequences. The TATA box indicates the transcription start site. The amino acid residues encoded by 
each exon are indicated below. Many exon: intron junctions fall between codons. The “sawtooth” exon bound- 
aries indicate that the splice site falls between the first and second nucleotides of a codon, the “concave/convex" 
exon boundaries indicate that the splice site falls between the second and third nucleotides of a codon, and flush 
boundaries between codons signify that the splice site falls between intact codons. Each mRNA includes all con- 
stitutive exons, 1 of the 32 possible combinations of Exons 4 to 8, and either Exon 16 or 17. 
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GluR-B gene transcript changes a glutamine codon CAG to CIG, which is read by 
the translational machinery as an arginine codon (CGG), dramatically altering the 
conductance properties of the membrane receptor produced from the edited tran- 
script, as compared to the receptor produced from the unedited transcript. 

C-to-I editing is carried out on single-stranded regions of transcripts by an edito- 
some core structure consisting of a cytosine deaminase and an adapter protein that 
brings the deaminase and the transcript together. A prominent example of C-to-I 
editing targets a single C residue in a 14-kb transcript encoding the 4536-residue 
apolipoprotein B100 protein (see Chapter 24). ApoB RNA editing changes codon 
2153 (a CAA [glutamine] codon) to a UAA stop codon, which leads to a shortened 
protein product, ApoB48, consisting of the N-terminal 48% of apoB100. In humans, 
apoB100 is made in the liver and found in liver-derived VLDL serum lipoprotein 
complexes. In contrast, apoB48 is made in intestinal cells and found in intestinal-de- 
rived lipid complexes. 


29:6: Can We Propose a Unified Theory of Gene Expression? 


The stages of eukaryotic gene expression—from transcriptional activation, tran- 
scription, transcript processing, nuclear export of mRNA, to translation—have tra- 
ditionally been presented as a linear series of events, that is, as a pathway of discrete, 
independent steps. However, it now is clear that each stage is part of a continuous 
process, with physical and functional connections between the various transcrip- 
tional and processing machineries. This realization led George Orphanides and 
Danny Reinberg to propose a “unified theory of gene expression.” The principal 
tenet of this theory is that eukaryotic gene expression is a continuous process, from 
transcription through processing and protein synthesis: DNA->RNA-— protein (Fig- 
ure 29.46). Furthermore, regulation occurs at multiple levels in this continuous 
process, in a coordinated fashion. Tom Maniatis and Robin Reed provide additional 
support for this theory by pointing out that eukaryotic gene expression depends on 
an interacting network of multicomponent protein machines—nucleosomes, HATs, 
and the remodeling apparatus; RNA polymerase II and its associated factors, which 
include capping, splicing, and polyadenylylation enzymes; and the proteins involved 
in mRNA export to the cytoplasm for translation on ribosomes, the topic of the next 
chapter. Translation is not inevitable. Ifa noncoding RNA base-pairs with the mRNA, 
translation will be thwarted. For example, base pairing with a microRNA will result 
in gene silencing (see Chapter 10). Base pairing with a small interfering RNA 
(siRNA, Chapter 10) leads to gene knockdown by RNAi through mRNA destruction 
by the RISC protein complex (see Figure 12.20). Recall that gene silencing is a post- 
transcriptional regulatory mechanism that prevents translation of an mRNA, 
whereas RNAi carries out post-transcriptional destruction of mRNAs. 

It should be mentioned that eukaryotic cells have elaborate systems for mRNA 
surveillance; these systems destroy any messages containing errors, such as the 
nonsense mediated decay (NMD) system. NMD degrades any message that has 
premature nonsense, or “stop,” codons. Furthermore, the regulation of mRNA 
levels through selective destruction provides another mechanism for the post- 
transcriptional regulation of gene expression. 


RNA Degradation 


The amount of specific mRNAs or proteins present in a cell at any time represents a 
balance between the rates of macromolecular synthesis and degradation. Regulated 
degradation of mRNAs (discussed here) and proteins (discussed in Chapter 31) is a 
rapid and effective way to control the cellular levels of these macromolecules. Be- 
cause indiscriminate degradation of RNAs and proteins could have detrimental con- 
sequences within the cell, such degradation typically is compartmentalized. Targeted 
degradation of RNAs and proteins is enclosed within ringlike or cylindrical macro- 
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FIGURE 29.46 A unified theory of gene expression. Each step in gene expression, from transcription to transla- 
tion, is but a stage within a continuous process. Each stage is physically and functionally connected to the 
next, ensuring that all steps proceed in an appropriate fashion and overall regulation of gene expression is 
tightly integrated. (Adapted from Figure 2 in Orphanides, G., and Reinberg, D., 2002. A unified theory of gene expression. Cell 
108:439-451,) 


molecular complexes—the exosome for RNA and the proteasome for proteins (see 
Chapter 31). The catalytically active component of an exosome is an RNase PH 
family member that processively degrades RNA in the 3'-5’ direction. 

Exosomes have a fundamental structural pattern: a ring of six subunits sur- 
rounding a central cavity, with one or more of the subunits having RNase PH ac- 
tivity (Figure 29.47). RNAs to be degraded are threaded into the central cavity. The 
architecture of the exosome restricts substrate access and compartmentalizes the 
RNase activity so that indiscriminate degradation of cellular RNase is avoided. Only 
RNAs targeted to the exosome are destroyed. 
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FIGURE 29.47 Structure of the human exosome core 
(pdb id = 2NN6). The exosome core is composed of 
nine different polypeptide chains. A hexameric ring of 
polypeptides (each a different color in this image) sur- 
rounds a central cavity. This cavity is capped by a set of 
three other proteins (all colored hot pink here). The hu- 
man exosome core is catalytic inactive, but it serves as a 
platform for the association of additional subunits that 
have 3'-exonuclease activity. Evidence suggests that the 
human exosome core selects RNAs for degradation, and 
associated 3’-exonucleases then degrade them. 








SUMMARY 


29.1 How Are Genes Transcribed in Prokaryotes? In prokaryotes, vir- 
tually all RNA synthesis is carried out by a single species of DNA- 
dependent RNA polymerase. RNA polymerase links ribonucleoside 
5’-triphosphates in an order specified by base pairing with a DNA tem- 
plate. The enzyme reads along a DNA strand in the 3'-5’ direction, 
joining the 5’-phosphate of an incoming ribonucleotide to the 3’-OH of 
the previous residue, so the RNA chain grows 5'—>3' during transcription. 
Transcription begins when the o-subunit of RNA polymerase recognizes 
a promoter and forms a complex with it. Next, the RNA polymerase 
holoenzyme unwinds about 14 base pairs of DNA, and transcription com- 
mences. Once an oligonucleotide 9 to 12 residues long has been formed, 
the o-subunit dissociates. The core RNA polymerase is highly processive 
and goes on to synthesize the remainder of the mRNA. Prokaryotes have 
two types of transcription termination mechanisms: one that is depen- 
dent on p termination factor protein and another that depends on disso- 
ciation of the mRNA through reestablishment of DNA base pairs. 


29.2 How Is Transcription Regulated in Prokaryotes? Bacterial genes 
encoding a common metabolic pathway are often grouped adjacent to 
one another in an operon, allowing all of the genes to be expressed in 
a coordinated fashion. The operator, a regulatory sequence adjacent to 
the structural genes, determines whether transcription takes place. The 
operator is located next to a promoter. Interaction of a regulatory pro- 
tein with the operator controls transcription. Small molecules act as sig- 
nals of the nutritional or environmental conditions. These small mole- 
cules interact with operator-binding regulatory proteins and determine 
whether transcription occurs. Induction is the increased synthesis of en- 
zymes in response to a small molecule called a co-inducer. Repression is 
the decreased transcription in response to a specific metabolite termed 
a co-repressor. The lac operon provides an example of induction, and 
the trp operon, repression. Operon regulation depends on the interac- 
tion of sequence-specific DNA-binding proteins with regulatory se- 
quences along the DNA. DNA looping increases the regulatory input 
available to a specific gene. 


29.3 How Are Genes Transcribed in Eukaryotes? Transcription is 
more complicated in eukaryotes because eukaryotic DNA is wrapped 
around histones to form nucleosomes, and nucleosomes repress gene 
expression by limiting access of the transcriptional apparatus to genes. 
Two classes of transcriptional co-regulators are necessary to overcome 
nucleosome repression: (1) histone-modifying enzymes, such as HATs, 
and (2) ATP-dependent chromatin-remodeling complexes. Gene acti- 
vation also requires interaction of RNA polymerase with the promoter. 
RNA polymerase II consists of 12 different polypeptides. The largest, 
RPB1, has a C-terminal domain (CTD) containing multiple repeats of 


the heptapeptide sequence YSPTSPS; 5 of these 7 residues can be phos- 
phorylated by protein kinases. The CTD orchestrates events in the tran- 
scription process. RNA polymerase II promoters commonly consist of 
the core element, near the transcription start site, where general tran- 
scription factors bind, and more distantly located regulatory elements, 
known as enhancers or silencers. These regulatory sequences are rec- 
ognized by specific DNA-binding proteins that activate transcription 
above basal levels. A eukaryotic transcription initiation complex consists 
of RNA polymerase II, five general transcription factors, and Mediator. 
The CTD of RNA polymerase II anchors Mediator to the polymerase 
and allows RNA polymerase II to communicate with transcriptional 
activators bound at sites distal from the promoter. 


29.4 How Do Gene Regulatory Proteins Recognize Specific DNA 
Sequences? Proteins that recognize nucleic acids present a three- 
dimensional shape or contour that is structurally and chemically comple- 
mentary to the surface of a DNA sequence. Nucleotide sequence— 
specific recognition by the protein involves a set of atomic contacts with 
the bases and the sugar—-phosphate backbone. Protein contacts with the 
bases of DNA usually occur within the major groove (but not always). 
Roughly 80% of DNA-binding regulatory proteins fall into one of three 
principal classes based on distinctive structural motifs: the helix-turn- 
helix (or HTH), the zinc finger (or Zn-finger), and the leucine zipper- 
basic region (or bZIP). In addition to their DNA-binding domains, these 
proteins also have protein:protein recognition domains essential to 
oligomerization, DNA looping, transcriptional activation, and signal re- 
ception (effector binding). 


29.5 How Are Eukaryotic Transcripts Processed and Delivered to the 
Ribosomes for Translation? In eukaryotes, primary transcripts must be 
processed to form mature messenger RNAs and exported from the nu- 
cleus to the cytosol for translation. Shortly after transcription initiation, 
the 5’-end of the growing transcript is capped with a guanylyl residue that 
is then methylated at the 7-position. Additional methylations may occur at 
the 2’-O positions of the next two nucleotides and at the 6-amino group 
of a first adenine. Transcription termination does not normally occur 
until RNA polymerase II has transcribed past the polyadenylation signal. 
Most eukaryotic mRNAs have a poly(A) tails consisting of 100 to 200 ade- 
nine residues at their 3'-end, added post-transcriptionally by poly(A) 
polymerase. Most eukaryotic genes are split genes, subdivided into cod- 
ing regions, called exons, and noncoding regions, called introns. Intron 
excision and exon ligation, a process called splicing, also occurs in the 
nucleus. Splicing is mediated by the spliceosome, which is assembled 
from a set of small nuclear ribonucleoprotein particles called snRNPs. 
Splicing requires precise cleavage at the 5'- and 3’-ends of introns and 


the accurate joining of the two exons. Exon/intron junctions are defined 
by consensus sequences recognized by the spliceosome. In addition, a con- 
served sequence within the intron, the branch site, is also essential to splic- 
ing. The splicing reaction involves formation of a lariat intermediate 
through attachment of the 5'-phosphate group of the intron’s invariant 
5'-G to the 2’-OH at the invariant branch site A to form a 2'-5' phos- 
phodiester bond. The lariat structure is excised when the exons are 
ligated. In constitutive splicing, every intron is removed and every exon 
is incorporated into the mature RNA without exception. However, alter- 
native splicing can give rise to different transcripts from a single gene, 
making possible a set of protein isoforms, each with slightly altered func- 
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tional capability. Fast skeletal muscle troponin T isoforms are an exam- 
ple of alternative splicing. 


29.6 Can We Propose a Unified Theory of Gene Expression? Each 
stage in eukaryotic transcription is part of a continuous process, with 
physical and functional connections between the various transcriptional 
and processing machineries. These multicomponent protein machines 
are organized into an interacting network, and regulation occurs in a 
coordinated fashion at multiple levels in the continuous process. Eu- 
karyotic cells also have elaborate systems for mRNA surveillance. Not all 
protein-coding transcripts are translated. Gene silencing or RNAi may 
intervene to prevent translation of mature RNAs. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 


1. The 5’-end of an mRNA has the sequence 


. .AGAUCCGUAUGGCGAUCUCGACGAAGACUC- 
CUAGGGAAUCC... 


What is the nucleotide sequence of the DNA template strand from 
which it was transcribed? If this mRNA is translated beginning with 
the first AUG codon in its sequence, what is the N-terminal amino 
acid sequence of the protein it encodes? (See Table 30.1 for the 
genetic code.) 

2. Describe the sequence of events involved in the initiation of tran- 
scription by E. coli RNA polymerase. Include in your description 
those features a gene must have for proper recognition and tran- 
scription by RNA polymerase. 

3. RNA polymerase has two binding sites for ribonucleoside triphos- 
phates: the initiation site and the elongation site. The initiation site 
has a greater K,, for NTPs than the elongation site. Suggest what 
possible significance this fact might have for the control of tran- 
scription in cells. 

4. Make a list of the ways that transcription in eukaryotes differs from 
transcription in prokaryotes. 

5. DNA-binding proteins may recognize specific DNA regions either 
by reading the base sequence or by “indirect readout.” How do 
these two modes of protein: DNA recognition differ? 

6. (Integrates with Chapter 11.) The metallothionein promoter is 
illustrated in Figure 29.27. How long is this promoter, in nm? How 
many turns of B-DNA are found in this length of DNA? How many 
nucleosomes (approximately) would be bound to this much DNA? 
(Consult Chapter 11 to review the properties of nucleosomes.) 


7. Describe why the ability of bZIP proteins to form heterodimers 
increases the repertoire of genes whose transcription might be re- 
sponsive to regulation by these proteins. 

8. Suppose exon 17 were deleted from the fast skeletal muscle tro- 
ponin T gene (Figure 29.45). How many different mRNAs could 
now be generated by alternative splicing? Suppose that exon 7 in a 
wild-type troponin T gene were duplicated. How many different 
mRNAs might be generated from a transcript of this new gene by al- 
ternative splicing? 
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Figure 29.30 illustrates some of the various covalent modifications 
that occur on histone tails. How might each of these modifications 
influence DNA: histone interactions? 


10. (Integrates with Chapter 15.) Predict from Figure 29.12 whether 
the interaction of lac repressor with inducer might be cooperative. 
Would it be advantageous for inducer to show cooperative binding 
to lac repressor? Why? 


11. What might be the advantages of capping, methylation, and poly- 
adenylylation of eukaryotic mRNAs? 

(Integrates with Chapter 28.) Figure 29.24 shows only one Mg** ion 
in the RNA polymerase II active site; more recent studies reveal the 


presence of two. Why is the presence of two Mg** ions significant? 
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13. (Integrates with Chapter 11.) The SWI/SNF chromatin-remodeling 
complex peels about 50 bp from the nucleosome. Assuming B-form 
DNA, how long is this DNA segment? In forming nucleosomes, 
DNA is wrapped in turns about the histone core octamer. What frac- 
tion of a DNA turn around the core octamer does 50 bp of DNA 
comprise? How does 50 bp of DNA compare to the typical size of eu- 
karyotic promoter modules and response elements? 


14. Draw the structures that comprise the lariat branch point formed 
during mRNA splicing: the invariant A, its 5'-R neighbor, its 3’-Y 
neighbor, and its 2'-G neighbor. 

15. (Integrates with Chapters 6 and 11.) The a-helices in HTH (helix- 
turn-helix motif) DNA-binding proteins are formed from 7- or 
8-amino acid residues. What is the overall length of these a-helices? 
How does their length compare with the diameter of B-form DNA? 


16. Bacteriophage T7 RNA polymerase bound to two DNA strands and 
an RNA strand, as shown in pdb 1MSW, provides a glimpse of tran- 
scription. View this structure at www.pdb.org to visualize how the 
template DNA strand is separated from the nontemplate strand and 
transcribed into an RNA strand. Which Phe residue of the enzyme 
plays a significant role in DNA strand separation? In which domain 
of the polymerase is this Phe located? (You might wish to consult 
Yin, Y. W., and Steitz, T .A., 2002. Structural basis for the transition 
from initiation to elongation transcription in T7 RNA polymerase. 
Science 298:1387-1395 to confirm your answer.) 

17. RNA polymerase II is inhibited by a-amanitin. This mushroom- 
derived toxin has no effect on the enzyme’s affinity for NTP sub- 
strates, but it dramatically slows polymerase translocation along the 
DNA. Go to www.pdb.org to view pdb file 1K83, which is the structure 
of RNA polymerase II with bound a-amanitin. Locate a-amanitin 
within this structure and discuss why its position is consistent with 
its mode of inhibition. 

18. C/EBP® is a bZIP transcription factor in neuronal differentiation, 
learning and memory process, and other neuronal and glial func- 
tions. The structure of the bZIP domain of C/EBP£ bound to DNA 
is shown in pdb file 1GU4. Explore this structure to discover the 
leucine zipper dimerization domain and the DNA-binding basic re- 
gions. On the left side of the www.pdb.org 1GU4 page under “Display 
Files,” click “pdb file” to see the atom-by-atom coordinates in the 
three-dimensional structure (scroll down past “Remarks” to find 
this information). Toward the end of this series, find the amino acid 
sequence of the C/EBP6 domain used in this study. Within this 
amino acid sequence, find the leucine residues of the leucine zip- 
per and the basic residues in the DNA-binding basic region. 


Preparing for the MCAT Exam 

19. Figure 29.15 highlights in red the DNA phosphate oxygen atoms. 
Some of them interact with catabolite activator protein (CAP). 
What kind of interactions do you suppose predominate and what 
kinds of CAP amino acid side chains might be involved in these in- 
teractions? 

20. Chromatin decompaction is a preliminary step in gene expression 
(Figure 29.46). How is chromatin decompacted? 


950 Chapter 29 Transcription and the Regulation of Gene Expression 
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0 Protein Synthesis 


ESSENTIAL QUESTION 


Ribosomes synthesize proteins by reading the nucleotide sequence of mRNAs and 
polymerizing amino acids in an N—>C direction. 


How is the nucleotide sequence of an mRNA molecule translated into the 
amino acid sequence of a protein molecule? 





Ogee 4 Fale K 
The Maya encoded their history in Rees vrs nee We turn now to the problem of how the sequence of nucleotides in an mRNA mol- 
on stelae and temples like these ruins in Tikal, ecule is translated into the specific amino acid sequence of a protein. The problem 
Guatemala. raises both informational and mechanical questions. First, what is the genetic code 
that allows the information specified in a sequence of bases to be translated into the 
amino acid sequence of a polypeptide? That is, how is the 4-letter language of nu- 
cleic acids translated into the 20-letter language of proteins? Implicit in this question 
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We are a spectacular, splendid manifestation 
of life. We have language and can build 
metaphors as skillfully and precisely as 


ribosomes make proteins. We have affection. is a mechanistic problem: It is easy to see how base pairing establishes a one-to-one 

We have genes for usefulness, and usefulness correspondence that allows the template-directed synthesis of polynucleotide chains 

is about as close to a “common goal” of in the processes of replication and transcription. However, there is no obvious chem- 

nature as I can guess at. And finally, and ical affinity between the purine and pyrimidine bases and the 20 different amino 

OED BONELESS acids. Nor is there any obvious structural or stereochemical connection between 
Lewis Thomas (1913-1994) polynucleotides and amino acids that might guide the translation of information. 

“The Youngest and Brightest Thing Around” Francis Crick reasoned that adapter molecules must bridge this information gap. 


in The Medusa and the Snail (1979) 


These adapter molecules must interact specifically with both nucleic acids (mRNAs) 
and amino acids. At least 20 different adapter molecules would be needed, at least 








KEY QUESTIONS one for each amino acid. The various adapter molecules would be able to read the 
genetic code in an mRNA template and align the amino acids according to the tem- 
30.1 What Is the Genetic Code? plate’s directions so that they could be polymerized into a unique polypeptide. 
30.2 How Is an Amino Acid Matched with Its Transfer RNAs (tRNAs; Figure 30.1) are the adapter molecules (see Chapter 10). 
Proper tRNA? Amino acids are attached to the 3’-OH at the 3'-CCA end of tRNAs as aminoacy]l es- 
30.3 What Are the Rules in Codon—Anticodon ters. The formation of these aminoacyl-tRNAs, so-called charged tRNAs, is catalyzed 
Panne by specific aminoacyl-tRNA synthetases. There is one of these enzymes for each of 
30.4 What Is the Structure of Ribosomes, and the 20 amino acids and each aminoacyl-tRNA synthetase loads its amino acid only 
a Rese miblet! onto tRNAs designed to carry it. In turn, these tRNAs specifically recognize unique 

30.5 What Are the Mechanics of mRNA 








Hae sequences of bases in the mRNA through complementary base pairing. 
ranslation ¢ 











30.6 ow Are Proteins Synthesized in Eukaryotic 
Cells? 


30:18 What Is the Genetic Code? 


Once it was realized that the sequence of bases in a gene specified the sequence of 
amino acids in a protein, various possibilities for such a genetic code were consid- 
ered. How many bases are necessary to specify each amino acid? Is the code over- 
lapping or nonoverlapping (Figure 30.2)? Is the code punctuated or continuous? 
Mathematical considerations favored a triplet of bases as the minimal code word, or 
codon, for each amino acid: A doublet code based on pairs of the four possible 
bases, A, C, G, and U, has 4? = 16 unique arrangements, an insufficient number to 
encode the 20 amino acids. A triplet code of four bases has 4° = 64 possible code 
words, more than enough for the task. 


The Genetic Code Is a Triplet Code 


The genetic code is a triplet code read continuously from a fixed starting point in 
each mRNA. Specifically, it is defined by the following: 
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PrE ] 1. A group of three bases codes for one amino acid. 
and Active Figures at www.cengage.com/login. 


2. The code is not overlapping. 





30.2 Howls an Amino Acid Matched with Its Proper tRNA? 


3. The base sequence is read from a fixed starting point without punctuation. That 
is, the mRNA sequences contain no “commas” signifying appropriate groupings 
of triplets. If the reading frame is displaced by one base, it remains shifted 
throughout the subsequent message; no “commas” are present to restore the 
“correct” frame. 

4. The code is degenerate, meaning that, in most cases, each amino acid can be 
coded by any of several triplets. Recall that a triplet code yields 64 codons for 
20 amino acids. Most codons (61 of 64) code for some amino acid. 


Codons Specify Amino Acids 


The complete translation of the genetic code is presented in Table 30.1. Codons, 
like other nucleotide sequences, are read 5'—>3'. Codons represent triplets of bases 
in mRNA or, replacing U with T, triplets along the nontranscribed (nontemplate) 
strand of DNA. 

Several noteworthy features characterize the genetic code: 


1. All the codons have meaning. Of the 64 codons, 61 specify particular amino acids. 
The remaining 3—UAA, UAG, and UGA—specify no amino acid and thus they 
are nonsense codons. Nonsense codons serve as termination codons; they are 
“stop” signals indicating that the end of the protein has been reached. 

2. The genetic code is unambiguous. Each of the 61 “sense” codons encodes only one 
amino acid. 

3. The genetic code is degenerate. With the exception of Met and Trp, every amino acid 
is coded by more than one codon. Several—Arg, Leu, and Ser—are represented 
by six different codons. Codons coding for the same amino acid are called 
synonymous codons. 

4. Codons representing the same amino acid or chemically similar amino acids tend to be simi- 
lar in sequence. Often the third base in a codon is irrelevant, so, for example, all four 
codons in the GGX family specify Gly, and the UCX family specifies Ser (Table 
30.1). This feature is known as third-base degeneracy. Note also that codons with a 
pyrimidine as second base likely encode amino acids with hydrophobic side chains, 
and codons with a purine in the second-base position typically specify polar or 
charged amino acids. The two negatively charged amino acids, Asp and Glu, are 
encoded by GAX codons; GA-pyrimidine gives Asp and GA-purine specifies Glu. 
The consequence of these similarities is that mutations are less likely to be harm- 
ful because single base changes in a codon will result either in no change or in a 
substitution with an amino acid similar to the original amino acid. The degeneracy 
of the code is evolution’s buffer against mutational disruption. 

5. The genetic code is “universal.” Although certain minor exceptions in codon usage 
occur (see A Deeper Look box on page 954), the more striking feature of the 
code is its universality: Codon assignments are virtually the same throughout all 
organisms—archaea, bacteria, and eukaryotes. This conformity means that all 
extant organisms use the same genetic code, providing strong evidence that they 
all evolved from a common primordial ancestor. 


30:28 How Is an Amino Acid Matched with Its Proper tRNA? 


Codon recognition is achieved by aminoacyl-tRNAs. In order for accurate translation 
to occur, the appropriate aminoacyl-tRNA must “read” the codon through base pair- 
ing via its anticodon loop (see Chapter 11). Once an aminoacyl-tRNA has been syn- 
thesized, the amino acid part makes no contribution to accurate translation of the 
mRNA. That is, the amino acid is passively chauffeured by its tRNA and becomes in- 
serted into a growing peptide chain following codon—anticodon recognition be- 
tween the mRNA and tRNA. 


Aminoacyl-tRNA Synthetases Interpret the Second Genetic Code 


A second genetic code must exist, the code by which each aminoacyl-tRNA syn- 
thetase matches up its amino acid with tRNAs that can interact with codons speci- 
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Anticodon 


FIGURE 30.1 Generalized secondary structure of tRNA 
molecules. Circles represent nucleotides in the tRNA 
sequence. The numbers given indicate the standardized 
numbering system for tRNAs (which differ in total num- 
ber of nucleotides). Dots indicate places where the 
number of nucleotides may vary in different tRNA 
species. All tRNAs have the invariant 3-base sequence 
CCA at their 3’-ends. Recall from Chapter 10 that tRNA 
molecules often have modified or unusual bases. 
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FIGURE 30.2 (a) An overlapping versus a nonoverlap- 
ping code. (b) A continuous versus a punctuated code. 
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Third-Base Degeneracy Is Color-Coded 
Position Second Position 
(5'-end) Position (3'-end) Third Bases 
Third-Base with Same 
U C A G Relationship Meaning Number of Codons 
UUU Phe UCU Ser UAU Tyr UGU Cys U Third base U, C, A, G 32 (8 families) 
UUC Phe UCC Ser UAC Tyr UGC Cys C irrelevant 
U UUA Leu UCA Ser UAA Stop UGA Stop A Purines AorG 12 (6 pairs) 
UREN PEGS EBSSEE meem n Pyrimidines U or C 14 (7 pairs) 
CUU Leu CCU Pro CAU His CGU Arg U Tedi UCA 3 (AUX = Ile) 
CUC Leu CCC Pro CAC His CGC Arg C of four A 
C CUA Leu CCA Pro CAA Gln CGA Arg A 
CUG Leu CCG Pro CAG Gin CGG Arg G [H Unique G only 2 (AUG = Met) 
definitions (UGG = Trp) 
AUU Tle ACU Thr AAU Asn AGU Ser U i n 
AUC Ile ACCThr AACAsn AGC Ser c P D Aann 
A AUA Ile ACA Thr AAA Lys AGA Arg A 
ma e AAG Lys AGG Arg G 
GUU Val GCU Ala GAU Asp GGU Gly U 
GUC Val GCC Ala GAC Asp GGC Gly C 
G GUA Val GCA Ala GAA Glu GGA Gly A 
GUG Val GCG Ala GAG Glu GGG Gly G 














*AUG signals translation initiation as well as coding for Met residues. 
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Natural and Unnatural Variations in the Standard Genetic Code 


The genomes of some lower eukaryotes, prokaryotes, and mito- 
chondria show some exceptions to the standard genetic code 
(Table 30.1) in codon assignments. The phenomenon is more 
common in mitochondria. For example, the termination codon 
UGA codes for tryptophan in mitochondria from various animals, 
protozoans, and fungi. AUA, normally an Ile codon, codes for me- 
thionine in some animal and fungal mitochondrial genomes, and 
AGA (an Arg codon) is a termination codon in vertebrate mito- 
chondria but is a Ser codon in fruit fly mitochondria. Mitochon- 
dria in several species of yeast use the CUX codons to specify Thr 
instead of Leu. Some yeast and algal mitochondria use CGG, 
normally an Arg codon, as a stop codon. 

Less common are genomic codon variations within the 
genomes of prokaryotic and eukaryotic cells. Among the lower 
eukaryotes, certain ciliated protozoans (Tetrahymena and Parame- 
cium) use UAA and UGA as glutamine codons rather than stop 
codons. Instances in prokaryotes include use of the stop codon 
UGA to specify Trp by Mycoplasma. Perhaps most interesting is the 
use of some UGA codons by both prokaryotes and eukaryotes (in- 
cluding humans) to specify selenocysteine (Sec), an analog of cys- 
teine in which the sulfur atom is replaced by a selenium atom. In- 
deed, the identification of Sec residues in proteins from bacteria, 
archaea, and eukaryotes has led some people to nominate Sec as 
the 21st amino acid! Sec formation requires a novel Sec-specific 
tRNA known as tRNA‘. This tRNA‘ is loaded with a Ser residue 
by seryl-tRNA synthetase, the aminoacyl-tRNA synthetase for ser- 


ine. Then, in an ATP-dependent process, the Ser-O is replaced by 
Se. Translation of UGA codons by selenocysteinyl-tRNA‘* de- 
pends on the presence of specific stem-loop secondary structures 
in the mRNA called SECIS elements that recode the UGA codon 
from “stop” to “Sec.” SECIS elements are recognized by specific 
proteins that recruit selenocysteinyl-tRNA‘S® to the UGA codon 
during protein synthesis. Most selenoproteins (proteins contain- 
ing selenocysteine) are involved in oxidation—reduction reac- 
tions, and Sec participates directly in the catalytic mechanism. Sec 
provides a more reactive functionality than Cys. 


H 
H — Se — CH, — C— COO- 
NH,* 
Selenocysteine (Sec) 


Recently, in vitro methods have been developed to introduce 
genetically encoded unnatural amino acids with different physi- 
cal, chemical, or biological properties into bacterial or mam- 
malian cells. More than 30 such unnatural amino acids, each with 
unique chemical and photochemical reactivity, can be incorpo- 
rated into proteins using a unique codon and corresponding 
tRNA-aminoacyl-tRNA synthetase pair. (See Wang L., Xie J., and 
Schultz P. G., 2006. Expanding the genetic code. Annual Review of Bio- 
physics and Biomolecular Structure 35:225-249.) 
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fying its amino acid. To interpret this second genetic code, an aminoacyl-tRNA syn- 
thetase must discriminate between the 20 amino acids and the many tRNAs and 
uniquely picks out its proper substrates—one specific amino acid and the tRNA(s) 
appropriate to it—from among the more than 400 possible combinations. The ap- 
propriate tRNAs are those having anticodons that can base-pair with the codons 
specifying the particular amino acid. It is imperative that the proper amino acids be 
loaded onto the various tRNAs so that the mRNA can be translated with fidelity. Al- 
though the primary genetic code is key to understanding the central dogma of mo- 
lecular biology on how DNA encodes proteins, the second genetic code is just as 
crucial to the fidelity of information transfer. 
Cells have 20 different aminoacyl-tRNA synthetases, one for each amino acid. 
Each of these enzymes catalyzes ATP-dependent attachment of its specific amino 
acid to the 3'-end of its cognate tRNA molecules (Figure 30.3). The products of the Cognate kindred; in this sense, cognate refers to 
reaction are an aminoacytl-tRNA, AMP, and PP;. Ever-present pyrophosphatases those tRNAs having anticodons that can read one 
quickly hydrolyze the pyrophosphate product to give 2 P;. This highly exergonic re- Or More of the codons that specify one particular 
action provides the overall thermodynamic driving force for aminoacyl-tRNA syn- @Mino acid. 
thesis. The aminoacyl-tRNA synthetase reaction serves two purposes: 


1. It activates the amino acid so that it will readily react to form a peptide bond. 
2. It bridges the information gap between amino acids and codons. 


The underlying mechanisms of molecular recognition used by each aminoacyl- 
tRNA synthetase to bring the proper amino acid to its cognate tRNA are the em- 
bodiment of the second genetic code. 


Evolution Has Provided Two Distinct Classes 
of Aminoacyl-tRNA Synthetases 


Despite their common enzymatic function, aminoacyl-tRNA synthetases are a di- 
verse group of proteins in terms of size, amino acid sequence, and oligomeric 
structure. In higher eukaryotes, some aminoacyl-tRNA synthetases are assembled 
into large multiprotein complexes. The aminoacyl-tRNA synthetases fall into two 
fundamental classes on the basis of similar amino acid sequence motifs, oligomeric E _ Eig 
state, and acylation function (Table 30.2): class I and class II. Class I aminoacyl- | The Two Classes of Aminoacyl-tRNA | 
tRNA synthetases first add the amino acid to the 2’-OH of the terminal adenylate Synthetases 
residue of tRNA before shifting it to the 3'-OH; class II enzymes add it directly to 
the 3’-OH (Figure 30.3). The catalytic domains of these enzymes evolved from two 
different ancestral predecessors. Aminoacyl-tRNA synthetases are ranked among 
the oldest proteins because the different classes of these enzymes were present very 
early in evolution. Class I and class II aminoacyl-tRNA synthetases interact with the 
tRNA 3’-terminal CCA and acceptor stem in a mirror-symmetric fashion with re- 
spect to each other (Figure 30.4). Class I enzymes bind to the tRNA acceptor stem 
helix from the minor-groove side, whereas class II enzymes bind it from the major- 
groove side. 

Both class I and class II aminoacyl-tRNA synthetases can be approximated as 
two-domain structures, as can their L-shaped tRNA substrates, which have the 
acceptor stem/CCA-3’-OH at one end and the anticodon stem-loop at the other 
(see Figures 11.35 and 30.5). This L-shaped tertiary structure of tRNAs separates 
the 3'-CCA acceptor end from the anticodon loop by a distance of 7.6 nm. The 
two domains of tRNAs have distinct functions: The 3’-CCA end is the site of 
aminoacylation, and the anticodon-containing domain interacts with the mRNA 
template. The two domains of tRNAs interact with the separate domains in the 
synthetases. One of the two major aminoacyl-tRNA synthetase domains is the cat- 
alytic domain (which defines the difference between class I and class II enzymes); 
this domain interacts with the tRNA 3’-CCA end. The other major domain in 
aminoacyl-tRNA synthetases is highly variable and interacts with parts of the 
tRNA beyond the acceptor-TVC stem-loop domain, including, in some cases, the 
anticodon. 
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FIGURE 30.3 The aminoacyl-tRNA synthetase reaction. (a) The overall reaction. (b) Aminoacyl-tRNA formation pro- 
ceeds in two steps: (i) formation of an aminoacyl-adenylate and (ii) transfer of the activated amino acid moiety of 
the mixed anhydride to either the 2’-OH (class | aminoacyl-tRNA synthetases) or 3’-OH (class Il aminoacyl-tRNA 
synthetases) of the ribose on the terminal adenylic acid at the 3’-CCA terminus common to all tRNAs. Those 
aminoacyl-tRNAs formed as 2’-aminoacyl esters undergo a transesterification that moves the aminoacyl function 
to the 3’-O of tRNA. Only the 3’-esters are substrates for protein synthesis. 


Aminoacyl-tRNA Synthetases Can Discriminate Between 
the Various tRNAs 
Aside from the need to uniquely recognize their cognate amino acids, aminoacyl- 


tRNA synthetases must be able to discriminate between the various tRNAs. The 
structural features that permit the synthetases to recognize and aminoacylate their 
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cognate tRNA(s) are not universal. That is, a common set of rules does not govern 
tRNA recognition by these enzymes. Most surprising is the fact that the recognition 
features are not limited to the anticodon and, in some instances, do not even in- 
clude the anticodon. For most tRNAs, a set of sequence elements is recognized by 
its specific aminoacyl-tRNA synthetase, rather than a single distinctive nucleotide or 
base pair. These elements include one or more of the following: (1) at least one 
base in the anticodon; (2) one or more of the three base pairs in the acceptor stem; 
and (3) the base at canonical position 73 (the unpaired base preceding the CCA 
end), referred to as the discriminator base because this base is invariant in the 
tRNAs for a particular amino acid. Figure 30.5 presents a ribbon diagram of a tRNA 


Acceptor stem Position 73 


A, D, G, H, M 
Q, S, T, V, W 





FIGURE 30.4 Mirror-symmetric interactions of class | 
versus class Il aminoacyl-tRNA synthetases with their 
RNA substrates. The two different classes of aminoacyl- 
RNA synthetases bind to opposite faces of tRNA 
molecules. On the left is the structure of the class | 
glutaminyl-tRNAG@" synthetase : tRNA" complex with a 
bound active-site inhibitor (orange) (pdb id = 1EUQ). 
On the right is the structure of the class Il threonyl- 
RNAT synthetase :tRNA™ complex with AMP (red) at 
he active site (odb id = 1QF6).The relative orientation 
of the tRNA is the same in both structures, with the 
3'-CCA end of the tRNA pointed away from the viewer. 





FIGURE 30.5 Ribbon diagram of tRNA tertiary structure. 
Numbers represent the consensus nucleotide sequence 
(see Figure 30.1). The locations of nucleotides recog- 
nized by the various aminoacyl-tRNA synthetases are 
indicated; shown within the boxes are one-letter desig- 
nations of the amino acids whose respective aminoacyl- 
tRNA synthetases interact at the discriminator base (po- 
sition 73), acceptor stem, variable pocket and/or loop, or 
anticodon. The inset shows additional recognition sites 
in those tRNAs having a variable loop that forms a stem- 
loop structure. (Adapted from Figure 2 in Saks, M. E, Sampson, 
J.R.,and Abelson, J. N., 1994. The transfer RNA problem: A search 
for rules. Science 263:191-197.) 
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FIGURE 30.6 Major identity elements in four tRNA 


species. Each base in the tRNA is represented by a circle. 


Numbered filled circles indicate positions of identity 
elements within the tRNA that are recognized by its 
specific aminoacyl-tRNA synthetase. (Adapted from Schul- 
man, L. H., and Abelson, J., 1988. Recent excitement in under- 
standing transfer RNA identity. Science 240:1591-1592.) 





FIGURE 30.7 Structure of E. coli glutaminyl-tRNAC" 
synthetase complexed with tRNA&" and ATP (pdb id = 
1GSG). The protein :tRNA contact region extends along 
one side of the entire length of this extended protein. 
The acceptor stem of the tRNA and the ATP (green) fit 
into a cleft at the top of the protein in this view. The 
enzyme also interacts extensively with the anticodon 
(lower tip of tRNA°"). 


molecule showing the location of nucleotides that contribute to specific recogni- 
tion by the respective aminoacyl-tRNA synthetases for each of the 20 amino acids. 
Interestingly, the same set of tRNA features that serves as positive determinants for 
binding and aminoacylation of the tRNA by its cognate aminoacyl-tRNA synthetase 
may act as negative determinants that prohibit binding and aminoacylation by other 
(noncognate) aminoacyl-tRNA synthetases. Because no common set of rules exists, 
the second genetic code is an operational code based on aminoacyl-tRNA synthe- 
tase recognition of varying sequence and structural features in the different tRNA 
molecules during the operation of aminoacyl-tRNA synthesis. Some examples of 
this code are given in Figure 30.6. 


Escherichia coli Glutaminyl-tRNA“" Synthetase Recognizes 
Specific Sites on tRNA" 


E. coli glutaminyl-tRNA®™ synthetase, a class I enzyme, provides a good illustration of 
aminoacyl-tRNA synthetase : cognate tRNA interactions. This glutaminyl-tRNAC" syn- 
thetase shares a continuous interaction with its cognate tRNA that extends from the 
anticodon to the acceptor stem along the entire inside of the L-shaped tRNA (Fig- 
ure 30.7). Specific recognition elements include enzyme contacts with the discrimi- 
nator base, acceptor stem, and anticodon, particularly the central U in the CUG an- 
ticodon. The carboxylate group of Asp?” makes sequence-specific H bonds in the 
tRNA minor groove with the 2-NH» group of G3 in the base pair G3:C70 of the ac- 
ceptor stem. A mutant glutaminyl-tRNAC™ synthetase with Asn substituted for Asp at 
position 235 shows relaxed specificity; that is, it now will acylate noncognate tRNAs 
with Gln. 


The Identity Elements Recognized by Some Aminoacyl-tRNA 
Synthetases Reside in the Anticodon 


Alteration of the anticodons of either tRNA”? or tRNA‘! to CAU, the anticodon 
for the methionine codon AUG, transforms each of the tRNAs into a substrate for 
methionyl-tRNA synthetase, and they are loaded with methionine. Similarly, re- 
versing the methionine CAU anticodon of tRNA™* to UAC transforms it into a sub- 
strate for valyl-tRNA™! synthetase. Clearly, methionyl-tRNA synthetase and valyl- 
tRNA synthetase rely on the anticodon in selecting tRNAs for loading. 


A Single G : U Base Pair Defines tRNA‘'s 


The noncanonical base pair, G3:U70, is the singular feature by which alanyl- 
tRNA“® synthetase recognizes tRNAs as its substrates. All tRNA“ representatives, 
from archaea to eukaryotes, possess this G3:U70 acceptor stem base pair. Alter- 
ing this unusual G3:U70 base pair of tRNA‘" to G:C, A:U, or even U:G abol- 
ishes its ability to be aminoacylated with alanine. On the other hand, provided 
the G3:U70 base pair is present, alanyl-tRNA*" synthetase aminoacylates a 24- 
nucleotide stem-loop analog of tRNA‘ (Figure 30.8). The key feature of the 
G3:U70 base pair is the 2-NHg group of G3. In the RNA A-form double-helical 
structure adopted by the tRNA acceptor stem, the G3 2-HN» group is exposed in 
the minor groove of the helix, and if the G3 pairing partner is a U, this 2-NH» 
group lacks an H-bonding partner (Figure 30.8). Thus, an unpaired G 2-amino 
group at the right place in a tRNA acceptor stem marks a tRNA for aminoacyla- 
tion by alanyl-tRNA*" synthetase. 


30:38 What Are the Rules in Codon—Anticodon Pairing? 


Protein synthesis depends on the codon-directed binding of the proper aminoacyl- 
tRNAs so that the right amino acids are sequentially aligned according to the spec- 
ifications of the mRNA undergoing translation. This alignment is achieved via 
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FIGURE 30.8 (a) A microhelix analog of tRNA‘ is aminoacylated by 
alanyl-tRNA*? synthetase, provided it has the characteristic tRNA*® 
G3:U70 acceptor stem base pair. The microhelix’? consists of nucleo- 
tides 1 through 13 of tRNA*"/6°C connected directly to 66 through 76 to 
re-create the tRNA“" 7-bp acceptor stem. (b) The unbonded 2-HN> 
group of the G3 in the G3:U70 base pair would lie within the minor 
groove of an A-form RNA double helix. [(a) Adapted from Schimmel, P, 1989. 
tRNA 4/8/GGC Microhelix“* Parameters for molecular recognition of transfer RNAs. Biochemistry 28:2747-2759] 

















codon-anticodon pairing in antiparallel orientation (Figure 30.9). However, con- 

siderable degeneracy exists in the genetic code at the third position. Conceivably, Lerna | 

this degeneracy could be handled in either of two ways: (1) Codon-anticodon Anticodon 
recognition could be highly specific so that a complementary anticodon is re- 3 —C—=G—=G— 5 
quired for each codon, or (2) fewer than 61 anticodons could be used for the 5 eee- 3’ 
“sense” codons if certain allowances were made in the base-pairing rules. Then, Codon 

some anticodons could recognize more than one codon. As early as 1965, it was 
known that poly(U) bound all Phe-tRNA™®! molecules even though UUC is also a 


Phe codon. The phenylalanine-specific tRNAs could recognize either UUU or RGURES0 ates sed pea 

UUG. Also, one particular yeast tRNA“ was able to bind to three codons: GCU, . ae ao Paine somp smena 
trinucleotide sequence elements align in antiparallel 

GCC, and GCA. fashion. 


Francis Crick Proposed the “Wobble” Hypothesis 
for Codon—Anticodon Pairing 


Francis Crick considered these results and tested alternative base-pairing possibili- 
ties by model building. He hypothesized that the first two bases of the codon and 
the last two bases of the anticodon form canonical Watson—Crick A:U or G:C base 
pairs, but pairing between the third base of the codon and the first base of the an- 
ticodon follows less stringent rules. That is, a certain amount of play, or wobble, 
might be allowed in base pairing at this position. The third base of the codon is 
sometimes referred to as the wobble position. 

Crick’s investigations suggested a set of rules for pairing between the third base 
of the codon and the first base of the anticodon (Table 30.3). The wobble rules in- TABLE 30.3 
dicate that a first-base anticodon U could recognize either an A or G in the codon 
third-base position; first-base anticodon G might recognize either U or C in the Bases Recognized 
third-base position of the codon; and first-base anticodon I! might interact with U, Base on the Anticodon on the Codon 
C, or A in the codon third position (Figure 30.10) .? 











U A,G 
The wobble rules also predict that four-codon families (like Pro or Thr), where 
. . ae à : C G 
any of the four bases may be in the third position, require at least two different r U 
tRNAs. However, all members of the set of tRNAs specific for a particular amino 
acid—termed isoacceptor tRNAs—are served by one aminoacyl-tRNA synthetase. z UG 
I U,C, A 





'T is inosine (6-OH purine). 
?Thus, the first base of the anticodon indicates whether the tRNA can read one, two, or three differ- Adapted from Crick, F. H. C., 1966. Codon-anticodon pairing: 
ent codons: Anticodons beginning with A or C read only one codon, those beginning with Gor U read The wobble hypothesis. Journal of Molecular Biology 

two, and anticodons beginning with I can read three codons. 19:548-555. 
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FIGURE 30.10 Pairing of anticodon inosine (I, left) with C, 
U, or A as the codon third base. Note that | is in the keto 
tautomeric form. 


H Cytosine 





Some Codons Are Used More Than Others 


Because more than one codon exists for most amino acids, the possibility for varia- 
tion in codon usage arises. Indeed, variation in codon usage accommodates the fact 
that the DNA of different organisms varies in relative A:T/G:C content. Neverthe- 
less, even in organisms of average base composition, codon usage may be biased. 
Table 30.4 gives some examples from E. coli and humans reflecting the nonrandom 
usage of codons. Of more than 109,000 Leu codons tabulated in a set of human 
genes, CUG was used in excess of 48,000 times, CUC more than 23,000 times, but 
UUA just 6000 times. 

The occurrence of codons in £E. coli mRNAs correlates well with the relative abun- 
dance of the tRNAs that read them. Preferred codons are represented by the most 
abundant isoacceptor tRNAs. Furthermore, mRNAs for proteins that are synthe- 
sized in abundance tend to employ preferred codons. Rare tRNAs correspond to 
rarely used codons, and messages containing such codons might experience delays 
in translation. 


Nonsense Suppression Occurs When Suppressor tRNAs 
Read Nonsense Codons 


Mutations that alter a sense codon to one of the three nonsense codons—UAA, UAG, 
or UGA—result in premature termination of protein synthesis and the release of trun- 
cated (incomplete) polypeptides. Geneticists found that second mutations elsewhere 
in the genome were able to suppress the effects of nonsense mutations so that the or- 
ganism survived, a phenomenon termed nonsense suppression. The molecular basis 
for nonsense suppression was a mystery until it was realized that suppressors were mu- 
tations in tRNA genes that altered the anticodon so that the mutant tRNA could now 
read a particular “stop” codon and insert an amino acid. For example, alteration of the 
anticodon of a tRNA®" from GUA to CUA allows this tRNA to read the amber stop 
codon, UAG, and insert Tyr. (The nonsense codons are named amber [UAG], ochre 
[UAA], and opal [UGA]). Suppressor tRNAs are typically generated from minor tRNA 
species within a set of isoacceptor tRNAs, so their recruitment to a new role via muta- 
tion does not involve loss of an essential tRNA; that is, the mutation is not particularly 
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TABLE 30.4 








The results are expressed as frequency of occurrence of a codon per 1000 codons 

tabulated in 1562 E. coli genes and 2681 human genes, respectively. (Because E. coli 

and human proteins differ somewhat in amino acid composition, the frequencies for 

a particular amino acid do not correspond exactly between the two species.) 

E. coli Gene Human Gene 

Amino Acid Codon Frequency/1000 Frequency/1000 

Leu CUA 3.2 6.1 
CUC 9.9 20.1 
CUG 54.6 42.1 
CUU 10.2 10.8 
UUA 10.9 5.4 
UUG 11.5 11.1 

Pro CCA 8.2 15.4 
CCC 4.3 20.6 
CCG 23.8 6.8 
CCU 6.6 16.1 

Ala GCA 15.6 14.4 
GCC 34.4 29.7 
GCG 32.9 7.2 
GCU 13.4 18.9 

Lys AAA 36.5 21.9 
AAG 12.0 35.2 

Glu GAA 43.5 26.4 
GAG 19:2 41.6 











Adapted from Wada, K., et al., 1992. Codon usage tabulated from Genbank genetic sequence data. Nucleic Acids Research 
20:2111-2118. 


deleterious to the organism. A suppressor tRNA, as a mutant tRNA, may even carry and 
introduce an amino acid different from the one borne by the wild-type tRNA. 


30:45 What Is the Structure of Ribosomes, 
and How Are They Assembled? 


Protein biosynthesis is achieved by the process of translation. Translation converts 
the language of genetic information embodied in the base sequence of a messen- 
ger RNA molecule into the amino acid sequence of a polypeptide chain. During 
translation, proteins are synthesized on ribosomes by linking amino acids together 
in the specific linear order stipulated by the sequence of codons in an mRNA. Ri- 
bosomes are the agents of protein synthesis. 

Ribosomes are compact ribonucleoprotein particles found in the cytosol of all 
cells, as well as in the matrix of mitochondria and the stroma of chloroplasts. The gen- 
eral structure of ribosomes is described in Chapter 10; here we consider their struc- 
ture in light of their function in synthesizing proteins. Ribosomes are mechanochem- 
ical systems that move along mRNA templates, orchestrating the interactions between 
successive codons and the corresponding anticodons presented by aminoacyl-tRNAs. 
As they align successive amino acids via codon—anticodon recognition, ribosomes also 
catalyze the formation of peptide bonds between the growing peptide chain and in- 
coming amino acids. 


Prokaryotic Ribosomes Are Composed of 30S and 50S Subunits 


E. coli ribosomes are representative of the structural organization of the prokary- 
otic versions of these supramolecular protein-synthesizing machines (Table 30.5, 
see also Figure 10.22). The E. coli ribosome is a roughly globular particle with a 
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FIGURE 30.11 The seven ribosomal RNA operons in 
E.coli. Numerals to the right of the brackets indicate the 
number of species of tRNA encoded by each transcript. 
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Ribosome Small Subunit Large Subunit 

Sedimentation coefficient 70S 30S 50S 

Mass (kD) 2520 930 1590 

Major RNAs 16S = 1542 bases 23S = 2904 bases 
Minor RNAs 5S = 120 bases 
RNA mass (kD) 1664 560 1104 

RNA proportion 66% 60% 70% 

Protein number 21 polypeptides* 31 polypeptidest 
Protein mass (kD) 857 370 487 

Protein proportion 34% 40% 30% 











*The S proteins 
tThe L proteins 


diameter of 25 nm, a sedimentation coefficient of 70S, and a mass of about 
2520 kD. It consists of two unequal subunits that dissociate from each other at Mg?* 
concentrations below 1 mM. The smaller, or 30S, subunit is composed of 21 differ- 
ent proteins and a single rRNA, 16S ribosomal RNA (rRNA). The larger 50S sub- 
unit consists of 31 different proteins and two rRNAs: 23S rRNA and 5S rRNA. Ri- 
bosomes are roughly two-thirds RNA and one-third protein by mass. An E. coli cell 
contains around 20,000 ribosomes, constituting about 20% of the dry cell mass. 


Prokaryotic Ribosomes Are Made from 50 Different Proteins 
and Three Different RNAs 


Ribosomal Proteins There is one copy of each ribosomal protein per 70S 
ribosome, excepting protein L7/L12 (L7 and L12 have identical amino acid se- 
quences and differ only in the degree of N-terminal acetylation). Only one pro- 
tein is common to both the small and large subunit: S20 = L26. The largest 
ribosomal protein is S1 (557 residues, 61.2 kD); the smallest is L34 (46 residues, 
5.4 kD). The sequences of ribosomal proteins share little similarity. These pro- 
teins are typically rich in the cationic amino acids Lys and Arg and have few aro- 
matic amino acid residues, properties appropriate to proteins intended to inter- 
act strongly with polyanionic RNAs. 


10,1, or 2 
Chromosomal 
Spacer tRNA Trailer tRNA location (min) 
Tle, Ala ee 86 
Ga eee 89 
Glu Asp, Trp 84 
Tle, Ala nee 71 
Gu y YYY 90 
Glu Asp 56 
Ile, Ala 5 
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rRNAs_ The rRNAs of E. coli are encoded by a set of seven operons (Figure 30.11). 
Each of these operons is transcribed into a 30S rRNA precursor that includes several 
tRNAs. RNase II and other nucleases cleave these precursors to generate 23S, 16S, 
and 5S rRNA, as well as several tRNAs that are unique to each operon. Transcription 
of rRNA genes accounts for 80% to 90% of total cellular RNA synthesis. Ribosomal 
RNAs show extensive potential for intrachain hydrogen bonding and assume sec- 
ondary structures reminiscent of tRNAs, although substantially more complex (see 
Figures 11.36 and 11.37). About two-thirds of rRNA is double-helical. Double-helical 
regions are punctuated by short, single-stranded stretches, generating hairpin con- 
formations that dominate the molecule; four distinct domains can be discerned in the 
secondary structure of 16S rRNA and six in 23S rRNA. The three-dimensional struc- 
tures of both the 30S and 50S ribosomal subunits show that the general shapes of the 
ribosomal subunits are determined by the conformation of the rRNA molecules 
within them. Figure 30.12 illustrates the three-dimensional structure of 16S rRNA 
within the 30S subunit. The overall form of the 30S structure is essentially that of the 
rRNA (compare Figures 30.12 and 30.13a). The same relationship is true for the 23S 
plus 5S rRNA and the large ribosomal subunit (compare Figures 11.37 and 30.13b). 
Ribosomal proteins serve a largely structural role in ribosomes; their primary function 
is to brace and stabilize the rRNA conformations within the ribosomal subunits. 


Ribosomes Spontaneously Self-Assemble In Vitro 


Ribosomal subunit self-assembly is one of the paradigms for the spontaneous forma- 
tion of supramolecular complexes from their macromolecular components. If the 
individual proteins and rRNAs composing ribosomal subunits are mixed together in 
vitro under appropriate conditions of pH and ionic strength, spontaneous self- 
assembly into functionally competent subunits takes place without the intervention 
of any additional factors or chaperones. The rRNA acts as a scaffold upon which the 
various ribosomal proteins convene. Ribosomal proteins bind in a specified order. 
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FIGURE 30.12 Tertiary structure of the 16S rRNA within 
the Thermus thermophilus 30S ribosomal subunit (pdb 
id = 2502). This view is of the face that interacts with 
the 50S subunit (see Figures 30.13 and 30.15). H, head; 
Be, beak; N, neck; P platform; Sh, shoulder; Sp, spur; Bo, 
body. 


FIGURE 30.13 Structure of the T. themophilus ribo- 
somal subunits and 70S ribosome, as deduced by 
X-ray crystallography. Prominent structural features 
are labeled. (a) 30S (pdb id = 2502) and (b) 50S 
(pdb id = 2J03) subunits. These views show the 
sides of these two that form the interface between 
them when they come together to form a 70S 
subunit (c). (d) is a side view of the 70S ribosome; 
the white area represents the region where mRNA 
and tRNAs are bound and peptide bond formation 
occurs. The tunnel through the 50S subunit that 
the growing peptide chain transits is shown as a 
dashed line. The approximate dimensions of the 
30S subunit are 5.5 X 22 X 22 nm;the 50S subunit 
dimensions are 15 X 20 X 20 nm. 
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Ribosomes Have a Characteristic Anatomy 


Ribosomal subunits have a characteristic three-dimensional architecture that has 
been revealed by image reconstructions from cryoelectron microscopy, X-ray crys- 
tallography, and X-ray and neutron solution scattering. Such analyses provide images 
as depicted in Figure 30.13. The 30S, or small, subunit features a “head” and a 
“base,” or “body,” from which a “platform” projects. A cleft is defined by the spatial 
relationship between the head, base, and platform (Figure 30.13a). The mRNA 
passes across this cleft. The platform represents the central domain of the 30S sub- 
unit; it contains one-third of the 16S rRNA. This central domain binds mRNA and 
the anticodon stem-loop end of aminoacyl-tRNAs, providing the framework for de- 
coding the genetic information in mRNA by mediating codon—anticodon recogni- 
tion. As such, this central domain of the 30S subunit serves as the decoding center. 
This center is composed only of 16S rRNA; no ribosomal proteins are involved in de- 
coding the message. 

The 50S, or large, subunit is a mitt-like globular structure with three distinctive 
projections: a “central protuberance,” the “stalk” containing protein L1, and a wing- 
like ridge known as the “L7/L12 region” (Figure 30.13b). The large subunit binds 
the aminoacyl-acceptor ends of the tRNAs and is responsible for catalyzing forma- 
tion of the peptide bond formed between successive amino acids in the polypeptide 
chain. This catalytic center, the peptidyl transferase, is located at the bottom of a 
deep cleft. From it, a 10-nm-long tunnel passes outward through the back of the 
large subunit. 

The small and large subunits associate with each other in the manner shown in Fig- 
ure 30.13c and d. The contacts between the 30S and 50S subunits are rather limited, 
and the subunit interface contains mostly rRNA, with relatively little contribution from 
ribosomal proteins. The decoding center in the 30S subunit is aligned somewhat with 
the peptidyl transferase and the tunnel in the large subunit, and the growing peptidyl 
chain is threaded through this tunnel as protein synthesis proceeds. Even though the 
ribosomal proteins are arranged peripherally around the rRNAs in ribosomes, rRNA 
occupies 30% to 40% of the ribosomal subunit surface areas. 

Both subunits are involved in translocation, the process by which the mRNA 
moves through the ribosome, one codon at a time. (Although it is physically more 
likely for the mRNA to move through the ribosome, the descriptions of the events 
in protein synthesis that follow imply that the ribosome moves along the mRNA.) 


The Cytosolic Ribosomes of Eukaryotes Are Larger 
Than Prokaryotic Ribosomes 


Eukaryotic cells have ribosomes in their mitochondria (and chloroplasts) as well as 
in the cytosol. The mitochondrial and chloroplastic ribosomes resemble prokary- 
otic ribosomes in size, overall organization, structure, and function, a fact reflecting 
the prokaryotic origins of these organelles. Although eukaryotic cytosolic ribosomes 
are larger and considerably more complex, they retain the “core” structural and 
functional properties of their prokaryotic counterparts, confirming that the funda- 
mental ribosome organization and operation has been conserved across evolution- 
ary time. The rRNA genes of eukaryotes are present in the form of several hundred 
tandem clusters; these clusters define the nucleolus, a distinct region of the nucleus 
where these clusters are located and where transcription of rRNA genes occurs. As 
in prokaryotes, 80% to 90% of eukaryotic transcription is rRNA synthesis. Higher 
eukaryotes have more complex ribosomes than lower eukaryotes. For example, the 
yeast cytosolic ribosomes have major rRNAs of 3392 (large subunit) and 1799 nu- 
cleotides (small subunit); the major rRNAs of mammalian cytosolic ribosomes are 
4718 and 1874 nucleotides, respectively. Table 30.6 lists the properties of cytosolic 
ribosomes in a mammal, the rat. Comparison of base sequences and secondary 
structures of rRNAs from different organisms suggests that evolution has worked to 
conserve the secondary structure of these molecules, although not necessarily the 
nucleotide sequences creating such structure. That is, the retention of a base pair 
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TABLE 30.6 | lian | C 
Ribosome Small Subunit Large Subunit 

Sedimentation coefficient 80S 40S 60S 

Mass (kD) 4220 1400 2820 

Major RNAs 18S = 1874 bases 28S = 4718 bases 

Minor RNAs 5.88 = 160 bases 
5S = 120 bases 

RNA mass (kD) 2520 700 1820 

RNA proportion 60% 50% 65% 

Protein number 33 polypeptides 49 polypeptides 

Protein mass (kD) 1700 700 1000 

Protein proportion 40% 50% 35% 











at a particular location seems more important than whether the base pair is G:C 
or A:U. 


30:5: What Are the Mechanics of mRNA Translation? 


In translating an mRNA, a ribosome must move along it in the 5’ 3’ direction, re- 
cruiting aminoacyl-tRNAs whose anticodons match up with successive codons and 
joining amino acids in peptide bonds in a polymerization process that forms a par- 
ticular protein. Like chemical polymerization processes, protein biosynthesis in all 
cells is characterized by three distinct phases: initiation, elongation, and termina- 
tion. At each stage, the energy driving the assembly process is provided by GTP hy- 
drolysis, and specific soluble protein factors participate in the events. These soluble 
proteins are often G-protein family members that use the energy released upon hy- 
drolysis of bound GTP to fuel switchlike conformational changes. Such conforma- 
tional changes are at the heart of the mechanical steps necessary to move a ribo- 
some along an mRNA and to deliver an aminoacyl-tRNA into appropriate register 
with a codon. 

Initiation involves binding of mRNA by the small ribosomal subunit, followed by 
association of a particular initiator aminoacyl-tRNA that recognizes the first codon. 
This codon often lies within the first 30 nucleotides or so of mRNA spanned by the 
small subunit. The large ribosomal subunit then joins the initiation complex, 
preparing it for the elongation stage. 

Elongation includes the synthesis of all peptide bonds from the first to the last. 
The ribosome remains associated with the mRNA throughout elongation, moving 
along it and translating its message into an amino acid sequence. This is accom- 
plished via a repetitive cycle of events in which successive aminoacyl-tRNAs are 
added to the ribosome: mRNA complex as directed by codon binding, the 50S sub- 
unit catalyzes peptide bond formation, and the polypeptide chain grows by one 
amino acid at a time. 

Three tRNA molecules may be associated with the ribosome:mRNA complex at 
any moment. Each lies in a distinct site (Figure 30.14). The A, or acceptor, site is the 
attachment site for an incoming aminoacyltRNA. The P, or peptidyl, site is occupied 
by peptidyl-tRNA, the tRNA carrying the growing polypeptide chain. The elongation 
reaction transfers the peptide chain from the peptidyl-tRNA in the P site to the 
aminoacyl-tRNA in the A site. This transfer occurs through covalent attachment of 
the a-amino group of the aminoacyl-tRNA to the a-carboxyl group of the peptidyl- 
tRNA, forming a new peptide bond. The new, longer peptidyl-tRNA now moves from 
the A site into the P site as the ribosome moves one codon further along the mRNA. 
The A site, left vacant by this translocation, can accept the next incoming aminoacyl- 
tRNA. The E, or exit, site, is transiently occupied by the “unloaded,” or deacylated, 
tRNA, which has lost its peptidyl chain through the peptidyl transferase reaction. 
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CENGAGENOW” ACTIVE FIGURE 30.14 The basic steps 
in protein synthesis. The ribosome has three distinct 
binding sites for tRNA: the A, or acceptor, site; the P or 
peptidyl, site; and the E, or exit, site. Test yourself on the 
concepts in this figure at www.cengage.com/login. 
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FIGURE 30.15 The three tRNA-binding sites on ribo- 
somes. The view shows the ribosomal surfaces that form 
the interface between the 30S (a) and 50S (b) subunits 
in a 70S ribosome (as if a 70S ribosome has been 
“opened” like a book to expose facing “pages”). The 

A (green), P (blue), and E (yellow) sites are occupied by 
tRNAs. The decoding center on the 30S subunit (b) lies 
behind the top of the tRNAs in the A and P sites (which 
is where the anticodon ends of the tRNAs are located). 
The peptidyl transferase center on the 50S subunit (b) 
lies at the lower tips (acceptor ends) of the A- and P-site 
tRNAs. (a: pdb id = 2502; b: pdb id = 203.) 
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FIGURE 30.16 The secondary structure of E.coli N-formyl- 
methionyl-tRNA fM®t. The features distinguishing it from 
noninitiator tRNAs are highlighted. 
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These events are summarized in Figure 30.14. The contributions made to each of the 
three tRNA-binding sites by each ribosomal subunit are shown in Figure 30.15. 

Termination is triggered when the ribosome reaches a “stop” codon on the 
mRNA. At this point, the polypeptide chain is released and the ribosomal subunits 
dissociate from the mRNA. 

Protein synthesis proceeds rapidly. In vigorously growing bacteria, about 20 
amino acid residues are added to a growing polypeptide chain each second. So an 
average protein molecule of about 300 amino acid residues is synthesized in only 
15 seconds. Eukaryotic protein synthesis is only about 10% as fast. Protein syn- 
thesis is also highly accurate: An inappropriate amino acid is incorporated only 
once in every 104 codons. We focus first on protein synthesis in E. coli, the system 
for which we know the most. 


Peptide Chain Initiation in Prokaryotes Requires a G-Protein 
Family Member 


The components required for peptide chain initiation include (1) mRNA; (2) 30S 
and 50S ribosomal subunits; (3) a set of proteins known as initiation factors; 
(4) GTP; and (5) a specific charged tRNA, f-Met-tRNA#™*. A discussion of the 
properties of these components and their interaction follows. 


Initiator tRNA tRNA“ is a particular tRNA for reading an AUG (or GUG, or even 
UUG) codon that signals the start site, or N-terminus, of a polypeptide chain; the sub- 
script į signifies “initiation.” This tRNA;™* does not read internal AUG codons, so it 
does not participate in chain elongation. Instead, that role is filled by another 
methionine-specific tRNA, referred to as tRNAM*, which cannot replace tRNA}™* in 
peptide chain initiation. (However, both of these tRNAs are loaded with Met by the 
same methionyl-tRNA synthetase.) The structure of E. coli tRNAjf™* has several distin- 
guishing features (Figure 30.16). Collectively, these features identify this tRNA as es- 
sential to initiation and inappropriate for chain elongation. 

The synthesis of all E. coli polypeptides begins with the incorporation of a modi- 
fied methionine residue, N-formyl-Met, as N-terminal amino acid. However, in about 
half of the E. colt proteins, this Met residue is removed once the growing polypep- 
tide is ten or so residues long; as a consequence, many mature proteins in E. coli lack 
N-terminal Met. 

The methionine contributed in peptide chain initiation by tRNA#™* is unique in 
that its amino group has been formylated. This reaction is catalyzed by a specific en- 
zyme, methionyl-tRNA ;™“ formyl transferase (Figure 30.17). Note that the addition 
of the formyl group to the a-amino group of Met creates an N-terminal block re- 
sembling a peptidyl grouping. That is, the initiating Met is transformed into a mini- 
mal analog of a peptidyl chain. 
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mRNA Recognition and Alignment In order for the mRNA to be translated accu- 
rately, its sequence of codons must be brought into proper register with the transla- 
tional apparatus. Recognition of translation initiation sequences on mRNAs involves 
the 16S rRNA component of the 30S ribosomal subunit. Base pairing between a 
pyrimidine-rich sequence at the 3'-end of 16S rRNA and complementary purine-rich 
tracts at the 5’-end of prokaryotic mRNAs positions the 30S ribosomal subunit in 
proper alignment with an initiation codon on the mRNA. The purine-rich mRNA se- 
quence, the ribosome-binding site, is often called the Shine-Dalgarno sequence in 
honor of its discoverers. Figure 30.18 shows various Shine-Dalgarno sequences found 
in prokaryotic mRNAs, along with the complementary 3'-tract on E. coli 16S rRNA. 
The 3’-end of 16S rRNA resides in the “head” region of the 30S small subunit, and 
the double helical duplex formed by the Shine-Dalgarno sequence and the 16S rRNA 
3’-end fits snugly in a chamber between the 30S subunit “head” and “platform” 
domains. 


Initiation Factors Initiation involves interaction of the initiation factors (IFs) with 
GTP, N-formyl-Met-ttRNA#™*, mRNA, and the 30S subunit to give a 30S initiation 
complex to which the 50S subunit then adds to form a 70S initiation complex. The 
initiation factors are soluble proteins required for assembly of proper initiation 
complexes. Their properties are summarized in Table 30.7. 























Initiation 

codon 
araB - UUUGGAUGGAGUGAAACGAUGGCGAUU- 
galE - AGCCUAAUGGAGCGAAUUA AGAGUU- 
lacl - CAAUUCAIGEGGUGGUGAUUGUGAAACCA- 
lacZ - UUCACACAGGAAACAGCUAUGACCAUG- 
Q B phage replicase - UA A CR U GAA AUGCAUGUCUAAG- 
fX174 phage A protein -AAUCUUGGAGGCUUUUUU: GUUCGU- 
R17 phage coat protein -UCAACCGGGGUUUGAAGCA GCUUCU- 
ribosomal protein S12 - AAAACCAGGAGCUAUUUAAUGGCAACA-— 
ribosomal protein L10 - CUACCAGGAGCAAAGCUAAUGGCUUUA- 
irpE - CAAAAUUAGAGAAUAACAA) CAAACA-— 
irpL leader - GUAAAAAGGGUAUCGACAAUGAAAGCA-— 


? yAUUCCUCCACUAG- 5 


3'-end of 16S rRNA 









UUITEL EA Properties o 
Mass Molecules/ 
Factor (kD) Ribosome Function 
IF-1 9 0.15 Binds to 30S A site and prevents tRNA binding 
IF-2 97 G-protein that binds fMet-tRNA;™; interacts with IF-1 
IF-3 23 0.25 Binds to 30S E site; prevents 50S binding 
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CENGAGENOW” ANIMATED FIGURE 30.17 Methionyl- 
tRNA {Met formyl transferase catalyzes the transformyla- 
tion of methionyl-tRNA{™* using N'°-formyl-THF as 
formyl donor. The tRNA for reading Met codons within a 
protein (tRNA™* is not a substrate for this transformyl- 
ase. See this figure animated at www.cengage.com/ 
login. 


FIGURE 30.18 Various Shine-Dalgarno sequences rec- 
ognized by £. coli ribosomes. These sequences lie about 
ten nucleotides upstream from their respective AUG 
initiation codon and are complementary to the UCCU 
core sequence element of E.coli 16S rRNA.G:U as well 
as canonical G:C and A:U base pairs are involved here. 
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CENGAGENOW” ACTIVE FIGURE 30.19 The sequence 
of events in peptide chain initiation. Test yourself on 
the concepts in this figure at www.cengage.com/ 
login. 


Events in Initiation Initiation begins when a 30S subunit: (IF-3:IF-1) complex 
binds mRNA and a complex of IF-2, GTP, and fMettRNA#™*. The sequence of 
events is summarized in Figure 30.19. Although IF-3 is absolutely essential for 
mRNA binding by the 30S subunit, it is not involved in locating the proper transla- 
tion initiation site on the message. The presence of IF-3 on 30S subunits also pre- 
vents them from reassociating with 50S subunits. IF-3 must dissociate before the 50S 
subunit will associate with the mRNA: 30S subunit complex. 

IF-2 delivers the initiator fMettRNA™* in a GTP-dependent process. Apparently, 
the 30S subunit is aligned with the mRNA such that the initiation codon is situated 
within the “30S part” of the P site. Upon binding, fMettRNA#™* enters this 30S 
portion of the P site. GTP hydrolysis is necessary to form an active 70S ribosome. GTP 
hydrolysis is triggered when the 50S subunit joins and is accompanied by IF-1 and 
IF-2 release. The A site of the 70S initiation complex is ready to accept an incoming 
aminoacyl-tRNA; the 70S ribosome is poised to begin chain elongation. 


Peptide Chain Elongation Requires Two G-Protein Family Members 


The requirements for peptide chain elongation are (1) an mRNA:70S ribosome: 
peptidyl-tRNA complex (peptidyltRNA in the P site), (2) aminoacyl-tRNAs, (3) a set 
of proteins known as elongation factors, and (4) GTP. Chain elongation can be divided 
into three principal steps: 


1. Codon-directed binding of the incoming aminoacyl-tRNA at the A site. Decoding 
center regions of 16S rRNA make sure the proper aminoacyl-tRNA is in the A site 
by direct surveillance of codon—anticodon base pairing geometry. 

2. Peptide bond formation: transfer of the peptidyl chain from the tRNA bearing it 
to the —NHg group of the new amino acid. 

3. Translocation of the “one-residue-longer” peptidyl-tRNA to the P site to make 
room for the next aminoacyl-tRNA at the A site. These shifts are coupled with 
movement of the ribosome one codon further along the mRNA. 


The Elongation Cycle 


The properties of the soluble proteins essential to peptide chain elongation are 
summarized in Table 30.8. These proteins are present in large quantities, reflecting 
the great importance of protein synthesis to cell vitality. For example, elongation 
factor Tu (EF-Tu) is the most abundant protein in E. coli, accounting for 5% of 
total cellular protein. 

















30.5 
UUTEELEM Properties of E. coli Elongation Factors F 
Mass Molecules/ 
Factor (kD) Cell Function 
EF-Tu 43 70,000 G protein that binds aminoacyl-tRNA and delivers it 
to the A site 
EF-Ts 74 10,000 Guanine-nucleotide exchange factor (GEF) that 
replaces GDP on EF-Tu with GTP 
EF-G 77 20,000 G protein that promotes translocation of mRNA 











Aminoacyl-tRNA Binding 


EF-Tu binds aminoacyl-tRNA and GTP. There is only one EF-Tu species serving all the 
different aminoacyltRNAs, and aminoacyl-tRNAs are accessible to the A site of active 
70S ribosomes only in the form of aminoacyl-tRNA: EF-Tu: GTP complexes. Once cor- 
rect base pairing between codon and anticodon has been established within the A site, 
the GTP is hydrolyzed to GDP and P; by EF-Tu. Because the sites of codon—anticodon 
recognition and GTP hydrolysis are 7.5 nm apart, conformational changes in the 
ribosome must convey the information of cognate tRNA recognition to the EF-Tu 
GTPase site. The aminoacyl end of the tRNA is properly oriented in the peptidyl trans- 
ferase site of the 50S subunit, and the EF-Tu molecule is released as a EF-Tu: GDP com- 
plex (Figure 30.20). 

Elongation factor Ts (EF-Ts) is a guanine-nucleotide exchange factor (GEF) that 
catalyzes the recycling of EF-Tu by mediating the displacement of GDP and its re- 
placement by GTP. EF-Ts forms a transient complex with EF-Tu by displacing GDP, 
whereupon GTP displaces EF-Ts from EF-Tu (Figure 30.20). 


The Decoding Center: A 16S rRNA Function Analysis of the structures of the 70S 
ribosome :tRNA complexes and isolated 30S subunits has revealed the decoding 
center in the 30S subunit. This decoding center, where anticodon loops of the 
A- and P-site tRNAs and the codons of the mRNA are matched up, is primarily a 
property of 16S rRNA. Figure 30.21 reveals the location of the decoding center and 
highlights the interface between an mRNA UUU codon, the GAA anticodon of a 
cognate tRNA", and the elements of the ribosome. 16S rRNA nucleotides A1493, 
A1492, G530, C518, and C1054 interact extensively with the minor groove of the 
UUU:AAG duplex; residues Ser” and Pro* of ribosomal protein $12 also partici- 
pate (Figure 30.21b). A1492 and A1493 are part of 16S rRNA helix 44; G530 and 
C518 are from 16S helix 18. Triggered conformational changes within these 16S 
rRNA regions are key to codon—anticodon recognition. 


Peptidyl Transfer Peptidyl transfer, or transpeptidation, is the central reaction of 
protein synthesis, the actual peptide bond-forming step. No energy input (for ex- 
ample, in the form of ATP) is needed; the ester bond linking the peptidyl moiety to 
tRNA is intrinsically reactive. As noted earlier, peptidyl transferase, the activity catalyz- 
ing peptide bond formation, is associated with the 50S ribosomal subunit. Indeed, 
this reaction is a property of the 23S rRNA in the 50S subunit. 


23S rRNA Is the Peptidyl Transferase Enzyme Peptide bond formation is catalyzed 
by the large rRNA in the large ribosomal subunit (e.g., the 23S rRNA in prokaryotic 
50S ribosomal subunits) through its peptidyl transferase center (PTC). No riboso- 
mal proteins lie within 1.5 nm of the PTC; thus, none can participate in the reac- 
tion mechanism. In kinetic terms, the catalytic power of ribosomes is modest: A ri- 
bosome, through the rRNA comprising its PTC, accelerates the peptide-bond 
forming reaction rate 4 X 10°-fold over the uncatalyzed reaction. 
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CENGAGENOW” ACTIVE FIGURE 30.20 The cycle of events in peptide chain elongation on E. coli 
ribosomes. The structure at the lower left is an EF-Tu: aminoacyl-tRNA complex (pdb id = 1TTT). 
Test yourself on the concepts in this figure at www.cengage.com/login. 


The Catalytic Power of the Ribosome Comes from Tight Binding of Its Peptidyl- 
tRNA and Aminoacyl-tRNA Substrates Structurally, the PTC can be described as 
a funnel-shaped active-site crater where the 3’-acceptor ends of the peptidyl-tRNA 
and aminoacyl-tRNA meet. This crater lies directly above the entrance to the pep- 
tide exit tunnel. Bases at the 3’-ends of the tRNAs base-pair with bases in the PTC. 
G2251 and G2252 of the PTC (Figure 30.22) base-pair with C75 and C74, respec- 
tively, of the P-site tRNA. PTC G2553 forms a base pair with C75 of the A-site tRNA. 
Significantly, PTC C2501 interacts with A76 of the P-site tRNA and nearby U2506 in- 
teracts with A76 of the A-site tRNA. These interactions and others lead to tight bind- 
ing of the substrates by the ribosome in a manner such that the reactive groups 
(aminoacyl and peptidyl) are juxtaposed and properly oriented for reaction to oc- 
cur. Collectively, functional groups provided by the PTC, the peptidyl-tRNA, and the 
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aminoacyl-tRNA establish a network of H bonds and electrostatic interactions that 
stabilize the highly polar transition state (Figure 30.23). In effect, interactions of the 
tRNAs with the rRNA of the PTC create a highly organized environment that facil- 
itates peptide bond synthesis. Ribosome-catalyzed protein synthesis relies more on 
proximity and orientation of substrates than on chemical catalysis. The role of the 
PTC in protein synthesis is to align the tRNA-linked substrates so that the reaction 
is facilitated. 





FIGURE 30.21 The decoding 
somal subunit is composed 
30S subunit, as viewed from 
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center of the 30S ribo- 
only of 16S rRNA. (a) The 
he 50S subunit. The circle 


(cyan) shows the latch structure of the 30S subunit that 
encircles and encloses the mRNA. The mRNA enters 


along the path indicated by 


he arrow and follows a 


groove along this face of the subunit. The location of 
the decoding center is indicated by the red circle. 


(b) 30S ribosomal subunit in 
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(Adapted from Figure 2 in Ogle, 
Structural insights into translatio 
chemistry 74:129-177.) (Adapted 
et al., 2000. Structure of the func 


.M., and Ramakrishnan, V., 2005. 
nal fidelity. Annual Review of Bio- 
from Figure 3 in Schluenzen, F, 





ionally activated small riboso- 


mal subunit at 3.3 Å resolution. Cell 102:615-623.) 


FIGURE 30.22 The peptidyl transferase active site: Base- 
pairing between cytosine residues of A-site tRNA 
(yellow) and P-site tRNA (orange) with 23S rRNA bases 
(pale green) (pdb id = 1VQN).The a-amino group of the 
aminoacyl group on the aminoacyl-tRNA (blue) is posi- 
tioned for the attack on the ester carbonyl carbon of 
the peptidyl group of the P-site tRNA (green). (Adapted 
from Figure 2 in Beringer, M.,and Rodnina, M.V., 2007. The ribo- 
somal peptidyl transferase. Molecular Cell 26:31 1-321.) 
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CENGAGENOW” ANIMATED FIGURE 30.23 The protein synthesis reaction proceeds via deprotonation of the 
a-amino group of the aminoacyl-tRNA, followed by nucleophilic attack of the a-NH> on the peptidyl-tRNA 
carbonyl carbon to form a highly polar tetrahedral intermediate. Proton transfer to the 3’-O of the tRNA that 
contributed the peptidyl chain leads to release of this tRNA in the deacylated form and formation of a“one- 
residue-longer’-peptidyl-tRNA. See this figure animated at www.cengage.com/login. 


Translocation Three things remain to be accomplished in order to return the active 
70S ribosome: mRNA complex to the starting point in the elongation cycle: 


1. The deacylated tRNA must be removed from the P site. 

2. The peptidyltRNA must be moved (translocated) from the A site to the P site. 

3. The ribosome must move one codon down the mRNA so that the next codon is 
positioned in the A site. 


The precise events in translocation are still being resolved, but several distinct steps 
are clear. The acceptor ends (the aminoacylated ends) of both A- and P-site tRNAs in- 
teract with the PTC of the 50S subunit. Because the growing peptidyl chain doesn’t 
move during peptidyl transfer, the acceptor end of the A-site aminoacyl-tRNA must 
move into the P site as its aminoacyl function picks up the peptidyl chain. At the same 
time, the acceptor end of the deacylated P-site tRNA is shunted into the E site (see 
Figure 30.15). Then, the mRNA and the anticodon ends of tRNAs move together with 
respect to the 30S subunit so that the mRNA is passively dragged one codon further 
through the ribosome. With this movement, the anticodon end of the now one- 
residue-longer peptidyl-tRNA goes from the A site of the 30S subunit to the P site. 
Concomitantly, the anticodon end of the deacylated tRNA is moved into the E site. 
These ratchetlike movements of the 30S subunit relative to the 50S subunit are cat- 
alyzed by the translocation protein elongation factor G (EF-G), which apparently cou- 
ples the energy of GTP hydrolysis to movement. Note that translocation of the mRNA 
relative to the 30S subunit will deliver the next codon to the 30S A site. 

EF-G binds to the ribosome as an EF-G:GTP complex. GTP hydrolysis is essential 
not only for translocation but also for subsequent EF-G dissociation. Because EF-G 
and EF-Tu compete for a common binding site on the ribosome (the factor-binding 
center) adjacent to the A site, EF-G release is a prerequisite for return of the 70S ribo- 
some :mRNA to the beginning point in the elongation cycle. 

In this simple model of peptidyl transfer and translocation, the ends of both 
tRNAs move relative to the two ribosomal subunits in two discrete steps, the ac- 
ceptor ends moving first and then the anticodon ends. Furthermore, the readjust- 
ments needed to reposition the ribosomal subunits relative to the mRNA and to 
one another imply that the 30S and 50S subunits must move relative to one an- 
other in ratchetlike fashion. This model provides a convincing explanation for why ribo- 
somes are universally organized into a two-subunit structure: The small and large subunits 
must move relative to each other, as opposed to moving as a unit, in order to carry out the 
process of translation. 
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GTP Hydrolysis Fuels the Conformational Changes That Drive 
Ribosomal Functions 


Two GTPs are hydrolyzed for each amino acid residue incorporated into peptide 
during chain elongation: one upon EF-Tu—mediated binding of aa-tRNA and one 
more in translocation. The role of GTP (with EF-Tu as well as EF-G) is mechani- 
cal, in analogy with the role of ATP in driving muscle contraction (see Chapter 
16). GTP binding induces conformational changes in ribosomal components that 
actively engage these components in the mechanics of protein synthesis; subse- 
quent GTP hydrolysis followed by GDP and Pi release relax the system back to the 
initial conformational state so that another turn in the cycle can take place. The 
energy expenditure for protein synthesis is at least four high-energy phosphoric 
anhydride bonds per amino acid. In addition to the two provided by GTP, two 
from ATP are expended in amino acid activation via aminoacyl-tRNA synthesis 
(see Figure 30.3). 


A DEEPER LOOK 





Molecular Mimicry—The Structures of EF-Tu: Aminoacyl-tRNA, EF-G, and RF-3 





EF-Tu and EF-G compete for binding to ribosomes. EF-Tu has the 
unique capacity to recognize and bind any aminoacyl-tRNA and de- 
liver it to the ribosome in a GTP-dependent reaction. EF-G catalyzes 
GTP-dependent translocation. The structure of the EF-Tu:tRNA 
complex is remarkably similar to the structure of EF-G (EF-Tu: 
tRNA is shown on the left in the figure; EF-G is in the center). The 
EF-Tu: tRNA structure is Thermus aquaticus EF-Tu: Phe-tRNA™® com- 
plexed with GMPCP (purple); GMPCP is a nonhydrolyzable analog 
of GTP (pdb id = 1TTT). The EF-G structure has GDP bound (pur- 
ple) (pdb id = 1DAR). Note that parts of the EF-G structure mimic 
the structure of the tRNA molecule. Both EF-Tu and EF-G bind to 
the factor-binding center, which is located on the L7/L12 side of 
50S ribosomal subunit. Part of this center has an associated GTPase 
function that plays an integral role in the binding and release of 
these factors. 





» EF-Tu:tRNA (left); EF-G (center); 
and RF-3 (right). 





Nature has extended this mimicry with RF-3, a ribosome- 
binding protein essential to the termination phase of protein syn- 
thesis. RF-3 is a GTP-binding protein that looks like a tRNA (the 
structure of RF-3 with bound GDP (purple) is shown on the right 
in the figure) (pdb id = 2H5E). 

One view of early evolution suggests that RNA was the primor- 
dial macromolecule, fulfilling all biological functions, including 
those of catalysis and information storage that are now assumed 
for the most part by proteins and DNA. The mimicry of EF-Tu: 
tRNA by EF-G may represent a fossil of early macromolecular evo- 
lution when the proteins first began to take over some functions of 
RNA by mimicking shapes known to work as RNAs. 
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Peptide Chain Termination Requires a G-Protein Family Member 


The elongation cycle of polypeptide synthesis continues until the 70S ribosome en- 
counters a “stop” codon. At this point, polypeptidyl-tRNA occupies the P site and the 
arrival of a “stop” or nonsense codon in the A site signals that the end of the polypep- 
tide chain has been reached (Figure 30.24) These nonsense codons are not “read” 
by any “terminator tRNAs” but instead are recognized by specific proteins known as 
release factors, so named because they promote polypeptide release from the ribo- 
some. The release factors bind at the A site. RF-1 recognizes UAA and UAG, whereas 
RF-2 recognizes UAA and UGA. RF-1 and RF-2 are members of the guanine nu- 
cleotide exchange factor (GEF) family of proteins, which includes EF-Ts. Like EF-G, 
RF-1 and RF-2 interact well with the ribosomal A-site structure. Furthermore, these 
release factors “read” the nonsense codons through specific tripeptide sequences 
that serve as the RF protein equivalent of the tRNA anticodon loop. There is about 
one molecule each of RF-1 and RF-2 per 50 ribosomes. Ribosomal binding of RF-1 
or RF-2 is competitive with EF-G. RF-1 or RF-2 recruit a third release factor, RF-3, 
complexed with GTP; this protein is a structural mimic of tRNA. RF-3 is the fourth 
G-protein family member (the other three are IF-2, EF-Tu, and EF-G) involved in 
protein synthesis. All share the same ribosomal binding site. The state of the pep- 
tidyl-tRNA in the P site determines which is bound and, importantly, the progression 
of protein synthesis through initiation, elongation, and termination. 

The presence of release factors with a nonsense codon in the A site creates a 70S 
ribosome: RF-1 (or RF-2) :RF-3-GTP: termination signal complex that transforms the 
ribosomal peptidyl transferase into a hydrolase. That is, instead of catalyzing the trans- 
fer of the polypeptidyl chain from a polypeptidyl-tRNA to an acceptor aminoacyl- 
tRNA, the peptidyl transferase hydrolyzes the ester bond linking the polypeptidyl 
chain to its tRNA carrier. Both RF-1 and RF-2 have a conserved GGQ sequence lo- 
cated in a loop that enters the PTC. This GGQ sequence faces A76 of the P-site tRNA; 
the closest 23S rRNA residues are A2451 and A2602. RF-1 and RF-2 promote confor- 
mational adjustments that expose the peptidyl-tRNA ester to attack by a water mole- 
cule. The conserved Gln in the release factor GGQ motif positions this hydrolytic 
water molecule through hydrogen-bonding interactions. The peptidyl transferase 
transfers the polypeptidyl chain to this water molecule instead of an aminoacyl- 
tRNA. Peptide release is followed by expulsion of RF-1 (or RF-2) from the ribosome 
(Figure 30.24). This leaves a ribosome : mRNA: P-site tRNA complex that must be dis- 
assembled by a protein ribosome recycling factor (RRF) with the help of EF-G. The 
structure of RRF resembles the letter “L”; thus, this protein is also structurally akin to 
a tRNA. 

We can now recount the central role played by GTP in protein synthesis. IF-2, 
EF-Tu, EF-G, and RF-3 are all GTP-binding proteins, and all are part of the 
G-protein superfamily (whose name is derived from the heterotrimeric G proteins 
that function in transmembrane signaling pathways, as in Figure 15.19). IF-2, EF-Tu, 
EF-G, and RF-3 interact with the same site on the 50S subunit, the factor-binding cen- 
ter, in the 50S cleft. This factor-binding center activates the GTPase activity of these 
factors, once they become bound. 


The Ribosomal Subunits Cycle Between 70S Complexes 
and a Pool of Free Subunits 


Ribosomal subunits cycle rapidly through protein synthesis. In actively growing bac- 
teria, 80% of the ribosomes are engaged in protein synthesis at any instant. Once a 
polypeptide chain is synthesized and the nascent polypeptide chain is released, the 
70S ribosome dissociates from the mRNA and separates into free 30S and 50S sub- 
units (Figure 30.25). Intact 70S ribosomes are inactive in protein synthesis because 
only free 30S subunits can interact with the initiation factors. Binding of initiation 
factor IF-3 by 30S subunits and interaction of 30S subunits with 50S subunits are mu- 
tually exclusive. 30S subunits with bound initiation factors associate with mRNA, but 
50S subunit addition requires IF-3 release from the 30S subunit. 
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FIGURE 30.25 The ribosome life cycle. Note that IF-3 is 
released prior to 50S addition. 





Polyribosomes Are the Active Structures of Protein Synthesis 


Active protein-synthesizing units consist of an mRNA with several ribosomes 
attached to it. Such structures are polyribosomes, or, simply, polysomes (Figure 
30.26). All protein synthesis occurs on polysomes. In the polysome, each ribosome 
is traversing the mRNA and independently translating it into polypeptide. The fur- 
ther a ribosome has moved along the mRNA, the greater the length of its associ- 
ated polypeptide product. In prokaryotes, as many as ten ribosomes may be found 
in a polysome. Ultimately, as many as 300 ribosomes may translate an mRNA, so as 
many as 300 enzyme molecules may be produced from a single transcript. Eukary- 
otic polysomes typically contain fewer than 10 ribosomes. 


30:6: How Are Proteins Synthesized in Eukaryotic Cells? 


Eukaryotic mRNAs are characterized by two post-transcriptional modifications: 
the 5’-terminal ‘methyl-GTP cap and the 3’-terminal poly(A) tail (Figure 30.27). 
The ‘methyl-GTP cap is essential for mRNA binding by eukaryotic ribosomes and 
also enhances the stability of these mRNAs by preventing their degradation by 
j : a : 

FIGURE 30:26 Elecizon wicrogiaph ot polysomesmuli- 5 exonucleases. The poly(A) tail enhances both the stability and translational 
ple ribosomes translating the same mRNA. (From Francke efficiency of eukaryotic mRNAs. The Shine-Dalgarno sequences found at the 
C, et al, 1982. Electron microscopic visualization of a discrete 5'-end of prokaryotic mRNAs are absent in eukaryotic mRNAs. 

class of giant translation units in salivary gland cells of Chirono- 


mus tentans. The EMBO Journal 1:59-62. Photo courtesy of Oscar 


L Miller Jt, University of Virginia) Peptide Chain Initiation in Eukaryotes 


The events in eukaryotic peptide chain initiation are summarized in Figure 30.28, 
and the properties of eukaryotic initiation factors, symbolized eIFs, are presented 
in Table 30.9. As might be expected, eukaryotic protein synthesis is considerably 
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FIGURE 30.27 The characteristic structure of eukaryotic mRNAs. Untranslated regions ranging between 40 and 
7-Methyl GTP “cap” at 150 bases in length occur at both the 5’- and 3'-ends of the mature mRNA. An initiation codon at the 5'-end, 
5'-end invariably AUG, signals the translation start site. 


more complex than prokaryotic protein synthesis. More than 100 different RNA 
molecules and 200 proteins are required just for the core translational machinery 
in Caenorhabditis elegans, a simple animal. Despite such complexity, the overall 
process is similar to prokaryotic protein synthesis. The eukaryotic initiator tRNA 
is a unique tRNA functioning only in initiation. Like the prokaryotic initiator 
tRNA, the eukaryotic version carries only Met. However, unlike prokaryotic 
fMet-ttRNA;™* the Met on this tRNA is not formylated. The eukaryotic initiator 
tRNA is usually designated tRNA™M*, with the “i” indicating “initiation.” 
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FIGURE 30.28 The three stages in the initiation of trans- 
lation in eukaryotic cells. See Table 30.9 for a description 
Ne of the functions of the eukaryotic initiation factors (elFs). 
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TABLE 30.9 
Factor Subunit Size (kD) Function 
elF1 15 Enhances initiation complex formation 
eIFIA 17 Stabilizes Met-tRNA; binding to 40S ribosomes 
eIF2 125 GTP-dependent Met-tRNA; binding to 40S ribosomes 
a 36 Regulated by phosphorylation 
B 50 Binds Met-tRNA; 
y 55 Binds GTP, Met-tRNA; 
eIF2B 270 Promotes guanine nucleotide exchange on eIF2 
eIF2C 94 Stabilizes ternary complex in presence of RNA 
eIF3 800 Promotes Met-tRNA; and mRNA binding 
eIF4F 243 Binds to mRNA caps and poly(A) tails; consists of eIF4A, eIF4E, and eIF4G; RNA helicase 
activity unwinds mRNA 2° structure 
eIF4A 46 Binds RNA; ATP-dependent RNA helicase; promotes mRNA binding to 40S ribosomes 
eIF4E 24 Binds to 5'-terminal ’methyl-GTP cap on mRNA 
eIF4G 173 Binds to PABP 
eIF4B 80 Binds mRNA; promotes RNA helicase activity and mRNA binding to 40S ribosomes 
eIF5 49 Promotes GTPase of eIF2, ejection of eIF2 and eIF3 
eIF5B 175 Ribosome-dependent GTPase activity; mediates 40S and 60S joining 
eIF6 Dissociates 80S; binds to 60S 











In particular, initiation of protein synthesis is significantly more complicated in 
eukaryotes. It can be divided into three fundamental stages: 


Stage 1: Formation of the 43S preinitiation complex (Figure 30.28, stage 1). Initiation 
factors eIF1, eIFIA, eIF3, and eIF5 bind to a 40S ribosomal subunit. Then, Met- 
tRNA™* (in the form of an eIF2:GTP:MettRNA™* ternary complex) is delivered to 
the eIF1/1A/3/5:40S subunit complex. (Unlike in prokaryotes, binding of Met- 
tRNA by eukaryotic ribosomes occurs in the absence of mRNA, so MettRNAM* 
binding is not codon-directed.) In mammals, eIF3 is an 800-kD multimer containing 
at least 13 different subunits. The eIF3 complex binds to the solvent-exposed side of 
the 40S subunit, away from the face that interacts with mRNA and the 60S subunit. A 
prominent role of eIF3 is to serve as a platform and scaffold for the recruitment of 
mRNA and other proteins involved in translation initiation. 


Stage 2: Formation of the 48S initiation complex (Figure 30.28, stage 2). This stage in- 
volves binding of the 43S preinitiation complex to mRNA and migration of the 40S ri- 
bosomal subunit to the correct AUG initiation codon. Binding of mRNA by the 43S 
preinitiation complex requires a set of proteins including eIF4 group and the (poly)A- 
binding protein (PABP). Collectively, these proteins recognize the 5'-terminal cap and 
3'-terminal poly(A) tail of an mRNA, unwind any secondary structure in the mRNA, 
and transfer the mRNA to the 43S preinitiation complex. The eIF4 group includes 
elF4B and elIF4F. eIF4F is a trimeric complex consisting of eIF4A (an ATP-dependent 
RNA helicase), eIF4E (which binds the 5’-terminal ‘methyl-GTP of mRNAs), and 
eIF4G. Because eIF4G interacts with PABP (Figure 30.29), eIF4G serves as the bridge 
between the cap-binding eIF4E, the poly(A) tail of the mRNA, and the 40S subunit 
(through interaction with eIF3). These interactions between the 5’-terminal methyl- 
GTP cap and the 3’-poly(A) tail initiate scanning of the 40S subunit in search of an 
AUG codon. 

eIF4E, the mRNA cap-binding protein of eIF4F, represents a key regulatory ele- 
ment in eukaryotic translation. eIF4F binding to the cap structure is necessary for 
association of eIF4B and formation of the 48S preinitiation complex. Translation is 
inhibited when the eIF4E subunit of eIF4F binds with 4EBP (the eIF4E binding pro- 
tein). Growth factors stimulate protein synthesis by causing the phosphorylation of 
4EBP, which prevents its binding to eIF4E. 
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Stage 3: Formation of the 80S initiation complex. When the 48S preinitiation com- 
plex stops at an AUG codon, GTP hydrolysis in the eIF2:MettRNA ternary com- 
plex causes ejection of the initiation factors bound to the 40S ribosomal subunit. 
EIF5, in conjunction with eIF5B, acts here by stimulating the GTPase activity of 
eIF2. Ejection of the eIFs is followed by 60S subunit association to form the 80S ini- 
tiation complex, whereupon translation begins (Figure 30.28, stage 3). eIF2:GDP 
is recycled to eIF2:GTP by eIF2B (eIF2B is a guanine nucleotide exchange factor) . 


Control of Eukaryotic Peptide Chain Initiation Is One Mechanism for 
Post-Transcriptional Regulation of Gene Expression 


Regulation of gene expression can be exerted post-transcriptionally through con- 
trol of mRNA translation. Phosphorylation/dephosphorylation of translational 
components is a dominant mechanism for control of protein synthesis. Several ini- 
tiation factors—elF2a, eIF2B, eIF4E, eIF4G, 40S ribosomal protein S6, and two eu- 
karyotic elongation factors, eEF1 and eEF2 (see following)—have been identified 
as targets of regulatory controls. Modification of some factors affects the rate of 
mRNA translation; modification of others affects which mRNAs are selected for 
translation. Peptide chain initiation, the initial phase of the synthetic process, is 
the optimal place for such control. Phosphorylation of S6 facilitates initiation of 
protein synthesis, resulting in a shift of the ribosomal population from inactive ri- 
bosomes to actively translating polysomes. S6 phosphorylation is stimulated by 
serum growth factors (see Chapter 32). On the other hand, the phosphorylation 
of some translational components inhibits protein synthesis. For example, the 
a-subunit of eIF2 can be reversibly phosphorylated at a specific Ser residue by an 
eIF2a kinase/phosphatase system (Figure 30.30). Four different elF2a@ kinases are 
known; each is responsive to specific metabolic signals. Phosphorylation of eIF2a 
inhibits peptide chain initiation. Phosphorylation of eIF2 by heme-regulated 
inhibitor (HRI, the heme-inhibited eIF2a kinase) is an important control govern- 
ing globin synthesis in reticulocytes. If heme for hemoglobin synthesis becomes 
limiting in these cells, eIF2a@ is phosphorylated, so globin mRNA is not translated 
and chains are not synthesized. Availability of heme inhibits HRI, leading to re- 
sumption of protein synthesis upon phosphatase-mediated removal of the phos- 
phate group from the Ser residue. 


FIGURE 30.29 The 48S initiation complex. mRNA is 
shown as a black line, its 5’-cap as a black circle, and its 
coding region as a yellow bar beginning with AUG and 
ending with UAA; downstream lies the poly(A) tail. The 
initiator Met-tRNA; is shown as a pitchfork bound to the 
AUG initiation codon. The various initiation factors are 
shown by their numerical and letter codes. Mnk is a 
stress- and mitogen-activated protein kinase that phos- 
phorylates elF4E, increasing its affinity for the 5'-cap. 
(Adapted from Figure 2 in Rhoads, R., Dinkova,T. D., and 
Korneeva, N. L., 2006. Mechanism and regulation of translation 
in C. elegans. WormBook 28:1-18.) 
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FIGURE 30.30 Control of elF2 functions through reversible phosphorylation of a Ser residue on its a-subunit. 
The phosphorylated form of elF2 (elF2—P) enters a tight complex with elF2B and is unavailable for initiation. 
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Diphtheria Toxin ADP-Ribosylates eEF2 
Diphtheria arises from infection by Corynebacterium diphtheriae bacte- tion is unknown. (EF-G of prokaryotes lacks this unusual modifica- 
ria carrying bacteriophage corynephage B. Diphtheria toxin is a tion and is not susceptible to diphtheria toxin.) Diphtheria toxin 
phage-encoded enzyme secreted by these bacteria that is capable of specifically ADP-ribosylates an imidazole-N within the diphthamide 
inactivating a number of GTP-dependent enzymes through covalent moiety of eEF2 (see accompanying figure). ADP-ribosylated eEF2 
attachment of an ADP-ribosyl moiety derived from NAD*. That is, retains the ability to bind GTP but is unable to function in protein 
diphtheria toxin is an NAD*-dependent ADP-ribosylase. One target synthesis. Because diphtheria toxin is an enzyme and can act catalyt- 
of diphtheria toxin is the eukaryotic translocation factor, eEF2. This ically to modify many molecules of its target protein, just a few mi- 
protein has a modified His residue known as diphthamide. Diph- crograms suffice to cause death. 
thamide is generated post-translationally on eEF2; its biological func- 
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Peptide Chain Elongation in Eukaryotes Resembles 
the Prokaryotic Process 


Eukaryotic peptide elongation occurs in very similar fashion to the process in 
prokaryotes. An incoming aminoacyl-tRNA enters the ribosomal A site while pep- 
tidyl-tRNA occupies the P site. Peptidyl transfer then occurs, followed by transloca- 
tion of the ribosome one codon further along the mRNA. Two elongation factors, 
eEF1 and eEF2, mediate the elongation steps. eEF1 consists of two components: 
eEFIA and eEFIB. eEFIA is the eukaryotic counterpart of EF-Tu; it serves as the 
aminoacyl-tRNA binding factor and requires GTP. eEF1B is the eukaryotic equivalent 
of prokaryotic EF-Ts; it catalyzes the exchange of bound GDP on eEF1:GDP for GTP 
so that active eEF1:GTP can be regenerated. EF2 is the eukaryotic translocation fac- 
tor. eEF2 (like its prokaryotic kin, EF-G) binds GTP, and GTP hydrolysis accompa- 
nies translocation. 


Eukaryotic Peptide Chain Termination Requires Just 
One Release Factor 


Whereas prokaryotic termination involves three different release factors (RFs), just 
one RF is sufficient for eukaryotic termination. Eukaryotic RF binding to the ribo- 
somal A site is GTP-dependent, and RF:GTP binds at this site when it is occupied 
by a termination codon. Then, hydrolysis of the peptidyl-tRNA ester bond, hydroly- 
sis of GTP, release of nascent polypeptide and deacylated tRNA, and ribosome dis- 
sociation from mRNA ensue. 


Inhibitors of Protein Synthesis 


Protein synthesis inhibitors have served two major, and perhaps complementary, pur- 
poses. First, they have been very useful scientifically in elucidating the biochemical 
mechanisms of protein synthesis. Second, some of these inhibitors affect prokaryotic 
but not eukaryotic protein synthesis and thus are medically important antibiotics. 
Table 30.10 is a partial list of these inhibitors and their mode of action. The structures 
of some of these compounds are given in Figure 30.31. 

As indicated in Table 30.10, many antibiotics inhibit protein synthesis by binding 
to key rRNA components. The two principal sites for antibiotic binding are the 16S 
rRNA of the decoding center within the small ribosomal subunit and the PTC of the 
23S rRNA in the large subunit. More antibiotics are targeted to the PTC than to the 
decoding center. 

Why is rRNA such an optimal target for antibiotics? Generally speaking, it is diffi- 
cult for organisms to defend themselves against agents that bind rRNA rather than 
protein. First of all, there are many rRNA genes, so mutation of one gene to antibi- 
otic resistance (loss in antibiotic binding, for instance) would affect few rRNA mole- 
cules among the many. Further, four bases comprise RNA (as opposed to 20 amino 
acids in proteins), so there are fewer possibilities for modification of the rRNA so that 
the RNA retains its function but loses its ability to bind the antibiotic. 


The Decoding Site Is a Target of Aminoglycoside Antibiotics Aminoglycoside an- 
tibiotics are a series of compounds with a 2-deoxy-streptamine (2-DOS) core struc- 
ture (Figure 30.32a) that is central to their action. These antibiotics bind to an un- 
paired adenine residue (A'*"*) within the 16S rRNA decoding site (that part of the 
16S rRNA that decodes the mRNA through interactions with the anticodon of the 
aminoacyl-tRNA in the A site). A! lies within a small internal rRNA loop (Figure 
30.32b) that is part of the decoding center. The sugar moiety attached to the 
4-position of the 2-DOS core forms H bonds with A'*® that limit the flexibility of 
AM? and AM on the other side of the small RNA loop; Al” and A! interact with 
codon nucleotides (see Figure 30.21b). This flexibility loss results in errors in trans- 
lation fidelity, where the code is misread, the wrong amino acid is inserted in the 
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TABLE 30.10 | n Sy 
Inhibitor Cells Inhibited Mode of Action 











Initiation 

Aurintricarboxylic acid Prokaryotic Prevents IF binding to 30S subunit 
Kasugamycin Prokaryotic Inhibits {Met-tRNA™ binding 
Streptomycin Prokaryotic Prevents formation of initiation complexes 


Elongation: Aminoacyl-tRNA Binding 


Tetracycline Prokaryotic Inhibits aminoacyl-tRNA binding at A site 
Streptomycin Prokaryotic Codon misreading, insertion of improper amino acid 
Kirromycin Prokaryotic Binds to EF-Tu, preventing conformational switch from EF-Tu: GTP to EF-Tu: GDP 


Elongation: Peptide Bond Formation 


Sparsomycin Prokaryotic Peptidyl transferase inhibitor 

Chloramphenicol Prokaryotic Binds to 50S subunit, blocks the A site and inhibits peptidyl transferase activity 

Clindamycin Prokaryotic Binds to 50S subunit, overlapping the A and P sites and blocking peptidyl transferase activity 
Erythromycin Prokaryotic Blocks the 50S subunit tunnel, causing premature peptidyl-tRNA dissociation 


Elongation: Translocation 


Fusidic acid Both Inhibits EF-G: GDP dissociation from ribosome 

Thiostrepton Prokaryotic Inhibits ribosome-dependent EF-Tu and EF-G GTPase activity 

Diphtheria toxin Eukaryotic Inactivates eEF-2 through ADP-ribosylation 

Cycloheximide Eukaryotic Inhibits translocation of peptidyl-tRNA 

Premature Termination 

Puromycin Both Aminoacyl-tRNA analog, binds at A site and acts as peptidyl acceptor, aborting peptide 
elongation 


Ribosome Inactivation 


Ricin Eukaryotic Catalytic inactivation of 28S rRNA via N-glycosidase action on A46 











growing polypeptide chain, and the protein being made is nonfunctional. Amino- 
glycoside antibiotics have been useful probes for decoding center properties. 


Many Antibiotics Target the PTC and the Peptide Exit Tunnel A chemically di- 
verse series of antibiotics targets the PTC and/or the adjacent peptide exit tunnel. 
Among these, the macrolide antibiotics are one of the clinically most important 
classes; erythromycin (see Figure 30.31), a broadly prescribed macrolide, is repre- 
sentative of this class. (The newer, semisynthetic ketolide drugs are analogs of the 
macrolide class and act in a similar manner.) X-ray crystallographic analyses of 
ribosome-—macrolide complexes show the macrolide bound within the peptide exit 
tunnel, 1 to 1.5 nm below the PTC. Macrolide binding plugs the exit tunnel, pre- 
venting the movement of the growing peptide chain through the tunnel. Protein 
synthesis is aborted and the peptidyl-tRNA eventually dissociates from the ribosome. 
Substituents bound at the 5-position of the macrolide extend outward from the pep- 
tide exit tunnel toward the PTC. (The 5-position in macrolides is the position where 
the N(CHs).containing saccharide ring of erythromycin is attached; see Figure 
30.31). All direct interactions of macrolide antibiotics with the ribosome involve 23S 
rRNA, not ribosomal proteins. Both macrolide and ketolide antibiotics adopt a con- 
formation within the ribosome that projects a polar face into the peptide exit tun- 
nel and a nonpolar face toward the tunnel wall. Hydrophobic interactions between 
the lactone-ring methyl groups and the tunnel wall contribute significantly to the 
binding energy of these drugs. Residue A? of the prokaryotic 23S rRNA lies at the 
entrance to the peptide exit tunnel and favors macrolide antibiotic binding. The 
major large-subunit rRNA of ribosomes found in the cytosol and mitochondria of 
eukaryotic cells have G at the equivalent position and are unaffected by macrolides. 


30.6 How Are Proteins Synthesized in Eukaryotic Cells? 
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Erythromycin 


TyrosyltRNA 


FIGURE 30.32 (a) Structure of geneticin, a representa- 
tive aminoglycoside antibiotic. Note the characteristic 
2-deoxystreptamine (2-DOS) core structure, in red. 
(b) The base sequence of the small RNA loop within 
the 16S rRNA decoding center. Note that unpaired 
adenine residues 1408, 1492, and 1493 constitute the 
internal loop structure. (Adapted from Figure 1 in Her- 
mann, T., 2005. Drugs targeting the ribosome. Current Opinion 
in Structural Biology 15:355-366.) 
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The most common effect of antibiotics that interact with the PTC is to occupy 
space within this center, such that the amino acid or peptidyl chain linked at the 
3’-end of a tRNA cannot be positioned properly for the peptide-bond forming re- 
action. This mode of inhibition is consistent with the catalytic role of PTC in pre- 
cisely orienting the substrates so that the peptide bond-forming reaction can occur. 
This effect is more common for aminoacyl-tRNAs in the A site, although some drugs 
can bridge the A and P sites and affect both aminoacyl-tRNA and peptidyl-tRNA ori- 
entation. Ribosomes with long peptidyl chains attached to the tRNA in the P site are 
less susceptible to macrolide antibiotics. 


SUMMARY 


30.1 What Is the Genetic Code? The genetic code is the code of bases 
that specifies the sequence of amino acids in a protein. The genetic code 
is a triplet code. Given the four RNA bases—A, C, G, and U—a total of 
4° = 64 three-letter codons are available to specify the 20 amino acids 
found in proteins. Of these 64 codons, 61 are used for amino acids, and 
the remaining 3 are nonsense, or “stop,” codons. The genetic code is 
unambiguous, degenerate, and universal. 


30.2 How Is an Amino Acid Matched with Its Proper tRNA? During 
protein synthesis, aminoacyl-tRNAs recognize the codons through base 
pairing using their anticodon loops. A second genetic code exists, the 
code by which each aminoacyl-tRNA synthetase adds its amino acid to 
tRNAs that can interact with the codons that specify its amino acid. A 
common set of rules does not govern tRNA recognition by aminoacyl- 
tRNA synthetases. The tRNA features recognized are not limited to the 
anticodon and in some instances do not even include the anticodon. 
Usually, an aminoacyl-tRNA synthetase recognizes a set of sequence ele- 
ments in its cognate tRNAs. 


30.3 What Are the Rules In Codon-Anticodon Pairing? Anticodons are 
paired with codons in antiparallel orientation. There are more codons 
than there are amino acids, and considerable degeneracy exists in the ge- 
netic code at the third base position. The first two bases of the codon and 
the last two bases of the anticodon form canonical Watson—Crick base 
pairs, but pairing between the third base of the codon and the first base 
of the anticodon follows less stringent rules, allowing some anticodons to 
recognize more than one codon, in accordance with Crick’s wobble hy- 
pothesis. Some codons for a particular amino acid are used more than 
the others. Nonsense suppression occurs when suppressor tRNAs read 
nonsense codons. 


30.4 What Is the Structure of Ribosomes, and How Are They Assem- 
bled? Ribosomes are ribonucleoprotein particles that act as mechano- 
chemical systems in protein synthesis. They move along mRNA tem- 
plates, orchestrating the interactions between successive codons and the 
corresponding anticodons presented by aminoacyl-tRNAs. Ribosomes 
catalyze the formation of peptide bonds. Prokaryotic ribosomes consist 
of two subunits, 30S and 50S, which are composed of 50 different pro- 
teins and 3 rRNAs—16S, 23S, and 5S. The general shapes of the riboso- 
mal subunits are determined by their rRNA molecules; ribosomal pro- 


teins serve a largely structural role in ribosomes. Ribosomes sponta- 
neously self-assemble in vitro. The 30S subunit provides the decoding 
center that matches up the tRNA anticodons with the mRNA codons. 
The 50S subunit has the peptidyl transferase center that catalyzes pep- 
tide bond formation. This center consists solely of 23S rRNA; the ribo- 
some is a ribozyme. Eukaryotic cytosolic ribosomes are larger than 
prokaryotic ribosomes. 


30.5 What Are the Mechanics of mRNA Translation? Ribosomes move 
along the mRNA in the 5'—>3' direction, recruiting aminoacyl-tRNAs 
whose anticodons match up with successive codons and joining amino 
acids in peptide bonds in a polymerization process that forms a particular 
protein. Protein synthesis proceeds in three distinct phases: initiation, 
elongation, and termination. Elongation involves two steps: peptide bond 
formation and translocation of the ribosome one codon further along the 
mRNA. At each stage, energy is provided by GTP hydrolysis, and specific 
soluble protein factors participate. Many of these soluble proteins are 
G-protein family members. Initiation involves binding of mRNA by the 
small ribosomal subunit, followed by binding of fMettRNA/™* that rec- 
ognizes the first codon. Elongation is accomplished via a repetitive cycle in 
which successive aminoacyl-tRNAs add to the ribosome: mRNA complex 
as directed by codon binding, the 50S subunit catalyzes peptide bond for- 
mation, and the polypeptide chain grows by one amino acid at a time. 
Ribosomes have three tRNA-binding sites: the A site, where incoming 
aminoacyl-tRNAs bind; the P site, where the growing peptidyl-tRNA chain 
is bound; and the E site, where deacylated tRNAs exit the ribosome. Ter- 
mination occurs when the ribosome encounters a stop codon in the 
mRNA. Polysomes are the active structures in protein synthesis. 


30.6 How Are Proteins Synthesized in Eukaryotic Cells? The process 
of protein synthesis in eukaryotes strongly resembles that in prokary- 
otes, but the events are more complicated. Eukaryotic mRNAs have 
5'-terminal methyl G caps and 3’-polyadenylylated tails. Initiation of eu- 
karyotic protein synthesis involves three stages and multiple proteins. 
This complexity offers many opportunities for regulation, and eukary- 
otic cells employ a variety of mechanisms for post-transcriptional regu- 
lation of gene expression. Many antibiotics are specific inhibitors of 
prokaryotic protein synthesis, making them particularly useful for the 
treatment of bacterial infections and diseases. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 
1. (Integrates with Chapter 12.) The following sequence represents 
part of the nucleotide sequence of a cloned cDNA: 


.. .CAATACGAAGCAATCCCGCGACTAGACCTTAAC... 


Can you reach an unambiguous conclusion from these data about 
the partial amino acid sequence of the protein encoded by this 
cDNA? 


2. Arandom (AG) copolymer was synthesized using a mixture of 5 parts 
adenine nucleotide to 1 part guanine nucleotide as substrate. If this 
random copolymer is used as an mRNA in a cell-free protein synthe- 
sis system, which amino acids will be incorporated into the polypep- 
tide product? What will be the relative abundances of these amino 
acids in the product? 

3. Review the evidence establishing that aminoacyl-tRNA synthetases 
bridge the information gap between amino acids and codons. Indi- 
cate the various levels of specificity possessed by aminoacyl-tRNA 


synthetases that are essential for high-fidelity translation of messen- 
ger RNA molecules. 

4. (Integrates with Chapter 11.) Draw base-pair structures for (a) a 
G:C base pair, (b) a C:G base pair, (c) a G:U base pair, and (d) a 
U:G base pair. Note how these various base pairs differ in the po- 
tential hydrogen-bonding patterns they present within the major 
groove and minor groove of a double-helical nucleic acid. 

5. Point out why Crick’s wobble hypothesis would allow fewer than 
61 anticodons to be used to translate the 61 sense codons. How 
might “wobble” tend to accelerate the rate of translation? 

6. How many codons can mutate to become nonsense codons through 
a single base change? Which amino acids do they encode? 

7. Nonsense suppression occurs when a suppressor mutant arises that 
reads a nonsense codon and inserts an amino acid, as if the nonsense 
codon were actually a sense codon. Which amino acids do you think 
are most likely to be incorporated by nonsense suppressor mutants? 

8. Why do you suppose eukaryotic protein synthesis is only 10% as fast 
as prokaryotic protein synthesis? 

9. If the tunnel through the large ribosomal subunit is 10 nm long, how 
many amino acid residues might be contained within it? (Assume 
that the growing polypeptide chain is in an extended B-sheet—like 
conformation.) 

10. Eukaryotic ribosomes are larger and more complex than prokary- 
otic ribosomes. What advantages and disadvantages might this 
greater ribosomal complexity bring to a eukaryotic cell? 

11. What ideas can you suggest to explain why ribosomes invariably exist 
as two-subunit structures, instead of a larger, single-subunit entity? 

12. How do prokaryotic cells determine whether a particular methionyl- 
tRNA™* is intended to initiate protein synthesis or to deliver a Met 
residue for internal incorporation into a polypeptide chain? How do 
the Met codons for these two different purposes differ? How do eu- 
karyotic cells handle these problems? 

13. What is the Shine-Dalgarno sequence? What does it do? The effi- 
ciency of protein synthesis initiation may vary by as much as 100-fold 
for different mRNAs. How might the Shine-Dalgarno sequence be 
responsible for this difference? 

14. In the protein synthesis elongation events described under the section 
on translocation, which of the following seems the most apt account 
of the peptidyl transfer reaction: (a) The peptidyltRNA delivers its 
peptide chain to the newly arrived aminoacyl-tRNA situated in the A 
site, or (b) the aminoacyl end of the aminoacyl-tRNA moves toward 
the P site to accept the peptidyl chain? Which of these two scenarios 
makes more sense to you? Why? 
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15. (Integrates with Chapter 15.) Why might you suspect that the elon- 
gation factors EF-Tu and EF-Ts are evolutionarily related to the 
G proteins of membrane signal transduction pathways described in 
Chapter 15? 

16. How many ATP equivalents are consumed for each amino acid 
added to an elongating polypeptide chain during the process of 
protein synthesis? 

17. Go to www.pdb.org and bring up PDB file 1GIX, which shows the 30S 
ribosomal subunit, the three tRNAs, and mRNA. In the box on the 
right titled “Images and Visualization,” click “All Images,” and then 
scroll down to look at the Interactive View. By moving your cursor 
over the image, you can rotate it to view it from any perspective. 

a. How are the ribosomal proteins represented in the image? 

b. How is the 16S rRNA portrayed? 

c. Rotate the image to see how the tRNAs stick out from the struc- 
ture. Which end of the tRNA is sticking out? 

d. Where will these ends of the tRNAs lie when the 50S subunit binds 
to this complex? 

18. Go back to www.pdb.org and bring up PDB file 1FFK, which shows 
the 50S ribosomal subunit. In the box titled “Images and Visualiza- 
tion,” click “All Images.” Scroll down to look at the Interactive View. 
Right-click the image to discover more information and tools. 

a. How many atoms are represented in this structure? 

b. Are the bases of the nucleotides visible anywhere in the structure? 

c. Can you find double helical regions of RNA? 

d. Right-click and, under “Select,” select all proteins. Right-click 
again and select “Render,” then “Scheme,” and then “CPK Space- 
fill” to highlight the ribosomal proteins. Go back and cancel the 
protein selection. Then select “Nucleic,” and render nucleic acid 
in “CPK Spacefill.” Which macromolecular species seems to pre- 
dominate the structure? 


Preparing for the MCAT Exam 


19. Review the list of Shine-Dalgarno sequences in Figure 30.18 and se- 
lect the one that will interact best with the 3’-end of E. coli 16S rRNA. 

20. Chloramphenicol (Figure 30.31) inhibits the peptidyl transferase ac- 
tivity of the 50S ribosomal subunit. The 50S peptidyl transferase active 
site consists solely of functionalities provided by the 23S rRNA. What 
sorts of interactions do you think take place when chloramphenicol 
binds to the peptidyl transferase center? Which groups on chloram- 
phenicol might be involved in these interactions? 
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Completing the Protein Life 
Cycle: Folding, Processing, 
and Degradation 





ESSENTIAL QUESTION 


Proteins are the agents of biological function. Protein turnover (synthesis and decay) 
is a fundamental aspect of each protein's natural history. 


How are newly synthesized polypeptide chains transformed into mature, active 
proteins, and how are undesired proteins removed from cells? 


The human genome apparently contains about 20,500 genes, but some estimates 
suggest that the total number of proteins in the human proteome may approach 
1 million. What processes introduce such dramatically increased variation into the 
products of protein-encoding genes? We’ve reviewed (or will soon cover) many of 
these processes; a partial list (with examples) includes: 


1. Gene rearrangements (immunoglobulin G) 

2. Alternative splicing (fast skeletal muscle troponin T) 

3. RNA editing (apolipoprotein B) 

4. Proteolytic processing (chymotrypsinogen or prepro-opiomelanocortin; see 
Chapter 32) 

5. Isozymes (lactate dehydrogenase) 

6. Protein sharing (the glycolytic enzyme enolase is identical to t-crystallin in the eye) 

7. Protein-protein interactions at many levels (oligomerization, supramolecular 
complexes, assembly of signaling pathway protein complexes upon scaffold 
proteins) 

8. Covalent modifications of many kinds (phosphorylation or glycosylation, with 
multisite phosphorylation or variable degrees of glycosylation, to name just two 
of the dozens of possibilities) 


Thus, the nascent polypeptide emerging from a ribosome is not yet the agent 
of biological function that is its destiny. First, the polypeptide must fold into its na- 
tive tertiary structure. Even then, seldom is the nascent, folded protein in its final 
functional state. Proteins often undergo various proteolytic processing reactions 
and covalent modifications as steps in their maturation to functional molecules. 
Finally, at the end of their usefulness, damaged by chemical reactions or dena- 
tured due to partial unfolding, they are degraded. In addition, some proteins are 
targeted for early destruction as part of regulatory programs that carefully control 
available amounts of particular proteins. Damaged or misfolded proteins are a se- 
rious hazard; accumulation of protein aggregates can be a cause of human dis- 
ease, including the prion diseases (see Chapter 28) and diseases of amyloid accu- 
mulation, such as Alzheimer’s, Parkinson’s, or Huntington’s disease. 


3lwle How Do Newly Synthesized Proteins Fold? 


As Christian Anfinsen pointed out 40 years ago, the information for folding each pro- 
tein into its unique three-dimensional architecture resides within its amino acid 
sequence or primary structure (see Chapter 6). Proteins begin to fold even before 
their synthesis by ribosomes is completed (Figure 31.la). However, the cytosolic envi- 
ronment is a very crowded place, with effective protein concentrations as high as 
0.3 grams/mL. Macromolecular crowding enhances the likelihood of nonspecific pro- 
tein association and aggregation. The primary driving force for protein folding is the 
burial of hydrophobic side chains away from the aqueous solvent and reduction in 
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Vong's Flying Crane. Origami—the Asian art of paper 
folding—arose in China almost 2000 years ago when 
paper was rare and expensive and the folded shape 
added special meaning. Protein folding, like origami, 
takes a functionless form and creates a structure with 
unique identity and purpose. 





Life is a process of becoming, a combination of 
states we have to go through. 


Anais Nin (1903-1977) 
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Cellular Levels of Specific Proteins? 


Nascent means “undergoing the process of 
being born’ or, in the molecular sense, newly 
synthesized.” 
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Completing the Protein Life Cycle: Folding, Processing, and Degradation 


HUMAN BIOCHEMISTRY 


Alzheimer’s, Parkinson’s, and Huntington’s Disease Are Late-Onset 
Neurodegenerative Disorders Caused by the Accumulation of Protein Deposits 


As noted in Chapter 6, protein misfolding problems can cause dis- 
ease by a variety of mechanisms. For example, protein aggregates 
can impair cell function. Amyloid plaques (so named because they 
resemble the intracellular starch, or amyloid, deposits found in plant 
cells) and neurofibrillary tangles (NFTs) are proteinaceous deposits 
found in the brains of individuals suffering from any of several neu- 
rogenerative diseases. In each case, the protein is different. In 
Alzheimer’s, disease is caused both by extracellular amyloid deposits 
composed of proteolytic fragments of the amyloid precursor protein 
(APP) termed amyloid-6 (Af) and intracellular NFTs composed of 
the microtubule-binding protein tau (7). Af is a peptide 39 to 
43 amino acids long that polymerizes to form long, highly ordered, 
insoluble fibrils consisting of a hydrogen-bonded parallel B-sheet 
structure in which identical residues on adjacent chains are aligned 
directly, in register (see accompanying figure). Why A£ aggregates 
in some people but not others is not clear. In Parkinson’s, the culprit 
is NFTs composed of polymeric 7; no amyloid plaques are evident. In 
Huntington’s disease, the protein deposits occur as nuclear inclu- 
sions composed of polyglutamine (polyQ) aggregates. PolyQ aggre- 
gates arise from mutant forms of huntingtin, a protein that charac- 
teristically has a stretch of glutamine residues close to its N-terminus. 
Huntingtin is a 3144residue protein encoded by the /TD gene, 
which has 67 exons. Exon | encodes the polyglutamine region. In- 
dividuals whose huntingtin gene has fewer that 35 CAG (glutamine 
codon) repeats never develop the disease; those with 40 or more al- 
ways develop the disease within a normal lifetime. The nuclear in- 
clusions in Huntington’s disease are huntingtin-derived polygluta- 
mine fragments that have aggregated to form B-sheet—containing 
amyloid fibrils. 

Impairment of cellular function by proteinaceous deposits 
may be a general phenomenon. In vitro experiments have 
demonstrated that aggregates of proteins not associated with dis- 


ease can be cytotoxic, and the ability to form amyloid deposits is 
a general property of proteins. The evolution of chaperones to 
assist protein folding and proteasomes to destroy improperly 
folded proteins may have been driven by the necessity to prevent 
protein aggregation. 





A A model for the AB structural unit in B-amyloid fibrils. Fibrils con- 
tain B-strands perpendicular to the fibril axis, with interstrand hydrogen 
bonding parallel to the fiber axis. The top face of the -sheet is hydro- 
phobic and presumably interacts with neighboring AB molecules in fibril 
formation. (Figure adapted from Figure 1 in Thompson, L. K., 2003. Unraveling the 
secrets of Alzheimer’s B-amyloid fibrils. Proceedings of the National Academy of 
Sciences, U.S.A. 100:383-385.) 








solvent-accessible surface area (see Chapter 6). The folded protein typically has a 
buried hydrophobic core and a hydrophilic surface. Protein aggregation is typically 
driven by hydrophobic interactions, so burial of hydrophobic regions through fold- 
ing is a crucial factor in preventing aggregation. To evade such problems, nascent pro- 
teins are often assisted in folding by a family of helper proteins known as molecular 
chaperones (see Chapter 6), because, like the chaperones at a prom, their purpose is 
to prevent inappropriate liaisons. Chaperones also serve to shepherd proteins to their 
ultimate cellular destinations. Also, mature proteins that have become partially un- 
folded may be rescued by chaperone-assisted refolding. 


Chaperones Help Some Proteins Fold 


A number of chaperone systems are found in all cells. Many of the proteins in these 
systems are designated by the acronym Hsp (for heat shock protein) and a number 
indicating their relative mass in kilodaltons (as in Hsp60). Hsps were originally 
observed as abundant proteins in cells given brief exposure to high temperature 
(42° C or so). The principal Hsp chaperones are Hsp70, Hsp60 (the chaperonins), 
and Hsp90. In general, proteins whose folding is chaperone-dependent pass down 
a pathway in which Hsp70 acts first on the newly synthesized protein and then 
passes the partially folded intermediate to a chaperonin for completion of folding. 

Nascent polypeptide chains exiting the large ribosomal subunits are met by 
ribosome-associated chaperones (TF, or trigger factor, in Escherichia coli; NAC 
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[nascent chain-associated complex] in eukaryotes). In E. coli, the 50S ribosomal pro- 
tein L23, which is situated at the peptide exit tunnel, serves as the docking site for 
TF, directly linking protein synthesis with chaperone-assisted protein folding. TF 
and NAC mediate transfer of the emerging nascent polypeptide chain to the Hsp70 
class of chaperones, although many proteins do not require this step for proper 
folding. 


Hsp70 Chaperones Bind to Hydrophobic Regions of Extended 
Polypeptides 


In Hsp70-assisted folding, proteins of the Hsp70 class bind to nascent polypeptide 
chains while they are still on ribosomes (Figure 31.1b). Hsp70 (known as DnaK in 
E. coli) recognizes exposed, extended regions of polypeptides that are rich in hydro- 
phobic residues. By interacting with these regions, Hsp70 prevents nonproductive 
associations and keeps the polypeptide in an unfolded (or partially folded) state 
until productive folding interactions can occur. Completion of folding requires re- 
lease of the protein from Hsp70; release is energy-dependent and is driven by ATP 
hydrolysis. 

Hsp70 proteins such as DnaK consist of two domains: a 44-kD N-terminal ATP- 
binding domain and an 18-kD central domain that binds polypeptides with exposed 
hydrophobic regions (Figure 31.2a). The DnaK: ATP complex receives an unfolded 
(or partially folded) polypeptide chain from DnaJ (Figure 31.2b). DnaJ is an Hsp40 
family member. Interaction of DnaK with DnaJ triggers the ATPase activity of DnaK; 
the DnaK: ADP complex forms a stable complex with the unfolded polypeptide, pre- 
venting its aggregation with other proteins. A third protein, GrpE, catalyzes nucleo- 
tide exchange on Dnak, replacing ADP with ATP, which converts DnaK back to a 
conformational form having low affinity for its polypeptide substrate. Release of the 
polypeptide gives it the opportunity to fold. Multiple cycles of interaction with DnaK 
(or Hsp70) give rise to partially folded intermediates or, in some cases, completely 
folded proteins. The partially folded intermediates may be passed along to the 
Hsp60/chaperonin system for completion of folding (Figure 31.1c). 


(a) Domain organization and structure of the Hsp70 family (b) 
member, DnaK 


ATPase 
domain 


Peptide-binding 
domain 


385 393 537 638 


N 





FIGURE 31.2 Structure and function of Dnak: (a) Domain organization and structure of the Hsp70 family mem- 
ber, Dnak. The ribbon diagram on the lower left is the ATP-binding domain of the Dnak analog, bovine Hsc70; 
bound ADP is shown as a stick diagram (purple). The ribbon diagram on the lower right is the polypeptide- 
binding domain of Dnak. The small blue ovals highlight the position of the polypeptide substrate; the protein 
regions that bind the polypeptide substrate are blue-green. (b) DnaK mechanism of action: DnaJ binds an un- 
folded protein (U) or partially folded intermediate (I) and delivers it to the DnaK: ATP complex. The nucleotide 
exchange protein GrpE replaces ADP with ATP on Dnak and the partially folded intermediate ([I]) is released. 

| has several possible fates: It may fold into the native state, N; it may undergo another cycle of interaction with 
DnaJ and Dnak; or it may be become a substrate for folding by the GroEL chaperonin system. (Adapted from 
Figures 1a and 2a in Frydman, J., 2001. Folding of newly translated proteins in vivo: The role of molecular chaperones. Annual 
Review of Biochemistry 70:603-647.) 
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FIGURE 31.1 Protein folding pathways. (a) Chaperone- 
independent folding. The protein folds as it is synthesized 
on the ribosome (green) (or shortly thereafter). (b) Hsp70- 
assisted protein folding. Hsp70 (gray) binds to nascent 
polypeptide chains as they are synthesized and assists 
their folding. (c) Folding assisted by Hsp70 and chaper- 
onin complexes. The chaperonin complex in E. coli is 
GroES-GroEL. The majority of proteins fold by pathways 
(a) or (b). (Adapted from Figure 2 in Netzer, W.J., and Hartl, F.U., 
1998. Protein folding in the cytosol: Chaperonin-dependent and 
-independent mechanisms. Trends in Biochemical Sciences 23: 
68-73; and Figure 2 in Hartl, F. U., and Hayer-Hartl, M., 2002. Molec- 
ular chaperones in the cytosol: From nascent chain to folded pro- 
tein, Science 295:1852-1858.) 
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How Does ATP Drive Chaperone-Mediated Protein Folding? 


The chaperones that mediate protein folding do so in an ATP- 
dependent manner, as illustrated in Figures 31.2 and 31.3. The 
affinity of chaperones for their unfolded or misfolded protein sub- 
strates is determined by the nature of the nucleotide bound by the 
ATP-binding domain of these proteins, which functions as an 
ATPase. If ATP is bound, the chaperone adopts a conformation with 
an open substrate-binding pocket. ATP increases the rate of associ- 
ation of the chaperone Hsp70 (DnaK) with an unfolded peptide or 
protein substrate by 100-fold, but it increases the rate of dissociation 
of the unfolded protein from the chaperone even more, by a factor 
of 1000. Overall, the chaperone’s affinity for an unfolded protein 
substrate decreases 10-fold (or more) when it binds ATP. 

On the other hand, if the substrate-binding site on the peptide- 
binding domain of DnaK is occupied by an unfolded protein sub- 
strate in conjunction with binding of the co-chaperone (DnaJ in 
Figure 31.2), ATP hydrolysis by the ATPase domain is triggered. 
The presence of ADP in the ATP-binding (ATPase) domain causes 
a shift in the substrate-binding site of the peptide-binding domain 
to a closed conformation, high-affinity state. Thus, ATP-dependent 
chaperones cycle between two stable conformational states, just 
like allosteric proteins. Bound ATP favors the open conformation 
for the protein substrate-binding site, and ADP favors the closed 
conformation. (When ADP is released and no nucleotide occupies 
the ATP-binding site of the ATPase domain, the peptide-binding 
site remains in the closed, high-affinity conformation; see Figure 
31.2). What is the underlying mechanism that controls these ATP- 
regulated conformational changes? 


The ATP-Dependent Allosteric Regulation of Hsp70 Chaperones 
Is Controlled by a Proline Switch 


Clearly, the two domains of Hsp70 (DnaK)—the peptide-binding 
domain and the ATP-binding (or ATPase) domain—communicate 
with each other, because the nature of the nucleotide bound to the 
ATP domain determines the affinity of the peptide-binding domain 
for unfolded substrates. Markus Vogel, Bernd Bukau, and Matthais 
Mayer of the Center for Molecular Biology at the University of 
Heidelberg (Germany) argue that four distinct elements are 
needed for communication between these separate domains: an 
ATP sensor (which must include residues within the ATP-binding 
site), a transducer (to communicate the presence of ATP to the dis- 


tant peptide-binding site), a lever (operated by the ATP-binding 
domain to exert its effect on the distant peptide-binding domain), 
and a switch that controls the lever (the switch is needed to lock the 
protein in either the open conformation or the closed conforma- 
tion so that either alternative conformation is stable). 

The switch that controls the conformational transitions of 
Hsp70 involves two universally conserved residues in the ATPase 
domain of Hsp70 family members, a proline (Pro!) and a 
surface-exposed arginine (Arg!!; panel a of the figure). Pro! is 
the switch, and Arg’! is a relay for the lever. Replacement of either 
of these residues by amino acid substitutions disrupts and/or 
destabilizes the switch. Other nearby residues, Glu'”! and Lys”, 
function as ATP sensors. It is believed that Lys” also serves as the 
nucleophile that initiates ATP hydrolysis through attack on the 
y-phosphate of ATP. Arg’®! acts as a relay between Pro!*, events 
occurring during ATP hydrolysis, and the peptide-binding domain 
(panels b-d of the figure). When ATP binding is sensed by Lys” 
and Glu!”! (panel b), Pro!* is shifted, which causes Arg!®! to move 
in the direction of the peptide-binding domain of DnaK (panel c). 
In turn, this protein-binding domain assumes the open, low- 
affinity conformation. The interaction of DnaK with an unfolded 
protein substrate and co-chaperone DnaJ moves Arg?! back to- 
ward Pro!#’, which causes Lys” and Glu!” to initiate ATP hydroly- 
sis (panel d). ADP now occupies the nucleotide-binding site of the 
ATPase domain, and the protein-binding domain of DnaK is 
locked in the closed, high-affinity conformation. 

The consequence of these events is that DnaK cycles between 
binding and releasing unfolded (or partially folded) proteins, 
fueled by ATP hydrolysis within the ATPase domain. In effect, ATP 
binding and hydrolysis drive DnaK from an open, low-affinity con- 
formational state to a closed, high-affinity conformational state. 
When unfolded (or partially folded) proteins are not held by DnaK, 
they have the opportunity to fold so that any solvent-accessible 
hydrophobic surfaces they might still retain are buried. Once a pro- 
tein has adopted a stable folded state, it lacks exposed hydrophobic 
surfaces and thus escapes the cycle of binding and release by DnaK. 
More generally, Hsp70 provides an elegant example of protein con- 
formational transitions based on the binding of ATP versus ADP at 
an effector site, with the added dimension that the effector site in 
this case is also an ATPase. 





The GroES—GroEL Complex of E. coli Is an Hsp60 Chaperonin 


The Hsp60 class of chaperones, also known as chaperonins, assists some partially 
folded proteins to complete folding after their release from ribosomes. Chaperonins 
sequester partially folded molecules from one another (and from extraneous inter- 
actions), allowing folding to proceed in a protected environment. This protected 
environment is sometimes referred to as an “Anfinsen cage” because it provides an 
enclosed space where proteins fold spontaneously, free from the possibility of aggre- 
gation with other proteins. Chaperonins are large, cylindrical protein complexes 
formed from two stacked rings of subunits. The chaperonins have been organized 
into two groups, I and II, on the basis of their source and structure. Group I chaper- 
onins are found in bacteria, group II in archaea and eukaryotes. The group I chap- 
eronin in £. coli is the GroES—GroEL complex (Figure 31.1c). GroEL is made of two 
stacked seven-membered rings of 60-kD subunits that form a cylindrical a4 oligomer 
15 nm high and 14 nm wide (Figure 31.3). Each GroEL ring has a 5-nm central cav- 
ity where folding can take place. This cavity can accommodate proteins up to 60 kD 
in size. GroES, sometimes referred to as a co-chaperonin, consists of a single seven- 
membered ring of 10-kD subunits that sits like a dome on one end of GroEL (Figure 
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A (a) Side chains involved in the ATP- and ADP-dependent allosteric regulation of the Hsp70 family 
member DnaK. Lower numbers indicate DnaK amino acid residues; upper numbers indicate the cor- 
responding residues in Hsc70 (the human counterpart to DnaK). This illustration highlights 
residues in or near the ATP-binding site of the ATPase domain. The site is occupied by ADP, P;, one 
Mg?*, and two K+ ions. See Figure 31.2a for the location of this site within the ATPase domain. 
Carbon atoms are gray; oxygen, red; nitrogen, blue; and phosphorus, yellow. P143 sits at the center 
of an H-bonded network of residues, which includes K70, Y145, F146, R151, and E171. (b-d) The 
mechanism of ATP- and ADP-dependent allosteric transitions in Hsp70. Events in the ATP-binding 
site of the ATPase domain are communicated to the protein-binding site of the peptide-binding 
domain through K70 and E171, which act as ATP sensors; P143, which acts as the switch; and R151, 
which is part of the lever that relays the events from the ATPase domain to the other domain. 

(a: Adapted from Figure 1 in Vogel, M., Bukau, B., and Mayer, M. P., 2006. Allosteric regulation of Hsp70 chaperones by a 
proline switch. Molecular Cell 21:359-367. Courtesy of Bernd Bukau and Matthias Mayer. b-d: Adapted from Figure 6 in 
Vogel, M., Bukau, B., and Mayer, M. P., 2006. Allosteric regulation of Hsp70 chaperones by a proline switch. Molecular Cell 
21:359-367.) 





31.3). The end of GroEL where GroKES is sitting is referred to as the apical end. Each 
GroEL subunit has two structural domains: an equatorial domain that binds ATP and 
interacts with neighbors in the other «y ring and an apical domain with hydrophobic 
residues that can interact with hydrophobic regions on partially folded proteins. The 
apical domain hydrophobic patches face the interior of the central cavity. An un- 
folded (or partially folded) protein binds to the apical patches and is delivered to the 
central cavity of the upper a7 ring (Figure 31.3c). ATP binding to the subunits of the 
upper æy ring causes rapid (<100 msec), forced unfolding of the substrate protein, 
followed by two events that occur on a slower time scale (~1 sec): (1) GroES is re- 
cruited to GroEL, and (2) the a-subunits undergo a conformational change that 
buries their hydrophobic patches. The a-subunits now present a hydrophilic surface 
to the central cavity. This change displaces the bound partially folded polypeptide 
into the sheltered hydrophilic environment of the central cavity, where it can fold, 
free from danger of aggregation with other proteins. GroES also promotes ATP hy- 
drolysis (Figure 31.3c). The GroEL: ADP: GroES complex dissociates when ATP binds 
to the subunits of the other (lower) @7 ring. Dissociation of GroES allows the partially 
folded (or folded) protein to escape from GroEL. If the protein has achieved its na- 
tive conformation, its hydrophobic residues will be buried in its core and the hy- 
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FIGURE 31.3 Structure and function of the GroEL-GroES 
complex. (a) Structure and overall dimensions of 
GroEL-GroES (top view, left; side view, right) (pdb id = 
1AON). (b) Section through the center of the complex 
to reveal the central cavity. (c) Model of the GroEL cylin- 
der (blue) in action. An unfolded (U) or partially folded (l) 
polypeptide binds to hydrophobic patches on the api- 
cal ring of a7-subunits, followed by ATP binding, forced 
protein unfolding, and GroES (red) association. 
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drophobic patches on the a7 rings will have no affinity for it. On the other hand, if 
the protein is only partially folded, it may be bound again, gaining access to the 
Anfinsen cage of the a7 ring and another cycle of folding. The folding of rhodanese, 
a 33-kD protein, requires the hydrolysis of about 130 equivalents of ATP. 

The group II chaperonin and eukaryotic analog of GroEL, CCT (also called 
TriC) is also a double-ring structure, but each ring consists of eight different sub- 
units that vary in size from 50 to 60 kD. Furthermore, group II chaperonins lack a 
GroES counterpart. Prefoldin (also known as GimC), a hexameric protein com- 
posed of six subunits from two related classes (two a and four £), can serve as a 
co-chaperone for CCT, much as GroES does for GroEL. However, prefoldin also 
acts like an Hsp70 protein, because it binds unfolded polypeptide chains emerging 
from ribosomes and delivers them to CCT. Prefoldin resembles a jellyfish, with six 
tentacle-like coiled coils extending from a barrel-shaped body. The ends of the ten- 
tacles have hydrophobic patches for binding unfolded proteins. 

Prior to substrate protein binding, CCT exists in a partly open state. ATP bind- 
ing opens the ring even more, a state in which prefoldin delivers the substrate pro- 
tein. ATP hydrolysis closes the chamber and drives the folding process. ATP- 
induced conformation changes that promote protein folding propagate from one 
subunit to the next around the ring structure. 


The Eukaryotic Hsp90 Chaperone System Acts on Proteins 
of Signal Transduction Pathways 


Hsp 90 constitutes 1% to 2% of the total cytosolic proteins of eukaryotes, its abundance 
reflecting its importance. Like other Hsp chaperones, its action depends on cyclic 
binding and hydrolysis of ATP. Conformational regulation of signal transduction mol- 
ecules seems to be a major purpose of Hsp90. Receptor tyrosine kinases, soluble tyro- 


31.2 How Are Proteins Processed Following Translation? 


sine kinases, and steroid hormone receptors are some of the signal transduction mol- 
ecules (see Chapter 32) that must associate with Hsp90 in order to become fully com- 
petent; proteins fitting this description are called Hsp90 “client proteins.” The matu- 
ration of Hsp70 client proteins requires other proteins as well, and together with 
Hsp90, these proteins come together to form an assembly that has been called a 
foldosome. CFTR (cystic fibrosis transmembrane regulator), telomerase, and nitric 
oxide synthase are also Hsp90-dependent. 

Association of nascent polypeptide chains with proteins of the various chaperone 
systems commits them to a folding pathway, redirecting them away from degrada- 
tion pathways that would otherwise eliminate them from the cell. However, if these 
protein chains fail to fold, they are recognized as non-native and targeted for 
destruction. 


3:le2s How Are Proteins Processed Following Translation? 


Aside from these folding events, release of the completed polypeptide from the ri- 
bosome is not necessarily the final step in the covalent construction of a protein. 
Many proteins must undergo covalent alterations before they become functional. 
In the course of these post-translational modifications, the primary structure of a 
protein may be altered, and/or novel derivations may be introduced into its 
amino acid side chains. Hundreds of different amino acid variations have been 
described in proteins, virtually all arising post-translationally. The list of such 
modifications is very large; some are rather commonplace, whereas others are pe- 
culiar to a single protein. The diphthamide moiety in elongation factor eEF-2 is 
one example of an amino acid modification (see the Human Biochemistry box on 
page 980 in Chapter 30); the fluorescent group of green fluorescent protein 
(GFP; see Chapter 4) is another. In addition, common chemical groups such as 
carbohydrates and lipids may be covalently attached to a protein during its matu- 
ration. Phosphorylation, acetylation, and methylation of proteins are common 
mechanisms for regulating protein function. Interestingly, many proteins are 
modified in multiple ways, and many post-translational modifications act in com- 
binations—a phenomenon termed cross-talk. (The majority of proteins in cells 
can be phosphorylated on one or more residues. A survey of some of the more 
prominent chemical groups conjugated to proteins is given in Chapter 5.) To put 
a number on the significance of post-translational modifications, we have seen 
that the number of human proteins is estimated to exceed the number of human 
genes (20,000 or so) by more than an order of magnitude. 


Proteolytic Cleavage Is the Most Common Form 
of Post-Translational Processing 


Proteolytic cleavage, as the most prevalent form of protein post-translational 
modification, merits special attention. The very occurrence of proteolysis as a 
processing mechanism seems strange: Why join a number of amino acids in se- 
quence and then eliminate some of them? Three reasons can be cited. First, di- 
versity can be introduced where none exists. For example, a simple form of pro- 
teolysis, enzymatic removal of N-terminal Met residues, occurs in many proteins. 
Met-aminopeptidase, by removing the invariant Met initiating all polypeptide 
chains, introduces diversity at N-termini. Second, proteolysis serves as an activa- 
tion mechanism so that expression of the biological activity of a protein can be 
delayed until appropriate. A number of metabolically active proteins, including 
digestive enzymes and hormones, are synthesized as larger inactive precursors 
termed pro-proteins that are activated through proteolysis (see zymogens, Chap- 
ter 15). The N-terminal pro-sequence on such proteins may act as an intramole- 
cular chaperone to ensure correct folding of the active site. Third, proteolysis is 
involved in the targeting of proteins to their proper destinations in the cell, a 
process known as protein translocation. 
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FIGURE 31.4 General features of the N-terminal signal 
sequences on E.coli proteins destined for translocation: 
a basic N-terminal region, a central apolar domain, and a 
nonhelical C-terminal region. 


3'le3n How Do Proteins Find Their Proper Place in the Cell? 


Proteins are targeted to their proper cellular locations by signal sequences: Proteins 
destined for service in membranous organelles or for export from the cell are syn- 
thesized in precursor form carrying an N-terminal stretch of amino acid residues, 
or leader peptide, that serves as a signal sequence. In effect, signal sequences serve as 
“zip codes” for sorting and dispatching proteins to their proper compartments. 
Thus, the information specifying the correct cellular localization of a protein is 
found within its structural gene. Once the protein is routed to its destination, the 
signal sequence is often, but not always, proteolytically clipped from the protein by 
a signal sequence-specific endopeptidase called a signal peptidase. 


Proteins Are Delivered to the Proper Cellular Compartment 
by Translocation 


Protein translocation is the name given to the process whereby proteins are inserted 
into membranes or delivered across membranes. Protein translocation occurs in all 
cells. Newly synthesized chains of membrane proteins or secretory proteins are tar- 
geted to the plasma membrane (in prokaryotes) or the endoplasmic reticulum (in 
eukaryotes) by their signal sequences. In addition to the ER, a number of eukary- 
otic membrane systems are competent in protein translocation, including the mem- 
branes of the nucleus, mitochondria, chloroplasts, and peroxisomes. Several com- 
mon features characterize protein translocation systems: 


1. Proteins to be translocated are made as preproteins containing contiguous 
blocks of amino acid sequence that act as organelle-specific sorting signals. 

2. Signal recognition particles (SRPs) recognize the presence of a nascent protein 
chain in the ribosomal exit tunnel and, together with signal receptors (SRs), de- 
liver the nascent chain to the membrane. If the nascent sequence emerging from 
the ribosome is a signal sequence, it is delivered to a specific membrane protein 
complex, the éranslocon, that mediates protein integration into the membrane or 
protein translocation across the membrane. 

3. Translocons are selectively permeable protein-conducting channels that catalyze 
movement of the proteins across the membrane, and metabolic energy in the 
form of ATP, GTP, or a membrane potential is essential. In eukaryotes, ATP- 
dependent chaperone proteins within the membrane compartment usually as- 
sociate with the entering polypeptide and provide the energy for translocation. 
Proteins destined for membrane integration contain amino acid sequences that 
act as stop-transfer signals, allowing diffusion of transmembrane segments into 
the bilayer. 

4. Preproteins are maintained in a loosely folded, translocation-competent confor- 
mation through interaction with molecular chaperones. 


Prokaryotic Proteins Destined for Translocation Are Synthesized 
as Preproteins 


Gram-negative bacteria typically have four compartments: cytoplasm, plasma (or in- 
ner) membrane, periplasmic space (or periplasm), and outer membrane. Most pro- 
teins destined for any location other than the cytoplasm are synthesized with amino- 
terminal leader sequences 16 to 26 amino acid residues long. These leader sequences, 
or signal sequences, consist of a basic N-terminal region, a central domain of 7 to 13 
hydrophobic residues, and a nonhelical C-terminal region (Figure 31.4). The con- 
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served features of the last part of the leader, the C-terminal region, include a helix- 
breaking Gly or Pro residue and amino acids with small side chains located one and 
three residues before the proteolytic cleavage site. Unlike the basic N-terminal and 
nonpolar central regions, the C-terminal features are not essential for translocation 
but instead serve as recognition signals for the leader peptidase, which removes the 
leader sequence. The exact amino acid sequence of the leader peptide is unimpor- 
tant. Nonpolar residues in the center and a few Lys residues at the amino terminus 
are sufficient for successful translocation. The functions of leader peptides are to re- 
tard the folding of the preprotein so that molecular chaperones have a chance to in- 
teract with it and to provide recognition signals for the translocation machinery and 
leader peptidase. 


Eukaryotic Proteins Are Routed to Their Proper Destinations 
by Protein Sorting and Translocation 


Eukaryotic cells are characterized by many membrane-bounded compartments. In 
general, signal sequences targeting proteins to their appropriate compartments are 
located at the N-terminus as cleavable presequences, although many proteins have 
N-terminal localization signals that are not cleaved and others have internal targeting 
sequences that may or may not be cleaved. Proteolytic removal of the leader sequences 
is also catalyzed by specialized proteases, but removal is not essential to translocation. 
No sequence similarity is found among the targeting signals for each compartment. 
Thus, the targeting information resides in more generalized features of the leader se- 
quences such as charge distribution, relative polarity, and secondary structure. For ex- 
ample, proteins destined for secretion enter the lumen of the endoplasmic reticulum 
(ER) and reach the plasma membrane via a series of vesicles that traverse the endo- 
membrane system. Recognition by the ER depends on an N-terminal amino acid se- 
quence that contains one or more basic amino acids followed by a run of 6 to 12 hydro- 
phobic amino acids. An example is serum albumin, which is synthesized in precursor 
form (preproalbumin) having a MKWVTFLLLLFISGSAFSR N-terminal signal se- 
quence. The italicized K highlights the basic residue in the sequence, and the bold 
residues denote a continuous stretch of (mostly) hydrophobic residues. A signal pep- 
tidase in the ER removes the signal sequence by cleaving the preproprotein between 
the S and R. 


The Synthesis of Secretory Proteins and Many Membrane Proteins Is Coupled to 
Translocation Across the ER Membrane The signals recognized by the ER translo- 
cation system are virtually indistinguishable from bacterial signal sequences; indeed, 
the two are interchangeable in vitro. In addition, the translocon systems in prokary- 
otes and eukaryotes are highly analogous. In higher eukaryotes, translation and 
translocation of many proteins destined for processing via the ER are tightly coupled. 
Translocation across the ER occurs co-translationally (that is, as the protein is being 
translated on the ribosome). As the N-terminal sequence of a protein undergoing syn- 
thesis enters the exit tunnel of the ribosome, it is detected by a signal recognition 
particle (SRP; Figure 31.5). SRP is a 325-kD nucleoprotein assembly that contains six 
polypeptides and a 300-nucleotide 7S RNA. SRP54, a 54-kD subunit of SRP and a 
G-protein family member, recognizes the nascent protein’s signal sequence, and SRP 
binding of the signal sequence causes the ribosome to cease translation. This arrest 
prevents release of the growing protein into the cytosol before it reaches the ER 
and its intended translocation. The SRP-ribosome complex is referred to as the 
RNC-SRP (ribosome nascent chain: SRP complex). 


Interaction Between the RNC-SRP and the SR Delivers the RNC to the Membrane 
The RNC-SRP is then directed to the cytosolic face of the ER, where it binds to the 
signal receptor (SR), an af heterodimeric protein. The 70-kD a-subunit is anchored 
to the membrane by the transmembrane $-subunit; both subunits are G-protein 
family members, and both have bound GTP. When SRP54 docks with SRa, the 
RNC-SRP becomes membrane associated (Figure 31.5). If the nascent chain emerg- 
ing from the ribosome is not a signal sequence, the RNC is released from the SRP 
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FIGURE 31.5 Synthesis of a eukaryotic secretory protein 
and its translocation into the endoplasmic reticulum. 

(1) The signal recognition particle (SRP red) recognizes 
the signal sequence emerging from a translating ribo- 
some (ribosome nascent complex [RNC], gray).(2) The 
RNC-SRP interacts with the signal receptor (SR, purple) 
and is transferred to the translocon (pink). (3) Release of 
the SRP and alignment of the peptide exit tunnel of the 
RNC with the protein-conducting channel of the trans- 
locon stimulates the ribosome to resume translation. 

(4) The membrane-associated signal peptidase (purple 
circle) clips off the N-terminal signal sequence, and BiP 
(the ER lumen Hsp70 chaperone, blue) binds the nascent 
chain mediating its folding into its native conformation. 
(5) Following dissociation of the ribosome, BiP plugs the 
translocon channel. Not shown are subsequent secretory 
protein maturation events, such as glycosylation. (Adapted 
from Figures 1a and 2a in Frydman, J. 2001. Folding of newly 
translated proteins in vivo: The role of molecular chaperones. 
Annual Review of Biochemistry 70:603-647.) 
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and the membrane. If the nascent chain emerging from the ribosome is a signal se- 
quence, the complex remains intact, and SRP54 and SRa function together as reci- 
procal GTPase-activating proteins. GTP hydrolysis causes the dissociation of SRP 
from SR and transfer of the RNC to the translocon. 


The Ribosome and the Translocon Form a Common Conduit for Transfer of the 
Nascent Protein Through the ER Membrane and into the Lumen Through inter- 
actions with the translocon, the ribosome resumes protein synthesis, delivering its 
growing polypeptide through the ER membrane. The peptide exit tunnel of the 
large ribosomal subunit and the protein-conducting channel of the translocon are 
aligned with one another, forming a continuous conduit from the peptidyl trans- 
ferase center of the ribosome to the ER lumen. 

The mammalian translocon is a complex, multifunctional entity that has as its 
core the Sec61 complex, a heterotrimeric complex of membrane proteins, and a 
unique fourth subunit, TRAM, that is required for insertion of nascent integral 
membrane proteins into the membrane. The 53-kD a-subunit of Sec6lp has ten 
membrane-spanning segments, whereas the $- and y-subunits are single TMS pro- 
teins. Sec6la forms the transmembrane protein-conducting channel through 
which the nascent polypeptide is transported into the ER lumen (Figure 31.5). The 
pore size of Sec61p is very dynamic, ranging from about 0.6 to 6 nm in diameter. 
Thus, a great variety of protein structures could be accommodated easily within the 
translocon. This flexibility allows the Sec6lp translocon complex to function in 
post-translational translocation (translocation of completely formed proteins) as 
well as co-translational translocation. 

As the protein is threaded through the Sec6lp channel into the lumen, an 
Hsp70 chaperone family member called BiP binds to it and mediates proper fold- 
ing. BiP function, like that of other Hsp70 proteins, is ATP-dependent, and ATP- 
dependent protein folding provides the driving force for translocation of the 
polypeptide into the lumen. When the ribosome dissociates from the translocon, 
BiP serves as a plug to block the protein-conducting channel, preventing ions and 
other substances from moving between the ER lumen and the cytosol. 


A Signal Peptidase Within the ER Lumen Clips Off the Signal Peptide Soon after 
it enters the ER lumen, the signal peptide is clipped off by membrane-bound signal 
peptidase (also called leader peptidase), which is a complex of five proteins. Other 
modifying enzymes within the lumen introduce additional post-translational alter- 
ations into the polypeptide, such as glycosylation with specific carbohydrate 
residues. ER-processed proteins destined for secretion from the cell or inclusion in 
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vesicles such as lysosomes end up contained within the soluble phase of the ER lu- 
men. On the other hand, polypeptides destined to become membrane proteins 
carry stop-transfer sequences within their mature domains. The stop-transfer se- 
quence is typically a 20-residue stretch of hydrophobic amino acids that arrests the 
passage across the ER membrane. Proteins with stop-transfer sequences remain em- 
bedded in the ER membrane with their C-termini on the cytosolic face of the ER. 
Such membrane proteins arrive at their intended destinations via subsequent pro- 
cessing of the ER. 


Retrograde Translocation Prevents Secretion of Damaged Proteins and Recycles 
Old ER Proteins To prevent secretion of inappropriate proteins, fragmented or 
misfolded secretory proteins are passed from the ER back into the cytosol via 
Sec61p. Thus, Sec61p also serves as a channel for aberrant secretory proteins to be 
returned to the cytosol so that they can be destroyed by the proteasome degrada- 
tion apparatus (see Section 31.4). Among these proteins are ER membrane proteins 
that are damaged or no longer needed. 


Mitochondrial Protein Import Most mitochondrial proteins are encoded by the 
nuclear genome and synthesized on cytosolic ribosomes. Mitochondria consist of 
four principal subcompartments: the outer membrane, the intermembrane space, 
the inner membrane, and the matrix. Thus, not only must mitochondrial proteins 
find mitochondria, they must gain access to the proper subcompartment; and once 
there, they must attain a functionally active conformation. As a consequence, mi- 
tochondria possess multiple preprotein translocons and chaperones. Similar con- 
siderations apply to protein import to chloroplasts, organelles with five principal 
subcompartments (outer membrane, intermembrane space, inner/thylakoid — FIGURE31.6 Structure of an amphipathic a-helix having 
membrane, stroma, and thylakoid lumen; see Chapter 21). basic (+) residues on one side and uncharged and 
Signal sequences on nuclear-encoded proteins destined for the mitochondria  PYdrophobic (R) residues on the other. 

are N-terminal cleavable presequences 10 to 70 residues long. These mitochondrial 

presequences lack contiguous hydrophobic regions. Instead, they have positively 

charged and hydroxy amino acid residues spread along their entire length. These 

sequences form amphipathic a-helices (Figure 31.6) with basic residues on one side 

of the helix and uncharged and hydrophobic residues on the other; that is, mito- 

chondrial presequences are positively charged amphiphatic sequences. In general, 

mitochondrial targeting sequences share no sequence homology. Once synthesized, 

mitochondrial preproteins are retained in an unfolded state with their target se- 

quences exposed, through association with Hsp70 molecular chaperones. Import 

involves binding of a preprotein to the mitochondrial outer membrane translocon 

(TOM) (Figure 31.7). If the protein is destined to be an outer mitochondrial mem- 

brane protein, it is transferred from the TOM to the sorting and assembly complex 
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(Top) A ubiquitin: E1 heterodimer complex (pdb id = 
1R4N; ubiquitin is shown in blue). (Middle) A ubiquitin: 
E2 complex (pdb id = 1FXT; E2 is shown in orange, 
ubiquitin in blue). (Bottom) The clamp-shaped E3 
heteromultimer (pdb id = 1LDK and 1FQV).The target 
protein is bound between the jaws of the clamp. 





(SAM) and inserted in the outer membrane. If it is an integral protein of the inner 
mitochondrial membrane, it traverses the TOM complex, enters the intermem- 
brane space, and is taken up by the inner mitochondrial membrane translocon 
(TIM22) and inserted into the inner membrane. On the other hand, if it is destined 
to be a mitochondrial matrix protein, a different TIM complex, TIM23, binds the 
preprotein and threads it across the inner mitochondrial membrane into the ma- 
trix. Chloroplasts have TOCs (translocon outer chloroplast membrane) and TICs 
(translocon inner chloroplast membrane) for these purposes. 


3le4) How Does Protein Degradation Regulate Cellular 
Levels of Specific Proteins? 


Cellular proteins are in a dynamic state of turnover, with the relative rates of protein 
synthesis and protein degradation ultimately determining the amount of protein 
present at any point in time. In many instances, transcriptional regulation deter- 
mines the concentrations of specific proteins expressed within cells, with protein 
degradation playing a minor role. In other instances, the amounts of key enzymes 
and regulatory proteins, such as cyclins and transcription factors, are controlled via 
selective protein degradation. In addition, abnormal proteins arising from biosyn- 
thetic errors or postsynthetic damage must be destroyed to prevent the deleterious 
consequences of their buildup. The elimination of proteins typically follows first- 
order kinetics, with half-lives (t1/2) of different proteins ranging from several min- 
utes to many days. A single, random proteolytic break introduced into the polypep- 
tide backbone of a protein is believed sufficient to trigger its rapid disappearance be- 
cause no partially degraded proteins are normally observed in cells. 

Protein degradation poses a real hazard to cellular processes. To control this haz- 
ard, protein degradation is compartmentalized, either in macromolecular structures 
known as proteasomes or in degradative organelles such as lysosomes. Protein degra- 
dation within lysosomes is largely nonselective; selection occurs during lysosomal up- 
take. Proteasomes are found in eukaryotic as well as prokaryotic cells. The protea- 
some is a functionally and structurally sophisticated counterpart to the ribosome. 
Regulation of protein levels via degradation is an essential cellular mechanism. Reg- 
ulation by degradation is both rapid and irreversible. 


Eukaryotic Proteins Are Targeted for Proteasome Destruction 
by the Ubiquitin Pathway 


Ubiquitination is the most common mechanism to label a protein for proteasome 
degradation in eukaryotes. Ubiquitin is a highly conserved, 76-residue (8.5-kD) 
polypeptide widespread in eukaryotes. Proteins are condemned to degradation 
through ligation to ubiquitin. Three proteins in addition to ubiquitin are involved 
in the ligation process: Ej, Ey, and E; (Figure 31.8). E; is the ubiquitin-activating 
enzyme (105-kD dimer). It becomes attached via a thioester bond to the C-terminal 
Gly residue of ubiquitin through ATP-driven formation of an activated ubiquitin- 
adenylate intermediate. Ubiquitin is then transferred from E; to an SH group on 
Eo, the ubiquitin-carrier protein. (E; is actually a family of at least seven different 
small proteins, several of which are heat shock proteins; there is also a variety of Es 
proteins.) In protein degradation, EyS ~ ubiquitin transfers ubiquitin to free 
amino groups on proteins selected by Es (180 kD), the ubiquitin-protein ligase. 
Upon binding a protein substrate, E; catalyzes the transfer of ubiquitin from ExS ~ 
ubiquitin to free amino groups (usually Lys e NH¢) on the protein. More than one 
ubiquitin may be attached to a protein substrate, and tandemly linked chains 
of ubiquitin also occur via isopeptide bonds between the C-terminal glycine residue of 
one ubiquitin and the e-amino of Lys residues in another. Ubiquitin has seven lysine 
residues, at positions 6, 11, 27, 29, 33, 48, and 63. Only isopeptide linkages to K!, 
K”, KS, and K® have been found, with the K**-type being most common as a degra- 
dation signal. 
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E; plays a central role in recognizing and selecting proteins for degradation. Es 
selects proteins by the nature of the N-terminal amino acid. Proteins must have a 
free a-amino terminus to be susceptible. Proteins having either Met, Ser, Ala, Thr, 
Val, Gly, or Cys at the amino terminus are resistant to the ubiquitin-mediated 
degradation pathway. However, proteins having Arg, Lys, His, Phe, Tyr, Trp, Leu, 
Asn, Gln, Asp, or Glu N-termini have half-lives of only 2 to 30 minutes. 

Interestingly, proteins with acidic N-termini (Asp or Glu) show a tRNA require- 
ment for degradation (Figure 31.9). Transfer of Arg from Arg-tRNA to the 
N-terminus of these proteins alters their N-terminus from acidic to basic, rendering 
the protein susceptible to Es. It is also interesting that Met is less likely to be cleaved 
from the N-terminus if the next amino acid in the chain is one particularly suscep- 
tible to ubiquitin-mediated degradation. 

Most proteins with susceptible N-terminal residues are not normal intracellular 
proteins but tend to be secreted proteins in which the susceptible residue has 
been exposed by action of a signal peptidase. Perhaps part of the function of the 
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CENGAGENOW” ACTIVE FIGURE 31.8 Enzymatic reac- 
tions in the ligation of ubiquitin to proteins. Ubiquitin is 
attached to selected proteins via isopeptide bonds 
formed between the ubiquitin carboxy-terminus and 
free amino groups (œ-NH; terminus, Lys e-NH) side 
chains) on the protein. Test yourself on the concepts 
in this figure at www.cengage.com/login. 


FIGURE 31.9 Proteins with acidic N-termini show a tRNA 
requirement for degradation. Arginyl-tRNA‘%: protein 
transferase catalyzes the transfer of Arg to the free 
a-NH; of proteins with Asp or Glu N-terminal residues. 
Arg-tRNA“9: protein transferase serves as part of the 
protein degradation recognition system. 
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N-terminal recognition system is to recognize and remove from the cytosol any in- 
vading “foreign” or secreted proteins. 

Other proteins targeted for ubiquitin ligation and proteasome degradation con- 
tain PEST sequences—short, highly conserved sequence elements rich in proline 
(P), glutamate (E), serine (S), and threonine (T) residues. 


Proteins Targeted for Destruction Are Degraded by Proteasomes 


Proteasomes are large oligomeric structures enclosing a central cavity where prote- 
olysis takes place. The 20S proteasome from the archaeon Thermoplasma acidophilum 
is a 700-kD barrel-shaped structure composed of two different kinds of polypeptide 
chains, a and £, arranged to form four stacked rings of a78787a@7-subunit organiza- 
tion. The barrel is about 15 nm in height and 11 nm in diameter, and it contains a 
three-part central cavity (Figure 31.10a). The proteolytic sites of the 20S proteasome 
are found within this cavity. Access to the cavity is controlled through a 1.3-nm open- 
ing formed by the outer æy rings. These rings are believed to unfold proteins des- 
tined for degradation and transport them into the central cavity. The B-subunits pos- 
sess the proteolytic activity. Proteolysis occurs when the -subunit N-terminal 
threonine side-chain O atom makes nucleophilic attack on the carbonyl-C of a pep- 
tide bond in the target protein. The products of proteasome degradation are oligo- 
peptides seven to nine residues long. 

Eukaryotic cells contain two forms of proteasomes: the 20S proteasome, and its 
larger counterpart, the 26S proteasome. The eukaryotic 26S proteasome is a 45-nm- 
long structure composed of a 20S proteasome plus two additional substructures 
known as 19S regulators (also called 19S caps or PA700 [for proteasome activator- 
700 kD]) (Figure 31.10b). Overall, the 26S proteasome (approximately 2.5 megadal- 
tons) has 2 copies each of 32 to 34 distinct subunits, 14 in the 20S core and 18 to 
20 in the cap structures. Unlike the archaeal 20S proteasome, the eukaryotic 20S 
core structure contains seven different kinds of a-subunits and seven different kinds 
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FIGURE 31.10 The structure of the 26S proteasome. (a) The yeast (Saccharomyces cerevisiae) 20S proteasome 
core with bortezomib bound (red) (pdb id = 2F 16). Bortezomib is the first therapeutic proteasome inhibitor 
used in humans. It is approved in the United States for treatment of relapsed multiple myeloma and mantle 
cell lymphoma. (b) Composite model of the 26S proteasome. The 20S proteasome core is shown in yellow; the 
19S regulator (19S cap) structures are in blue. (Adapted from Figure 5 in Voges, D., Zwickl, P, and Baumeister, W., 1999. The 
26S proteasome: A molecular machine designed for controlled proteolysis. Annual Review of Biochemistry 68:1015-1068.) 
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of B-subunits. Interestingly, only three of the seven different -subunits have pro- 
tease active sites. The 26S proteasome forms when the 19S regulators dock to the two 
outer @7 rings of the 20S proteasome cylinder. Many of the 19S regulator subunits 
have ATPase activity. Replacement of certain 19S regulator subunits with others 
changes the specificity of the proteasome. The 19S regulators cause the proteolytic 
function of the 20S proteasome to become ATP-dependent and specific for ubiqui- 
tinylated proteins as substrates. That is, these 19S caps act as regulatory complexes 
for the recognition and selection of ubiquitinylated proteins for degradation by the 
20S proteasome core (Figure 31.11). The 26S proteasome shows a preference for 
proteins having four or more ubiquitin molecules attached to them. The 19S regu- 
lators also carry out the unfolding and transport of ubiquitinylated protein substrates 
into the proteolytic central cavity. 

The 19S regulators consist of two parts: the base and the lid. The base subcomplex 
connects to the 20S proteasome and contains the six ATPase subunits that unfold pro- 
teasome substrates. These subunits are members of the AAA family of ATPases 
(ATPases associated with various cellular activities); AAA-ATPases are an evolutionar- 
ily ancient family of proteins involved in a variety of cellular functions requiring 
energy-dependent unfolding, disassembly, and remodeling of proteins. The lid sub- 
complex acts as a cap on the base subcomplex and one of its subunits functions in 
recognition and ubiquitin-chain processing of proteasome protein substrates. 


ATPase Modules Mediate the Unfolding of Proteins in the Proteasome 


The base of the 19S regulators that cap the 26S proteasome consists of a hexameric 
ring of AAA-ATPases that mediate the ATP-dependent unfolding of ubiquitinated 
proteins targeted for destruction in the proteasome. Structural studies have re- 
vealed the presence of loops extending from these AAA-ATPase subunits; these 
loops face the central channel of the hexameric ring. The loops are in an “up” po- 
sition when an AAA-ATPase subunit has ATP bound in its active site, but they shift 
to a “down” position when ADP occupies the active site. Apparently, these loops 
bind protein substrates and act like the levers of a machine, using the energy of ATP 
hydrolysis to tug the protein into an unfolded state and thread it down through the 
narrow central channel. This channel leads into the cavity of the 20S proteasome, 
where proteolytic degradation takes place. The AAA-ATPase cycle of ATP binding, 
protein substrate binding, ATP hydrolysis, and protein unfolding is reminiscent of 
the ATP-dependent action of Hsp70 chaperones. 


Ubiquitination Is a General Regulatory Protein Modification 


Protein ubiquitination is a signal for protein degradation, as described on page 998. 
Ubiquitin conjugation to proteins is also used for other purposes in cells. Non- 
degradative functions for ubiquitination include roles in chromatin remodeling, 
DNA repair, transcription, signal transduction, endocytosis, spliceosome assembly, 
and sorting of proteins to specific organelles and cell structures. In addition, cells 
possess a variety of protein modifiers attached to target proteins by processes simi- 
lar to the ubiquitin pathway, as described in the following section. 


Small Ubiquitin-Like Protein Modifiers Are Post-transcriptional 
Regulators 


Small ubiquitin-like protein modifiers (SUMOs) are a highly conserved family of 
proteins found in all eukaryotic cells. Like ubiquitin, SUMO family members are 
covalently ligated to lysine residues in target proteins by a three-enzyme conjugat- 
ing system (Figure 31.12). SUMO proteins share only limited homology to ubiqui- 
tin, and sumoylated proteins are not targeted for destruction. Instead, sumoylation 
alters the ability of the modified protein to interact with other proteins. This ability 
to change protein—protein interactions is believed to be the biological purpose of 
SUMO proteins. 
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CENGAGENOW” ACTIVE FIGURE 31.11 Diagram of the 
ubiquitin-proteasome degradation pathway. Pink “lolli- 
pop’ structures symbolize ubiquitin molecules. (Adapted 
from Figure 1 in Hilt, W., and Wolf, D. H., 1996. Proteasomes: 
Destruction as a program. Trends in Biochemical Sciences 
21:96-102.) Test yourself on the concepts in this fig- 
ure at www.cengage.com/login. 
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FIGURE 31.12 The mechanism of reversible sumoyla- 
tion. Before conjugation, small ubiquitin-like protein 
modifiers (SUMOs) need to be proteolytically processed, 
removing anywhere from 2 to 11 amino acids to reveal 
the C-terminal Gly-Gly motif. SUMOs are then activated 
by the E1 enzyme in an ATP-dependent reaction to 
form a thioester bond between SUMO and E1.SUMO is 
then transferred to the catalytic Cys residue of the E2 
enzyme, Ubc9. Finally, an “isopeptide bond’ is formed, 
between the C-terminal Gly of SUMO and a Lys residue 
on the substrate protein, through the action of an E3 
enzyme. The SUMO-specific protease SENP can decon- 
jugate SUMO from target proteins. 
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Sumoylation can have three general consequences for modified proteins (Figure 
31.13): (1) sumoylation can interfere with the interactions between the target and 
its partner so that the interaction can only occur in the absence of sumoylation; 
(2) sumoylation can create a binding site for an interacting partner protein; and 
(3) sumoylation can induce a conformational change in the modified target pro- 
tein, altering its interactions with partner proteins. The regulatory opportunities as- 
sociated with sumoylation are significant for many cellular functions, including 
transcriptional regulation, chromosome organization, nuclear transport, and signal 
transduction (see Chapter 32). 

Many E2 enzymes participate in ubiquitination processes, but the only known 
SUMO F2 enzyme is Ubc9 (Figure 31.14a). Ubc9 recognizes a YKXD/E consensus 
sequence in proteins destined for sumoylation. In this sequence, w is an aliphatic 
branched amino acid (such as Leu), K is the lysine to which SUMO is conjugated, 
and X is any amino acid, with an acidic D or E completing the sequence. Recogni- 
tion by Ubc9 is only possible if the consensus sequence is in a relatively unstruc- 
tured part of a target protein or if it is part of an extended loop, as in RanGAP1 
(Figure 31.14a). In the Ubc9-RanGAP1 complex, Leu®”’ of RanGAP1 is in van der 
Waals contact with several nonpolar Ubc9 residues, whereas RanGAP1 Lys°”® lies in 
a hydrophobic groove of Ubc9, juxtaposed with the catalytic Cys? (Figure 31.14b). 
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FIGURE 31.13 The molecular consequences of sumoylation. The process can affect a modified protein in three 
ways: (a) Sumoylation can interfere with the interaction between a target protein and its binding partner. 

(b) Sumoylation can provide a new binding site for an interacting partner. (c) Sumoylation can induce a con- 
formational change in the modified target protein. 
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(b) 





FIGURE 31.14 (a) The complex formed by the E2 enzyme, Ubc9 (yellow), and a target protein, RanGAP1 (blue). 
(b) In the Ubc9-RanGAP1 complex, the exposed loop of RanGAP1 lies in the binding pocket of Ubc9. The 
exposed loop contains the consensus sequence for sumoylation (wKXD/E), including Leu?%, which is sur- 
rounded by hydrophobic residues from Ubc9; Lys°”°, which is coordinated by Asp!” and Cys” of Ubc9; and 
Glu°?8, which is coordinated by Ubc9 Thrê!. 


The pK, of Lys°*° is lowered in this complex, activating the lysine amino group for 
nucleophilic attack to form the SUMO-target protein conjugate. 


HtrA Proteases Also Function in Protein Quality Control 


The discussion thus far has stressed the importance of protein quality control to cel- 
lular health. HtrA proteases are a class of proteins involved in quality control that 
combine the dual functions of chaperones and proteasomes. (The acronym Htr 
comes from “high temperature requirement” because E. coli strains bearing muta- 
tions in HtrA genes do not grow at elevated temperatures.) In addition to their novel 
ability to be either chaperones or proteases, HtrA proteases are the only known pro- 
tein quality control factor that is not ATP-dependent. Prokaryotic HtrA proteases act 
as chaperones at low temperatures (20°C) where they have negligible protease ac- 
tivity. However, as the temperature increases, these proteins switch from a chaperone 
function to a protease function to remove misfolded or unfolded proteins from the 
cell. With this functional duality, HtrA proteases have the potential to mediate qual- 
ity control through protein triage (see A Deeper Look box on page 1004). 

The E. coli HtrA protein DegP is the best characterized HtrA protease. DegP is lo- 
calized in the £. coli periplasmic space, where it oversees quality control of proteins in- 
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Proteasome Inhibitors in Cancer Chemotherapy 





Proteasome inhibition offers a promising approach to treating 
cancer. The counterintuitive rationale goes like this: The protea- 
some is responsible for the regulated destruction of proteins in- 
volved in cell cycle progression and the control of apoptosis (pro- 
grammed cell death). Inhibition of proteasome function leads to 
cell cycle arrest and apoptosis. In clinical trials, proteasome in- 
hibitors have retarded cancer progression by interfering with the 
programmed degradation of regulatory proteins, causing cancer 


cells to self-destruct. Bortezomib, a small-molecule proteasome in- 
hibitor developed by Millenium Pharmaceuticals, Inc., has re- 
ceived FDA approval for the treatment of multiple myeloma, a 
cancer of plasma cells that accounts for 10% of all cancers of the 
blood (see Figure 31.10a). 


Source: Adams, J., 2003. The proteasome: Structure, function, and role in 
the cell. Cancer Treatment Reviews Supplement 1:3-9. 
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FIGURE 31.15 The HtrA protease structure. (a) A 
rimer of DegP subunits represents the HtrA 
unctional unit. The different domains are color- 
coded: The protease domain is green, PDZ do- 
main 1 (PDZ1) is yellow, and PDZ domain 2 
(PDZ2) is orange. Protease active sites are high- 
ighted in blue. The trimer has somewhat of a 
unnel shape, with the protease in the center 
and the PDZ domains on the rim. (b) Two HtrA 
rimers come together to form a hexameric 
structure in which the two protease domains 
orm a rigid molecular cage (blue) and the six 
PDZ domains are like tentacles (red) that both 
bind protein substrate targets and control lateral 
access into the protease cavity. (Adapted from Fig- 
ure 3 in Clausen, T., Southan, C., and Ehrmann, M., 2002. 
The HtrA family of proteases: Implications for protein 
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composition and cell fate. Molecular Cell 10:443-455.) 


volved in the cell envelope. It is a 448-residue protein containing a central protease 
domain with a classic Ser protease Asp-His-Ser catalytic triad (see Chapter 14) and two 
C-terminal PDZ domains. These domains are structural modules involved in protein- 
protein interactions, and they recognize and bind selectively to the C-terminal three 
or four residues of target proteins. Like other quality control systems, HtrA proteases 
have a central cavity where proteolysis occurs (Figure 31.15); the height of this cavity 
is 1.5 nm, which excludes folded proteins from access to the proteolytic sites. Thus, 
HtrA proteases can act only on misfolded proteins. As we have seen, limited access to 
proteolytic sites is an important regulatory feature of quality control proteases; only 
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Protein Triage—A Model for Quality Control 





Triage is a medical term for the sorting of patients according to Go 


their need for (and their likelihood to benefit from) medical treat- 
ment. Sue Wickner, Michael Maurizi, and Susan Gottesman have 
pointed out that cells control the quality of their proteins through 
a system of triage based on the chaperones and the ubiquitination- 
proteasome degradation pathway. These systems recognize non- 
native proteins (proteins that are only partially folded, misfolded, 
incorrectly modified, damaged, or in an inappropriate compart- 
ment). Depending on the severity of its damage, a non-native pro- 
tein is directed to chaperones for refolding or targeted for de- 
struction by a proteasome (see accompanying figure). 


Adapted from Figures 2 and 3 in Wickner, S., Maurizi, M., and Gottesman, 
S., 1999. Posttranslational quality control: Folding, refolding, and degrad- 
ing proteins. Science 286:1888-1893. 
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Problems 1005 


proteins targeted for destruction have access to such sites. The PDZ domains of the 
HtrA proteases act as gatekeepers, determining access of protein substrates to the pro- 
teolytic centers. Human HtrA proteases are implicated in stress response pathways. 
Human HtrA1 is expressed at higher levels in osteoarthritis and aging and lower lev- 
els in ovarian cancer and melanoma. Secreted human Htrl may be involved in degra- 
dation of extracellular matrix proteins involved in arthritis as well as tumor progres- 


sion and invasion. 


SUMMARY 


31.1 How Do Newly Synthesized Proteins Fold? Most proteins fold 
spontaneously, as anticipated by Anfinsen, whose experiments suggested 
that all of the information necessary for a polypeptide chain to assume 
its active, folded conformation resides in its amino acid sequence. How- 
ever, some proteins depend on molecular chaperones to achieve their 
folded conformation within the crowded intracellular environment. 
Hsp70 chaperones are ATP-dependent proteins that bind to exposed 
hydrophobic regions of polypeptides, preventing nonproductive associa- 
tions with other proteins and keeping the protein in an unfolded state 
until productive folding steps can take place. Hsp60 chaperones such as 
GroEL-GroES are ATP-dependent cylindrical chaperonins that provide 
a protected central cavity or “Anfinsen cage,” where partially folded pro- 
teins can fold spontaneously, free from the danger of nonspecific hy- 
drophobic interactions with other unfolded protein chains. Hsp90 chap- 
erones assist a subset of “client proteins” involved in signal transduction 
pathways in assuming their active conformations. 


31.2 How Are Proteins Processed Following Translation? Nascent pro- 
teins are seldom produced in their final, functional form. Maturation 
typically involves proteolytic cleavage and may require post-translational 
modification, such as phosphorylation, glycosylation, or other covalent 
substitutions. Removal of nascent N-terminal methionine residues is a 
common form of protein processing by proteolysis. The number of hu- 
man proteins is believed to exceed the number of human genes (20,000 
or so) by an order of magnitude or more. The great number of proteins 
available from a fixed set of genes is attributed to a variety of processes, 
including alternative splicing of mRNAs and post-translational modifica- 
tion of proteins. 


31.3 How Do Proteins Find Their Proper Place in the Cell? Most pro- 
teins destined for compartments other than the cytosol are synthesized 
with N-terminal signal sequences that target them to their proper desti- 
nations. These signal sequences are recognized by signal recognition 
particles as they emerge from translating ribosomes. The signal recog- 
nition particle interacts with a membrane-bound signal receptor, deliv- 


ering the translating ribosome to a translocon, a multimeric integral 
membrane protein structure having at its core the Sec61p complex. The 
translocon transfers the growing protein chain across the membrane 
(or in the case of nascent integral membrane proteins, inserts the pro- 
tein into the membrane). Signal peptidases within the membrane com- 
partment clip off the signal sequence. Other post-translational modifi- 
cations may follow. Mitochondrial protein import and membrane 
insertion are mediated by specific translocon complexes in the outer 
mitochondrial membrane called TOM and SAM. Proteins destined for 
the inner mitochondrial membrane or mitochondrial matrix must in- 
teract with inner mitochondrial translocons (either TIM23 or TIM22) 
as well as TOM. 


31.4 How Does Protein Degradation Regulate Cellular Levels of Specific 
Proteins? Protein degradation is potentially hazardous to cells, because 
cell function depends on active proteins. Therefore, protein degradation 
is compartmentalized in lysosomes or in proteasomes. Proteins targeted 
for destruction are selected by ubiquitination. A set of three enzymes 
(Ej), Eo, and Es) mediate transfer of ubiquitin to free —NHp groups in tar- 
geted proteins. Proteins with charged or hydrophobic residues at their 
N-termini are particularly susceptible to ubiquitination and destruction. 
The ubiquitin moieties are recognized by 19S cap structures found at 
either end of 26S proteasomes. Protein degradation occurs when, in an 
ATP-dependent process, the ubiquitinated protein is unfolded and 
threaded into the central cavity of the cylindrical a78767a7 20S part of the 
26S proteasome. The -subunits possess protease active sites that chop 
the protein substrate into short (seven- to nine-residue) fragments; the 
ubiquitin moieties are recycled. Linkage of SUMO (small ubiquitin-like 
protein modifiers) to target proteins has the ability to alter their protein- 
protein interactions. HtrA proteases also function in protein quality con- 
trol. HtrA proteins are novel in two aspects: Unlike other chaperones and 
proteasomes, they are ATP-independent; and unlike the others, they have 
dual chaperone and protease activities and switch between these two 
functions in response to stress conditions. 


PROBLEMS 


CENGAGENOW" Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 

1. (Integrates with Chapter 30.) Human rhodanese (33 kD) consists of 
296 amino acid residues. Approximately how many ATP equivalents 
are consumed in the synthesis of the rhodanese polypeptide chain 
from its constituent amino acids and the folding of this chain into 
an active tertiary structure? 

2. A single proteolytic break in a polypeptide chain of a native protein 
is often sufficient to initiate its total degradation. What does this 
fact suggest to you regarding the structural consequences of proteo- 
lytic nicks in proteins? 

3. Protein molecules, like all molecules, can be characterized in terms of 
general properties such as size, shape, charge, solubility/hydropho- 
bicity. Consider the influence of each of these general features on the 
likelihood of whether folding of a particular protein will require chap- 


erone assistance or not. Be specific regarding just Hsp70 chaperones 
or Hsp70 chaperones and Hsp60 chaperonins. 

4. Many multidomain proteins apparently do not require chaperones 
to attain the fully folded conformations. Suggest a rational scenario 
for chaperone-independent folding of such proteins. 

5. The GroEL ring has a 5-nm central cavity. Calculate the maximum 
molecular weight for a spherical protein that can just fit in this cav- 
ity, assuming the density of the protein is 1.25 g/mL. 

6. (Integrates with Chapter 24.) Acetyl-CoA carboxylase has at least 
seven possible phosphorylation sites (residues 23, 25, 29, 76, 77, 
95, and 1200) in its 2345-residue polypeptide (see Figure 24.4). 
How many different covalently modified forms of acetyl-CoA car- 
boxylase protein are possible if there are seven phosphorylation 
sites? 
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7. (Integrates with Chapter 30.) In what ways are the mechanisms of 
action of EF-Tu/EF-Ts and DnaK/GrpE similar? What mechanistic 
functions do the ribosome A-site and DnaJ have in common? 


8. The amino acid sequence deduced from the nucleotide sequence of a 
newly discovered human gene begins: MRSLLILVLCFLPLAALGK... . 
Is this a signal sequence? If so, where does the signal peptidase act 
on it? What can you surmise about the intended destination of this 
protein? 

9. Not only is the Sec61p translocon complex essential for transloca- 
tion of proteins into the ER lumen, it also mediates the incorpora- 
tion of integral membrane proteins into the ER membrane. The 
mechanism for integration is triggered by stop-transfer signals that 
cause a pause in translocation. Figure 31.5 shows the translocon as 
a closed cylinder spanning the membrane. Suggest a mechanism 
for lateral transfer of an integral membrane protein from the 
protein-conducting channel of the translocon into the hydrophobic 
phase of the ER membrane. 

10. The Sec61p core complex of the translocon has a highly dynamic 
pore whose internal diameter varies from 0.6 to 6 nm. In post- 
translational translocation, folded proteins can move across the 
ER membrane through this pore. What is the molecular weight 
of a spherical protein that would just fit through a 6-nm pore? 
(Adopt the same assumptions used in problem 5.) 

11. (Integrates with Chapters 6, 9, and 30.) During co-translational 
translocation, the peptide tunnel running from the peptidyl trans- 
ferase center of the large ribosomal subunit and the protein- 
conducting channel are aligned. If the tunnel through the ribosomal 
subunit is 10 nm and the translocon channel has the same length as 
the thickness of a phospholipid bilayer, what is the minimum number 
of amino acid residues sequestered in this common conduit? 

12. Draw the structure of the isopeptide bond formed between Gly” of 
one ubiquitin molecule and Lys*® of another ubiquitin molecule. 

13. Assign the 20 amino acids to either of two groups based on their sus- 
ceptibility to ubiquitin ligation by Eş ubiquitin protein ligase. Can 
you discern any common attributes among the amino acids in the 
less susceptible versus the more susceptible group? 

14, Lactacystin is a Streptomyces natural product that acts as an irre- 
versible inhibitor of 26S proteasome -subunit catalytic activity by 
covalent attachment to N-terminal threonine —OH groups. Predict 
the effects of lactacystin on cell cycle progression. 


15. HtrA proteases are dual-function chaperone-protease protein qual- 
ity control systems. The protease activity of HtrA proteases depends 
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on a proper spatial relationship between the Asp-His-Ser catalytic 
triad. Propose a mechanism for the temperature-induced switch of 
HtrA proteases from chaperone function to protease function. 


16. As described in this chapter, the most common post-translational 
modifications of proteins are proteolysis, phosphorylation, methy- 
lation, acetylation, and linkage with ubiquitin and SUMO pro- 
teins. Carry out a Web search to identify at least eight other post- 
translational modifications and the amino acid residues involved 
in these modifications. 


17. Fluorescence resonance energy transfer (FRET) is a spectroscopic 
technique that can be used to provide certain details of the confor- 
mation of biomolecules. Look up FRET on the Web or in an intro- 
ductory text on FRET uses in biochemistry, and explain how FRET 
could be used to observe conformational changes in proteins 
bound to chaperonins such as GroEL. A good article on FRET in 
protein folding and dynamics can be found here: Haas, E., 2005. 
The study of protein folding and dynamics by determination of in- 
tramolecular distance distributions and their fluctuations using en- 
semble and single-molecule FRET measurements. ChemPhysChem 
6:858-870. Studies of GroEL using FRET analysis include the fol- 
lowing: Sharma, S., et al., 2008. Monitoring protein conformation 
along the pathway of chaperonin-assisted folding. Cell 133:142-153; 
and Lin, Z., et al., 2008. GroEL stimulates protein folding through 
forced unfolding. Nature Structural and Molecular Biology 15:303-311. 

18. The cross-talk between phosphorylation and ubiquitination in pro- 
tein degradation processes is encapsulated in the concept of the 
“phosphodegron.” What is a phosphodegron, and how does phos- 
phorylation serve as a recognition signal for protein degradation? 
(A good reference on the phosphodegron and crosstalk between 
phosphorylation and ubiquitination is Hunter, T., 2007. The age of 
crosstalk: Phosphorylation, ubiquitination, and beyond. Molecular 
Cell 28:730-738.) 


Preparing for the MCAT Exam 

19. A common post-translational modification is removal of the univer- 
sal N-terminal methionine in many proteins by Met-aminopeptidase. 
How might Met removal affect the half-life of the protein? 

20. Figure 31.6 shows the generalized amphipathic a-helix structure 
found as an N-terminal presequence on a nuclear-encoded mito- 
chondrial protein. Write out a 20-residue-long amino acid sequence 
that would give rise to such an amphipathic a-helical secondary 
structure. 
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The Reception 
and Transmission 
of Extracellular Information 





ESSENTIAL QUESTION 


Higher life forms must have molecular mechanisms for detecting environmental infor- 
mation as well as mechanisms that allow for communication at the cell and tissue lev- 
els. Sensory systems detect and integrate physical and chemical information from the 
environment and pass this information along by the process of neurotransmission. 
Control and coordination of processes at the cell and tissue levels are achieved not 
only by neurotransmission but also by chemical signals in the form of hormones that 
are secreted by one set of cells to direct the activity of other cells. 
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Drawing of a human fetus in utero, by Leonardo da 

Vinci. Human sexuality and embryonic development What are these mechanisms of information transfer that mediate the molecular 

pipes hormonally regulated processes of uni- basis of hormone action and that use excitable membranes to transduce the 
signals of neurotransmission and sensory systems? 





“How little we know, how much to discover, 
What chemical forces flow from lover to lover. 
How little we understand what touches off that 


tingle, Hormones are secreted by certain cells, usually located in glands, and travel, either 
That sudden explosion when two tingles by simple diffusion or circulation in the bloodstream, to specific target cells (Fig- 
intermingle. 


ure 32.1). As we shall see, some hormones bind to specialized receptors on the 
Who cares with your boson mine plasma membrane and induce responses within the cell without themselves enter- 
How ignorant bliss is, ing the target cell (Figure 32.2). Other hormones actually enter the target cell and 
So long as you kiss me and the world around us interact with specific receptors there. By these mechanisms, hormones: 


shatters? i ; ; 
A : e Regulate the metabolic processes of various organs and tissues 
How little it matters how little we know” 8 P 8 


alom Tate We Rae? e Facilitate and control growth, differentiation, reproductive activities, learning, 
and memory 
Help the organism cope with changing conditions and stresses in its environment 


Who cares to define what chemistry this is? 


by P. Springer and C. Leigh 
(Excerpt from “How Little We Know” by P. Springer 


and C. Leigh as recorded by Frank Sinatra, ; : A . 
April 5. 1956. eas Reede Ine) All these effects of hormonal signals are either metabolic responses or gene expression 


responses (Figure 32.2). 





KEY QUESTIONS 





























32.1 What Are Hormones? 2ail What Are Hormones? 

32.2 What Is Signal Transduction? Many different chemical species act as hormones. Steroid hormones, all derived from 

32.3 ow Do Signal-Transducing Receptors cholesterol, regulate metabolism, salt and water balances, inflammatory processes, 
Respond to the Hormonal Message? and sexual function. Several hormones are amino acid derivatives. Among these are 

32.4 How Are Receptor Signals Transduced? epinephrine and norepinephrine (which regulate smooth muscle contraction and relax- 

32.5 How Do Effectors Convert the Signals ation, blood pressure, cardiac rate, and the processes of lipolysis and glycogenolysis) 
o Actions in the Cell? and the thyroid hormones (which stimulate metabolism). Peptide hormones are a large 

32.6 ow Are Signaling Pathways Organized group of hormones that regulate processes in all body tissues, including the release 
anel Tancichic. ack of yet other hormones. 

32.7 How Do Neurotransmission Pathways Hormones and other signal molecules in biological systems bind with very high 


Gomirell tne Funetien oi sensory systems? affinities to their receptors, displaying Kp values in the range of 107! to 107° M. The 


hormones are produced at concentrations equivalent to or slightly above these Kp 
values. Once hormonal effects have been induced, the hormone is usually rapidly 
metabolized. 


Steroid Hormones Act in Two Ways 


The steroid hormones include the glucocorticoids (cortisol and corticosterone), the 
CENGAGENOW” Create your own study path for mineralocorticoids (aldosterone), and the sex hormones (progesterone and testos- 
Unie crepar wiin Conia line aU Ome eniin, terone, for example) (Figure 32.1; see Chapter 24 for the details of their synthesis). 
and Active Figures at www.cengage.com/login. x ‘ ‘ $ : : 

The steroid hormones exert their effects in two ways: First, by entering cells and mi- 
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grating to the nucleus, steroid hormones act as transcription regulators, modulating 
gene expression. These effects of the steroid hormones occur on time scales of hours 
and involve synthesis of new proteins. Steroids can also act at the cell membrane, di- 
rectly regulating ligand-gated ion channels and perhaps other processes. These latter 
processes take place very rapidly, on time scales of seconds and minutes. 
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FIGURE 32.1 The sites of synthesis and action of a few 
of the polypeptide and steroid hormones. Hormones 
typically circulate at low concentrations (1 nM or less) 
and bind with high affinity to their receptor proteins. 


FIGURE 32.2 Nonsteroid hormones bind exclusively to 
plasma membrane receptors, which mediate the cellular 
responses to the hormone. Steroid hormones exert their 
effects either by binding to plasma membrane recep- 
tors or by diffusing to the nucleus, where they modulate 
transcriptional events. Intracellular responses to hor- 
mone binding include metabolic changes and alter- 
ations of gene expression. 
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FIGURE 32.3 The conversion of prepro-opiomelanocortin to a family of peptide hormones, including E] 
corticotropin, B- and y-lipotropin, œ- and B-MSH, and endorphin. y-Lipotropin 


Polypeptide Hormones Share Similarities of Synthesis and Processing 


The largest class of hormones in vertebrate organisms is that of the polypeptide 
hormones (Figure 32.1). One of the first polypeptide hormones to be discovered, 
insulin, was described by Banting and Best in 1921. Insulin, a secretion of the pan- 
creas, controls glucose utilization and promotes the synthesis of proteins, fatty acids, 
and glycogen. Insulin, which is typical of the secreted polypeptide hormones, is dis- 
cussed in detail in Chapters 5, 15, and 22. 

Many other polypeptide hormones are produced and processed in a manner sim- 
ilar to that of insulin. Three unifying features of their synthesis and cellular process- 
ing should be noted. First, all secreted polypeptide hormones are originally synthe- 
sized with a signal sequence, which facilitates their eventual direction to secretory 
granules, and thence to the extracellular milieu. Second, peptide hormones are usu- 
ally synthesized from mRNA as inactive precursors, termed preprohormones, which 
become activated by proteolysis. Third, a single polypeptide precursor or prepro- 
hormone may produce several different peptide hormones by suitable proteolytic 
processing. 

An impressive example of the production of many hormone products from a 
single precursor is the case of prepro-opiomelanocortin, a 250-residue precursor 
peptide synthesized in the pituitary gland. A cascade of proteolytic steps produces, 
as the name implies, a natural opiate substance (endorphin) and several other hor- 
mones (Figure 32.3). Endorphins and other opiatelike hormones are produced by 
the body in response to systemic stress. These substances probably contribute to the 
“runner’s high” that marathon runners describe. 


32:28 What Is Signal Transduction? 


Hormonal regulation depends on the transduction of the hormonal signal across 
the plasma membrane to specific intracellular sites, particularly the nucleus. Signal 
transduction consists of a stepwise progression of signaling stages: receptor— 
transducers—effectors. The receptor perceives the signal, transducers relay the 
signal, and the effectors convert the signal into an intracellular response. 

What are the characteristics of signal transduction systems? Regardless of the 
organism or the tissue, certain features appear to be universal, or nearly so: 


e Transmembrane communication of hormonal signals by receptor proteins 
e Protein interaction domains that selectively recognize specific structural motifs 
and bind them with high affinity and specificity 


e Clustering of membrane receptors and their ligands in large aggregates called 
signalsomes 

e Reversible covalent modifications that change the function of certain proteins 
and lipids (including phosphorylation, methylation, acetylation, ubiquitylation, 
hydroxylation, and cleavage) 

e Second messengers that bind to specific targets, changing their activity and be- 
havior 

e Intracellular signaling pathways, often involving a series of enzymes (such as pro- 

tein kinases), that link receptors to their downstream functional targets 

Cooperativity 

Spatial and temporal control of signals and messengers 

Integration of signals 

Converging and diverging networks 

Signal amplification 

Desensitization and adaptation 


Many Signaling Pathways Involve Enzyme Cascades 


Signaling pathways must operate with speed and precision, facilitating the accurate 
relay of intracellular signals to specific targets. But how does this happen? Enzyme 
cascades are one answer to this question. Enzymes can produce (or modify) a large 
number of molecules rapidly and specifically. Many enzymes of signaling cascades 
are protein kinases and protein phosphatases, and many steps in signaling pathways 
involve phosphorylation of serine, threonine, and tyrosine residues on target pro- 
teins. The complexity of signal transduction is thus manifested in the estimates that 
the human genome contains about a thousand protein kinase and protein phos- 
phatase genes. Enzyme cascades act like a series of amplifiers, dramatically increas- 
ing the magnitude of the intracellular response available from a very small amount 
of hormone. 


Signaling Pathways Connect Membrane Interactions with Events 
in the Nucleus 


The complete pathway from hormone binding at the plasma membrane to modula- 
tion of transcription in the nucleus is understood for a few signaling pathways. Figure 
32.4 shows a complete signal transduction pathway that connects receptor tyrosine 
kinases, the Ras GTPase, cytoplasmic Raf, and two other protein kinases with tran- 
scription factors that alter gene expression in the nucleus. This pathway represents 
just one component of a complex signaling network that involves many other proteins 
and signaling factors. The existence of nearly 4000 human genes devoted to signal 
transduction portends a complex and interwoven network of signaling interactions in 
nearly all human cells. 


Signaling Pathways Depend on Multiple Molecular Interactions 


Each step in a signaling pathway depends on one or more molecular interac- 
tions. For example, a protein may bind to a small molecule, a peptide, or even 
another protein (Figure 32.5). In fact, many signaling proteins are constructed 
in a cassettelike fashion with several distinct modules, termed protein interaction 
domains (PIDs), each of which mediates a particular interaction (Figure 32.6a). 
A protein with multiple modules can interact with several binding targets at once 
to create a complex, called a signalsome, with multiple signaling capabilities 
(Figure 32.6b). Several signaling systems described in this chapter involve events 
at a signalsome. 


'The American Association for the Advancement of Science oversees a consortium of researchers who 
have established a Web site—the Signal Transduction Knowledge Environment (STKE). This site is an 
up-to-date and ongoing compilation of information about cell signaling and signal transduction. The 
URL is http://www.stke. org. 
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CENGAGENOW” ACTIVE FIGURE 32.4 A complete signal transduction pathway that connects a hormone 
receptor with transcription events in the nucleus. A number of similar pathways have been characterized. 
Test yourself on the concepts in this figure at www.cengage.com/login. 
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Pumilio, complexed 
with RNA (green) 





PDZ (dimer) 


FIGURE 32.5 Five of the protein modules found in cell signaling proteins. The binding specificity of these mod- 
ules is shown in Figure 32.6a. SH3 domains bind to proline-rich peptides; SH2 domains bind to phosphorylated 
tyrosine residues; PH domains bind to phosphoinositides (such as IP); PDZ domains bind to the terminal four 
or five residues of a target protein; pumilio domains bind to segments of RNA.A given protein may have a 
number of these protein modules, giving it the ability to interact with multiple partners. 


32:30 How Do Signal-Transducing Receptors Respond 
to the Hormonal Message? 


All receptors that mediate transmembrane signaling processes fit into one of three 
receptor superfamilies (Figure 32.7): 


1. The G-protein—coupled receptors (see Section 32.4) are integral membrane pro- 
teins with an extracellular recognition site for ligands and an intracellular recog- 
nition site for a GTP-binding protein (see following discussion). 

2. The single-transmembrane segment (1-TMS) catalytic receptors are proteins with 
only a single transmembrane segment and substantial globular domains on both 
the extracellular and the intracellular faces of the membrane. The extracellular 
domain in the ligand recognition site and the intracellular catalytic domain is ei- 
ther a tyrosine kinase or a guanylyl cyclase. 

3. Oligomeric ion channels consist of associations of protein subunits, each of 
which contains several transmembrane segments. These oligomeric structures 
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FIGURE 32.6 (a) Many signaling proteins consist of com- 
binations of protein modules, each with a specific bind- 
ing or enzymatic function. (b) Such multifunctional pro- 
teins can act as scaffolds that direct the assembly of 
large signaling complexes, termed signalsomes. 


FIGURE 32.7 The membrane receptor superfamilies. The 
G-protein—-coupled receptors are named for the GTP- 
binding proteins that mediate some of their effects. The 
receptor tyrosine kinases and receptor guanylyl cyclases 
contain intracellular enzymatic domains that respond to 
extracellular hormone binding. The oligomeric ion chan- 
nels (some of which were discussed in Chapter 9) open 
and close in response to ligand binding and/or changes 
of the transmembrane electrochemical potential. 
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are ligand-gated ion channels. Binding of the specific ligand typically opens the 
ion channel. The ligands for these ion channels are neurotransmitters. 


The G-Protein—Coupled Receptors Are 7-TMS Integral 
Membrane Proteins 


The G-protein—coupled receptors (GPCRs) have primary and secondary structure 
similar to that of bacteriorhodopsin (see Chapter 9), with seven transmembrane 
a-helical segments; they are thus known as 7-transmembrane segment (7-TMS) pro- 
teins. Rhodopsin and the 6-adrenergic receptors, for which epinephrine is a ligand, 
are good examples (Figure 32.8). The site for binding of cationic catecholamines 
to the adrenergic receptors is located within the hydrophobic core of the receptor. 
Binding of hormone to a GPCR induces a conformation change that activates a 
GTP-binding protein, also known as a G protein (discussed in Section 32.4). Acti- 
vated G proteins trigger a variety of cellular effects, including activation of adenylyl 
and guanylyl cyclases (which produce cAMP and cGMP from ATP and GTP), 
activation of phospholipases (which produce second messengers from phospho- 
lipids) and activation of Ca?* and K* channels (which leads to elevation of intra- 
cellular [Ca**] and [K*]). (All of these effects are described in Section 32.4.) 


The Single TMS Receptors Are Guanylyl Cyclases or Tyrosine Kinases 


Receptor proteins that span the plasma membrane with a single helical trans- 
membrane segment possess an external ligand recognition site and an internal do- 
main with enzyme activity—either receptor tyrosine kinase (RTK) or receptor 


(a) By-Adrenergic receptor b) 
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FIGURE 32.8 (a) The arrangement of the B2-adrenergic receptor in the membrane. Substitution of Asp!" in the 
third hydrophobic domain of the B-adrenergic receptor with an Asn or Gln by site-directed mutagenesis results 
in a dramatic decrease in affinity of the receptor for both agonists and antagonists. Significantly, this Asp residue 
is conserved in all other GPCRs that bind biogenic amines but is absent in receptors whose ligands are not 
amines. Asp''? appears to be the counterion of the amine moiety of adrenergic ligands. (b) The structure of a Bx- 
adrenergic receptor (pdb id = 2RH1).The flexible third intracellular loop and C-terminal segment are not shown. 
(From Figure 2 from Palczewski, K., et al, 2000. Crystal structure of rhodopsin: A G-protein-coupled receptor. Science 289:739-745.) 
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FIGURE 32.9 (a) The EGF receptor and (b) the insulin re- 
ceptor are receptor tyrosine kinases. EGF receptors are 
activated by ligand-induced dimerization, whereas the 
insulin receptor is a glycoprotein composed of two guanylyl cyclase (RGC). Each of these enzyme activities is manifested in two dif- 
kinds of subunits in an a8; tetramer stabilized by disu- -ferent cellular forms. Thus, guanylyl cyclase activity is found in both membrane- 
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a receptor guanylyl cyclase with a large extracellular lipid-anchored proteins. 

hormone-binding domain and two intracellular do- 

mains, and activation typically involves ligand-induced 


dimerization. (d) The growth factor receptor Retisare- RTKs and RGCs Are Membrane-Associated Allosteric Enzymes 


ceptor tyrosine kinase that mediates the effects of neu- 








ronal growth factors known as neurotrophins. The Ret The binding of polypeptide hormones and growth factors to RTKs and RGCs acti- 
receptor domains (yellow) requires a GPl-anchored vates the intracellular enzyme activity of these proteins. These catalytic receptors are 
coreceptor (blue) for binding of ligands (pink). composed of three domains (Figure 32.9): an extracellular receptor-binding domain 


(which may itself include several subdomains), a transmembrane domain consisting 
of a single transmembrane a-helix, and an intracellular domain. This intracellular 
portion includes a tyrosine kinase or guanylyl cyclase domain that mediates the bio- 
logical response to the hormone or growth factor via its catalytic activity and a regu- 
latory domain that contains multiple phosphorylation sites. The human genome 
contains at least 58 different RTKs, which can be grouped into about 20 families on 
the basis of their kinase domain sequences and the various extracellular subdomains. 
The epidermal growth factor (EGF) receptor and the insulin receptor are represen- 
tative of this class of receptor proteins. 

Given that the extracellular and intracellular domains of RTKs and RGCs are joined 
by only a single transmembrane helical segment, how does extracellular hormone 
binding activate intracellular enzyme activity? How is the signal transduced? As shown 
in Figure 32.10, signal transduction occurs by hormone-induced oligomeric associa- 
tion of receptors. Hormone binding triggers a conformational change in the extracel- 
lular domain, which induces oligomeric association. Oligomeric association allows ad- 
jacent cytoplasmic domains to interact, leading to phosphorylation of the cytoplasmic 
domains and stimulation of cytoplasmic enzyme activity. By virtue of these ligand- 
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CENGAGENOW” ANIMATED FIGURE 32.10 Ligand-induced oligomerization of 
membrane receptors can occur in several ways. (a) EGF receptor dimerization 

and activation involves binding of hormone to the extracellular domains of two 
receptor molecules. (b) The insulin receptor is a preexisting tetramer. Two insulin- 
binding sites are created by association of the extracellular domains. (c) RGCs bind 
a single hormone ligand at the dimer interface. (Ligand dimers could presumably 
act in a similar manner). (d) Neurotrophin activation of Ret involves association of 
two molecules of the Ret kinase and two molecules of a GPl-anchored coreceptor 
protein. See this figure animated at www.cengage.com/login. 





induced conformation changes and oligomeric interactions, RTKs and RGCs are 
membrane-associated allosteric enzymes. 


EGF Receptor Is Activated by Ligand-Induced Dimerization 


Human EGF is a 53-amino acid peptide that stimulates proliferation of epithelial 
cells (cells that cover a surface or line a cavity in biological tissues). The human EGF 
receptor is a 1186-amino acid RTK. The extracellular domain, where EGF binds, 
contains 622 amino acids and is divided into four domains (Figure 32.9). Domains 
I and III are similar -helical barrels, whereas domains II and IV are long, slender, 
cysteine-rich domains stabilized by disulfide bonds. Domain II is characterized by a 
B-hairpin structure that protrudes from the middle of the domain (Figure 32.11a). 

Prior to EGF binding, EGF receptors exist as inactive monomers in the plasma 
membrane. In this state, the four domains are folded so that domains II and IV lie 
parallel, with the 6-hairpin of domain II in contact with domain IV (Figure 32.11a). 
Binding of EGF to domain I induces a conformation change that rotates domains I 
and II so that the bound EGF is brought into contact with domain III and the 
B-hairpin is extended away from the rest of the structure. Pairs of such receptor 
monomers then dimerize by mutual association of the -hairpin structures (Figure 
32.11b). The next event is the critical step in transmembrane signal transduction. Dimer- 
ization of the extracellular domains brings the intracellular domains together, acti- 
vating the tyrosine kinase activity. Thus, EGF-induced dimerization allows an extra- 
cellular signal (EGF) to exert an intracellular response. 


EGF Receptor Activation Forms an Asymmetric Tyrosine Kinase Dimer 


The tyrosine kinase domain of the EGF receptor consists of an N-terminal domain 
built around a twisted -sheet and a C-terminal domain that is primarily a-helical 
(Figure 32.12a). In the inactive, monomeric EGF receptor, the active site of the 
tyrosine kinase domain is blocked by a 30-residue loop of the protein (residues 
831-860) that is termed the activation loop. Extracellular dimer formation by the 
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FIGURE 32.11 (a) In the absence of hormone, the extracellular domain of the EGF recep- 
tor is folded with the hairpin loop and dimerization interface on domain II (green) asso- 
N ciated with domain IV (orange). Binding of EGF (red) induces a conformation change 
that exposes the dimerization interface and promotes formation of the activated dimer 
complex (pdb id = 1NQL). (b) Dimerization involves dove-tailing of the domain II hair- 
pin loops (pdb id = 11VO). 


EGF receptor appears to promote formation of an asymmetric dimer of the intra- 
cellular tyrosine kinase domains (Figure 32.12b), with the C-terminal lobe of one 
kinase domain juxtaposed with the N-terminal lobe of the other kinase domain. In 
this asymmetric dimer, one monomer is inactive but acts as an activator of kinase ac- 
tivity in the other monomer. Conversion of the kinase domain from its inactive state 
to the active conformation involves rotation of the activation loop out of the active 
site, making room for a peptide substrate to enter the site (Figure 32.12c). 

The activated tyrosine kinase domain of the EGF receptor can phosphorylate 
several tyrosine residues at its own C-terminus (Figure 32.13), a process termed 
autophosphorylation. These phosphorylated tyrosines are binding sites for a variety 
of other signaling proteins that contain phosphotyrosine-binding SH2 domains (see 
Figures 32.5 and 32.6). Each of these SH2-domain-containing proteins can initiate 
several signal transduction cascades. 





FIGURE 32.12 (a) The kinase domain of the EGF receptor consists of an N-terminal lobe built around a twisted 
B-sheet and an a-helical C-terminal lobe. Bound ATP is pink, and the activation loop, shown blocking the 
active site, is blue. (b) Hormone-induced receptor dimerization promotes formation of an asymmetric dimer of 
the intracellular kinase domains (yellow and green). (c) Before hormonal activation (left), the kinase active site 
is blocked by a 30-residue activation loop (green). In the activated complex (right), the activation loop (green) 
has rotated away from the active site in one of the subunits, making room for binding of a peptide substrate 
(blue) (a, b, c [right]: pdb id = 2GS6; c [left]: pdb id = 2GS7). 
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FIGURE 32.13 Hormone-induced 
dimerization of the EGF receptor 
promotes autophosphorylation 

of five tyrosine residues near the 
C-terminus of each EGF receptor 
subunit. Signaling proteins that 
contain phosphotyrosine-binding 
SH2 domains can be activated by 
binding to these phosphotyrosines. 
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FIGURE 32.14 Substrates of the insulin receptor tyrosine 
kinase include the insulin receptor substrate (IRS) pro- 
teins, as well as Gab-1, Shc, and APS. The phosphorylated 
substrates in turn bind to several families of SH2 
domain-containing proteins, activating several signal- 
ing pathways. 
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FIGURE 32.15 The ectodomain formed by the insulin 
receptor is constructed from the two @/B units. These two 
units are folded so that the L1 domain of one a-chain is 
juxtaposed with the Fn1 domain of the other a-chain. 
Each insulin-binding site is created by the L1 and L2 
domains of one a-chain and the Fn2 domain of the 
other a/B chain pair (see Figure 32.9) (pdb id = 2DTG). 
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The Insulin Receptor Mediates Several Signaling Pathways 


Insulin, a small heterodimeric peptide (see Figure 5.8), is the most potent anabolic 
hormone known. It regulates blood glucose levels, and it promotes the synthesis 
and storage of carbohydrates, proteins, and lipids. Abnormalities of insulin pro- 
duction, action, or both lead to diabetes, as well as other serious health issues. In- 
sulin binding to receptors in liver, muscle, and other tissues triggers multiple sig- 
naling pathways, and insulin action is responsible for a variety of cellular effects. 

The insulin receptor is an RTK that catalyzes the phosphorylation of tyrosine 
residues of several intracellular substrates, including the insulin receptor substrate 
(IRS) proteins, and several other proteins known as Gab-1, Shc, and APS (Figure 
32.14). Each of these phosphorylated substrates binds a particular family of proteins 
containing SH2 domains (see Figures 32.5 and 32.6). These SH2 proteins interact 
specifically with phosphotyrosine-containing IRS sequences. Each of these signaling 
pathways can be confined to distinct cellular locations and can proceed with a dif- 
ferent time course, thus providing spatial and temporal dimensions to insulin ac- 
tion in cells. 


The Insulin Receptor Adopts a Folded Dimeric Structure 
in the Membrane 


Unlike the majority of RTKs, which are single-chain receptors, the insulin receptor 
is an Qo» tetramer. The a-chain contains two leucine-rich domains (L1 and L2) 
with a cysteine-rich domain (CR) between them, as well as an intact fibronectin do- 
main (Fn1) and a partial fibronectin domain (Fn2). The -chain contains the other 
half of Fn2, followed by a third fibronectin domain (Fn3), a transmembrane 
a-helix, and (inside the cell) the tyrosine kinase domain (see Figure 32.9). The ex- 
tracellular domain (the ectodomain) forms a folded dimer, with the L1 and L2 
domains of one a-chain juxtaposed with the Fn2 domain of the other a/f chain 
pair, to create the insulin-binding site (Figure 32.15). Thus, each of the two insulin- 
binding sites of the ectodomain consists of portions of both a-chains. 


Autophosphorylation of the Insulin Receptor Kinase Opens 
the Active Site 


Binding of insulin to its receptor activates the tyrosine kinase activity of the intra- 
cellular domains. Like the EGF receptor kinase, the insulin receptor kinase con- 
tains an activation loop that lies across the kinase active site, excluding substrate 
peptides and thus inhibiting the kinase. Phosphorylation of three tyrosine residues 
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FIGURE 32.16 In its inactive state, the insulin receptor 
tyrosine kinase is inhibited by an activation loop (yellow 
and red), which prevents substrate access to the active 
site (left—pdb id = 1IRK). Autophosphorylation of three 
tyrosine residues on the activation loop induces a con- 
formation change that rotates the loop out of the active 
site (right—pdb id = 11R3), permitting access by insulin 
receptor substrates (blue) and ATP (ATP analog in 
orange). 





on the activation loop—another case of autophosphorylation—causes the activa- 
tion loop to move out of and away from the active site (Figure 32.16). This opens 
the active site so that target proteins are bound and phosphorylated by the kinase, 
triggering the appropriate signaling pathways (see Figure 32.14). 


Receptor Guanylyl Cyclases Mediate Effects of Natriuretic Hormones 


When you eat a salty meal, your body secretes hormones that protect you from the 
harmful effects of excess salt intake. When your heart senses that blood volume is too 
great, it sends signals to the kidneys to excrete NaCl and water (processes termed 
natriuresis and diuresis, respectively). These are just two examples of the action of 
natriuretic hormones, which allow tissues and organs to communicate with one an- 
other to regulate the volumes of blood and other body fluids and the osmotic effects 
of Na+, K*, Cl’, and other ions. Guanylin and uroguanylin are produced in the in- 
testines after ingestion of a salty meal and are secreted into the intestinal lumen. 
Binding of guanylin and uroguanylin to receptor guanylyl cyclases on cell mem- 
branes lining the lumen activates the intracellular guanylyl cyclase (Figure 32.17a). 
cGMP (see Figure 10.12) produced in this reaction is a second messenger that in- 
hibits Na* uptake from the lumen and activates Cl” export into the lumen. The re- 
sult is a beneficial enhanced excretion of NaCl and water. (This effect can be carried 
too far, however. Heat-stable enterotoxin (ST) produced by E. coli—with a sequence 
similar to guanylin and uroguanylin—causes violent diarrhea.) Atrial natriuretic 
peptide (ANP) and brain natriuretic peptide (BNP, so named because it was discov- 
ered first in brain) are both produced primarily in the heart. When the heart mus- 
cle is stressed and stretched by increased blood volume, the heart secretes ANP and 
BNP into the blood. In the kidneys, ANP and BNP bind to RGCs in kidney tubules, 
activating intracellular guanylyl cyclase and producing cGMP (Figure 32.17b). As in 
the intestines, CGMP inhibits Na* uptake, once again stimulating excretion of salt 
and water and reducing blood volume. 


A Symmetric Dimer Binds an Asymmetric Peptide Ligand 


RGC monomers associate as dimers in the membrane, even in the absence of their 
hormone ligands (Figure 32.18a). A dimeric receptor complex is activated by the 
binding of a single polypeptide hormone. This raises two important questions about 
the mechanism of action of these receptors: (1) How does an asymmetric hormone 
ligand (for example, ANP) bind to a symmetric homodimeric receptor? And (2) how 
does hormone binding to its extracellular dimeric receptor activate the intracellular 
guanylyl cyclase domain? Answers to these provocative questions have been provided 
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FIGURE 32.17 (a) Ingestion of a salty meal triggers excretion of NaCl and water in the intestines. Binding of 
guanylin and uroguanylin to RGC on cell membranes lining the lumen activates the intracellular guanylyl 
cyclase. cGMP produced by the cyclase inhibits Nat uptake from the lumen and activates Cl” export into the 
lumen. (b) Atrial natriuretic peptide (ANP) protects the heart and circulatory system from the deleterious 
effects of increased blood volume.When the heart muscle is stressed and stretched, the heart secretes ANP. 
ANP binding to RGCs in kidney tubules activates intracellular guanylyl cyclase, producing CGMP. Inhibition of 
Na* uptake in the kidneys stimulates excretion of salt and water, reducing blood volume. 


by Kunio Misono and his colleagues, who determined the structure of the ANP 
receptor-ANP complex (Figure 32.18b). Remarkably, there is no significant in- 
tramolecular conformational change in either of the ANP receptor monomers. How- 
ever, upon ANP binding, the two ANP receptor molecules undergo a twist motion in 
the membrane in order to insert the ANP hormone between them (Figure 32.19). 
The hormone lies like a disc between the two receptor subunits. The hormone- 
induced twist of the extracellular domains suggests a mechanism for activation of 
guanylyl cyclase activity across the membrane. Rotation of the transmembrane 
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FIGURE 32.18 Activation of the ANP receptor involves binding of an asymmetric ligand at the interface of two 

identical receptor domains. Comparison of the structures of the receptor domain in the absence (a) and 

presence (b) of ANP reveals no significant intramolecular conformational change in either of the receptor 

monomers. ANP binding induces a twist of the two ANP receptor molecules, allowing the ANP hormone to 

insert between them. (a) PDB file provided by Kunio Misono, University of Nevada, Reno; (b) pdb id = 1734. 


32.3 How Do Signal-Transducing Receptors Respond to the Hormonal Message? 1023 


Complex 


apo 





helices upon hormone binding could reorient the two intracellular domains, giving 
rise to guanylyl cyclase activity. 


Nonreceptor Tyrosine Kinases Are Typified by pp60* 


The first tyrosine kinases to be discovered were associated with viral transforming 
proteins. These proteins, produced by oncogenic viruses, enable the virus to trans- 
form animal cells, that is, to convert them to the cancerous state. A prime example 
is the tyrosine kinase expressed by the src gene of Rous or avian sarcoma virus. The 
protein product of this gene is pp60"*"* (the abbreviation refers to phosphopro- 
tein, 60 kD, viral origin, sarcoma-causing). The v-src gene was derived from the 
avian proto-oncogenic gene c-src during the original formation of the virus. The 
cellular proto-oncogene homolog of pp60**" is referred to as pp60". pp60"** is a 
526-residue peripheral membrane protein. It undergoes two post-translational 
modifications: First, the amino group of the NH>-terminal glycine is modified by 
the covalent attachment of a myristoyl group (this modification is required for 
membrane association of the kinase; see Figure 32.20). Then Ser!” and Tyr*!® are 


(b) 


(a) 





FIGURE 32.19 The rotation mechanism proposed by 
Misono and colleagues for transmembrane signaling by 
the ANP receptor. ANP binding causes a twist of the two 
extracellular domains, leading to rotation of the two 
intracellular domains and activation of guanylyl cyclase 
activity. PKLD is a protein-kinase-like domain and GCD is 
a guanylylcyclase domain. (Adapted from Misono, K., Ogawa, 
H, Qiu, Y., and Ogata, C., 2005. Structural studies of the natriuretic 
peptide receptor: A novel hormone-induced rotation mechanism 
for transmembrane signal transduction. Peptides 26:957—-968.) 





FIGURE 32.20 (a) The soluble tyrosine kinase pp60"*"* 
is anchored to the plasma membrane via an N-terminal 
myristoyl group. (b) The structure of protein tyrosine 
kinase pp60""5, showing AMP-PNP in the active site 
(blue, green, red), Tyr“ (orange), and Tyr?” (yellow). 
Tyr is phosphorylated (purple). 
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A DEEPER LOOK 





Nitric Oxide, Nitroglycerin, and Alfred Nobel 


NO.: is the active agent released by nitroglycerin (see accompany- 

ing figure), a powerful drug that ameliorates the symptoms of CH, — CH— CH, 
heart attacks and angina pectoris (chest pain due to coronary | | | 
artery disease) by causing the dilation of coronary arteries. Nitro- o O O 
glycerin is also the active agent in dynamite. Ironically, Alfred 

Nobel, the inventor of dynamite who also endowed the Nobel NO, Noy Noy 
prizes, himself suffered from angina pectoris. In a letter to a friend 


in 1885, Nobel wrote, “It sounds like the irony of fate that I should 
be ordered by my doctor to take nitroglycerin internally.” 


A The structure of nitroglycerin, a potent vasodilator. 
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phosphorylated. The phosphorylation at Tyr*!®, which increases kinase activity 


twofold to threefold, appears to be an autophosphorylation. On the other hand, 
phosphorylation at Tyr”? is inhibitory and is catalyzed by another kinase known as 
CSK. The significance of nonreceptor tyrosine kinase activity to cell growth and 
transformation is only partially understood, but 1% of all cellular proteins (many 
of which are also kinases) are phosphorylated by these kinases. 


Soluble Guanylyl Cyclases Are Receptors for Nitric Oxide 


Nitric oxide, or NO., a reactive free radical, acts as a neurotransmitter and as a sec- 
ond messenger, activating soluble guanylyl cyclase more than 400-fold. The cGMP 
thus produced also acts as a second messenger, inducing relaxation of vascular 
smooth muscle and mediating penile erection. As a dissolved gas, NO: is capable of 
rapid diffusion across membranes in the absence of any apparent carrier mecha- 
nism. This property makes NO- a particularly attractive second messenger because 
NO: generated in one cell can exert its effects quickly in many neighboring cells. 
NO: has a very short cellular half-life (1 to 5 seconds) and is rapidly degraded by 
nonenzymatic pathways. 


32:4) How Are Receptor Signals Transduced? 


Receptor signals are transduced in one of three ways to initiate actions inside 
the cell: 


1. Exchange of GDP for GTP by GTP-binding proteins (G proteins), which leads to 
generation of second messengers, including cAMP, phospholipid breakdown prod- 
ucts, and Ca?*. 

2. Receptor-mediated activation of phosphorylation cascades that in turn trigger ac- 
tivation of various enzymes. This is the action of the receptor tyrosine kinases 
described in Section 32.3. Protein kinases and protein phosphatases acting as ef- 
fectors will be discussed in Section 32.5. 

3. Conformation changes that open ion channels or recruit proteins into nuclear tran- 
scription complexes. Ion channels are discussed in Section 32.7, and the formation 
of nuclear transcription complexes was described in Chapter 29. 


GPCR Signals Are Transduced by G Proteins 


The signals of G-protein—coupled receptors (GPCRs) are transduced by GTP-binding 
proteins, known more commonly as G proteins. The large G proteins are hetero- 
trimers consisting of a- (45 to 47 kD), B- (35 kD), and y- (7 to 9 kD) subunits. The 
a-subunit binds GDP or GTP and has an intrinsic, slow GTPase activity. The Gagy 
complex (Figure 32.21, and see Figure 15.19) in the unactivated state has GDP at the 
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nucleotide site. Binding of hormone to receptor stimulates a rapid exchange of GTP 
for GDP on G,. The binding of GTP causes G, to dissociate from Gg, and to associate 
with an effector protein such as adenylyl cyclase (Figure 32.22). Binding of Gy(GTP) ac- 
tivates adenylyl cyclase. The adenylyl cyclase actively synthesizes cAMP as long as G, (GTP) 
remains bound to it. However, the intrinsic GTPase activity of G, eventually hydrolyzes 
GTP to GDP, leading to dissociation of G,(GDP) from adenylyl cyclase and reassocia- 
tion with the Gg, dimer, regenerating the inactive heterotrimeric G,g, complex. 

The hormone-activated GPCR is a guanine-nucleotide exchange factor (GEF)— 
promoting the exchange of GDP with GTP on the G protein—in a manner entirely 
similar to the interaction of EF-Ts with EF-Tu(GDP) (pages 969-970). By contrast, 
the Gg, complex, which normally acts to inhibit the spontaneous release of GDP 
from G, (in the inactivated state of the GPCR), is termed a guanine—nucleotide 
dissociation inhibitor (GDI). Other proteins may also behave as GEFs and GDIs; 
their actions are discussed in Section 32.5. 

Two stages of amplification occur in the G-protein—mediated hormone response. 
First, a single hormone-receptor complex can activate many G proteins before the 
hormone dissociates from the receptor. Second, and more obvious, the G,-activated — FIGURE 32.21 A heterotrimeric G protein (a—pink, 
adenylyl cyclase synthesizes many cAMP molecules. Thus, the binding of hormone to 8 yellow, y—blue) docked with a Bxadrenergic re- 
a very small number of membrane receptors stimulates a large increase in concen- CEPON (JEET) (Pabia enna TSON 
tration of cAMP within the cell. The hormone receptor, G protein, and cyclase con- 
stitute a complete hormone signal transduction unit. 

Hormone-receptor—mediated processes regulated by G proteins may be stimula- 
tory or inhibitory. Each hormone receptor interacts specifically with either a stimu- 
latory G protein, denoted G,, or an inhibitory G protein, denoted G; (Figure 32.22). 





Cyclic AMP Is a Second Messenger 


Cyclic AMP (denoted cAMP) was identified in 1956 by Earl Sutherland, who 
termed cAMP a second messenger, because it is the intracellular response pro- 
voked by binding of hormone (the first messenger) to its receptor. Since Suther- 
land’s discovery of cAMP, many other second messengers have been identified 
(Table 32.1). The concentrations of second messengers in cells are carefully reg- 
ulated. Synthesis or release of a second messenger is followed quickly by degra- 
dation or removal from the cytosol. Following its synthesis by adenylyl cyclase, 
cAMP is broken down to 5’-AMP by phosphodiesterase (Figure 32.23). 


B-Effector Qo-Effector 







Stimulatory Adenylyl Inhibitory 
-receptor cyclase Qo-receptor 








G protein exchange 





CENGAGENOW” ACTIVE FIGURE 32.22 Adenylyl cyclase activity is modulated by the interplay of stimulatory 
(G) and inhibitory (G) G proteins. Binding of hormones to B;- and B2-adrenergic receptors activates adenylyl 
cyclase via G, whereas hormone binding to a-adrenergic receptors leads to the inhibition of adenylyl cyclase. 
Inhibition may occur by direct inhibition of cyclase activity by G;,, or by binding of Gig, to Gsa. Test yourself 
on the concepts in this figure at www.cengage.com/login. 
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TABLE 32.1 

Messenger Source Effect 

cAMP Adenylyl cyclase Activates protein kinases 

cGMP Guanylyl cyclase Activates protein kinases, regulates ion channels, regulates 
phosphodiesterases 

Ca?* Ion channels in ER and plasma membrane Activates protein kinases, activates Ca?*-modulated proteins 

IP; PLC action on PI Activates Ca?* channels 

DAG PLC action on PI Activates protein kinase C 

Phosphatidic acid Membrane component and product of PLD Activates Ca?* channels, inhibits adenylyl cyclase 

Ceramide PLC action on sphingomyelin Activates protein kinases 

Nitric oxide (NO) NO synthase Activates guanylyl cyclase, relaxes smooth muscle 

Cyclic ADP-ribose cADP-ribose synthase Activates Ca?* channels 











*IP3 is inositol-1,4,5-trisphosphate; Pl is phosphatidylinositol; DAG is diacylglycerol; PLC is phospholipase C; PLD is phospholipase D (see Figure 32.26). 
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HO OH HO H* 
ATP BP 
H,O Cyclic AMP 
FIGURE 32.23 Cyclic AMP is synthesized by membrane- P rophosphatase 
bound adenylyl cyclase and degraded by soluble È 
phosphodiesterase. 2® 
CENGAGENOW” Go to CengageNOW at Adenylyl cyclase (AC) is an integral membrane enzyme. Its catalytic domain, on 
www.cengage.com/login and click the cytoplasmic face of the plasma membrane, includes two subdomains, denoted 
BiochemistryInteractive to explore the tn ae $ : : 
structure and function of adenylyl cyclase. VC, and IIC». Binding of the a-subunit of G, (denoted Gs«) activates the AC catalytic 
domain. 


Alfred Gilman, Stephen Sprang, and co-workers have determined the structure of 
a complex of G,, (with bound GTP) with the cytoplasmic domains (VC; and IC») of 
adenylyl cyclase (Figure 32.24). The G,, complex binds to a cleft at one corner of the 
Cə domain, and the surface of G,,-GTP that contacts adenylyl cyclase is the same sur- 
face that binds the Gg, dimer. The catalytic site, where ATP is converted to cyclic 
AMP, is far removed from the bound G protein. 


cAMP Activates Protein Kinase A 


All second messengers exert their cellular effects by binding to one or more target 
molecules. cAMP produced by adenylyl cyclase activates a protein kinase, which is thus 
known as cAMP-dependent protein kinase. Protein kinase A, as this enzyme is also known, 
activates many other cellular proteins by phosphorylation. The activation of protein ki- 
nase A by cAMP and regulation of the enzyme by intrasteric control was described in 
detail in Chapter 15. The structure of protein kinase A has served as a paradigm for un- 
derstanding many related protein kinases (see Figure 15.9). 


Ras and Other Small GTP-Binding Proteins Are 
Proto-Oncogene Products 


GTP-binding proteins are implicated in growth control mechanisms in higher or- 
ganisms. Certain tumor virus genomes contain genes encoding 21-kD proteins that 
bind GTP and show regions of homology with other G proteins. The first of these 
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F91 SWI 


CENGAGENOW” ACTIVE FIGURE 32.24 (a) Two views of the complex of G.g with the VG;-lIC, catalytic domain 
of adenylyl cyclase and Gs«. (b) Details of the Gs complex in the same orientation as the structures in (a). SW-I 
and SW-Il are “switch regions,” whose conformations differ greatly depending on whether GTP or GDP is bound. 
(Courtesy of Alfred Gilman, University of Texas Southwestern Medical Center) Test yourself on the concepts in this 
figure at www.cengage.com/login. 


genes to be identified was found in rat sarcoma virus and was dubbed the ras gene. 
Genes implicated in tumor formation are known as oncogenes; they are often mu- 
tated versions of normal, noncancerous genes involved in growth regulation, so- 
called proto-oncogenes. The normal, cellular Ras protein is a GTP-binding protein 
that functions in a manner similar to that of other G proteins described previously, 
activating metabolic processes when GTP is bound and becoming inactive when GTP 
is hydrolyzed to GDP. The GTPase activity of the normal Ras p21 is very low, as is ap- 
propriate for a G protein that regulates long-term effects like growth and differenti- 
ation. A specific GTPase-activating protein (GAP) increases the GTPase activity of 
the Ras protein. Mutant (oncogenic) Ras proteins have severely impaired GTPase ac- 
tivity, which apparently causes serious alterations of cellular growth and metabolism 
in tumor cells. The conformations of Ras proteins (Figure 32.25) in complexes with 
GDP are different from the corresponding complexes with GTP analogs such as 
GMP-PNP (a nonhydrolyzable analog of GTP in which the £-P and y-P are linked by 
N rather than by O). Two regions of the Ras structure change conformation upon 
GTP hydrolysis. These conformation changes mediate the interactions of Ras with 
other proteins, termed effectors. 


G Proteins Are Universal Signal Transducers 


A given G protein can be activated by several different hormone-receptor complexes. 
For example, either glucagon or epinephrine, binding to their distinctive receptor 
proteins, can activate the same species of G protein in liver cells. The effects are 
additive, and combined stimulation by glucagon and epinephrine leads to higher 
cytoplasmic concentrations of cAMP than activation by either hormone alone. 

G proteins are a universal means of signal transduction in higher organisms, ac- 
tivating many hormone-receptor-initiated cellular processes in addition to adenylyl 
cyclase. Such processes include, but are not limited to, activation of phospholipases 








(b) 





FIGURE 32.25 The structure of Ras complexed with 

(a) GDP (pdb id = 1LF5) and (b) GMP-PNP (pdb id = 
1LFO). The Ras p21-GMP-PNP complex is the active 
conformation of this protein. A Mg?* ion (red) is shown 
in both structures. 
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TABLE 32.2 CI f 

G Protein Location Stimulus Effector Effect 

Gs Liver Epinephrine, glucagon Adenylyl cyclase Glycogen breakdown 

G; Adipose tissue Epinephrine, glucagon Adenylyl cyclase Fat breakdown 

G; Kidney Antidiuretic hormone Adenylyl cyclase Conservation of water 

Gi Heart muscle Acetylcholine Potassium channel Decreased heart rate and 

pumping force 
Gi/G, Brain neurons Enkephalins, endorphins, | Adenylyl cyclase, potassium Changes in neuron electrical 
opioids channels, calcium channels activity 

G, Smooth muscle cells in | Angiotensin Phospholipase C Muscle contraction, blood 
blood vessels pressure elevation 

Gorr Neuroepithelial cells in Odorant molecules Adenylyl cyclase Odorant detection 
the nose 

Transducin (G,) Retinal rod and cone Light cGMP phosphodiesterase Light detection (vision) 
cells of the eye 











C and A; and the opening or closing of transmembrane channels for K*, Na*, and 
Ca?* in brain, muscle, heart, and other organs (Table 32.2). G proteins are integral 
components of sensory pathways such as vision and olfaction. More than 100 dif- 
ferent GPCRs and at least 21 distinct G proteins are known. At least a dozen differ- 
ent G-protein effectors have been identified, including a variety of enzymes and ion 
channels. 


Specific Phospholipases Release Second Messengers 


A diverse array of second messengers are generated by breakdown of membrane phospholipids. 
Binding of certain hormones and growth factors to their respective receptors triggers 
a sequence of events that can lead to the activation of specific phospholipases. The 
action of these phospholipases on membrane lipids produces the second messengers 
shown in Figure 32.26. 











(a) o PLC 
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fatty acid (arachidonate) phosphates 


FIGURE 32.26 (a) The general action of phospholipase 





A2 (PLA;), phospholipase C (PLC), and phospholipase D y 
(PLD). (b) The synthesis of second messengers from Eicosanoids 
phospholipids by the action of phospholipases and and 


sphingomyelinase (SMase). ? 
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HUMAN BIOCHEMISTRY 


The disease state known as cancer is the uncontrolled growth and 
proliferation of one or more cell types in the body. Control of cell 
growth and division is an incredibly complex process, involving 
the signal-transducing proteins (and small molecules) described 
in this chapter and many others like them. The genes that give rise 
to these growth-controlling proteins are of two distinct types: 


1. Oncogenes: These genes code for proteins that are capable 
of stimulating cell growth and division. In normal tissues and 
organisms, such growth-stimulating proteins are regulated so 
that growth is appropriately limited. However, mutations in 
these genes may result in loss of growth regulation, leading to 
uncontrolled cell proliferation and tumor development. These 
mutant genes are known as oncogenes because they induce the 
oncogenic state—cancer. The normal versions of these genes 
are termed proto-oncogenes; proto-oncogenes are essential for 
normal cell growth and differentiation. Oncogenes are domi- 
nant, because mutation of only one of the cell’s two copies of 
the gene can lead to tumor formation. Table A lists a few of the 
known oncogenes (more than 60 are now known). 








TABLE A | 

Proto- 

Oncogene Neoplasm(s) 

Abl Chronic myelogenous leukemia 

ErbB-1 Squamous cell carcinoma; astrocytoma 

ErbB-2 (Neu) Adenocarcinoma of breast, ovary, and stomach 

Myc Burkitt’s lymphoma; carcinoma of lung, breast, 
and cervix 

H-Ras Carcinoma of colon, lung, and pancreas; 
melanoma 

N-Ras Carcinoma of genitourinary tract and thyroid; 
melanoma 

Ros Astrocytoma 

Sre Carcinoma of colon 

oi Several 








Adapted from Bishop, J. M., 1991. Molecular themes in oncogenesis. Cell 64:235-248; 
Croce, C. M., 2008. Oncogenes and cancer. New England Journal of Medicine 358:502-511. 





Cancer, Oncogenes, and Tumor Suppressor Genes 


2. Tumor suppressor genes: These genes code for proteins 
whose normal function is to turn off cell growth. A mutation 
in one of these growth-limiting genes may result in a protein 
product that has lost its growth-limiting ability. Since the nor- 
mal products suppress tumor growth, the genes are known as 
tumor suppressor genes. Because both cellular copies of a tumor 
suppressor gene must be mutated to foil its growth-limiting 
action, these genes are recessive in nature. Table B presents 
several recognized tumor suppressor genes. 


Careful molecular analysis of cancerous tissue has shown that 
tumor development may result from mutations in several proto- 
oncogenes or tumor suppressor genes. The implication is that there 
is redundancy in cellular growth regulation. Many (if not all) tumors 
are either the result of interactions of two or more oncogene prod- 
ucts or arise from simultaneous mutations in a proto-oncogene and 
both copies of a tumor suppressor gene. Cells have thus evolved 
with overlapping growth-control mechanisms. When one is com- 
promised by mutation, others take over. 








TABLE B | 

Tumor 

Suppressor 

Gene Neoplasm(s) 

RBI Retinoblastoma; osteosarcoma; carcinoma of 
breast, bladder, and lung 

p53 Astrocytoma; carcinoma of breast, colon, and 
lung; osteosarcoma 

WTI Wilms’ tumor 

DCC Carcinoma of colon 

NFI Neurofibromatosis type 1 

FAP Carcinoma of colon 

MEN-1 Tumors of parathyroid, pancreas, pituitary, and 
adrenal cortex 











Adapted from Bishop, J. M., 1991. Molecular themes in oncogenesis. Cell 64:235-248, and 
Sherr, C. J., 2004. Principles of tumor suppression. Cell 116:235-246. 





Inositol Phospholipid Breakdown Yields Inositol-1,4,5-Trisphosphate 


and Diacylglycerol 


Breakdown of phosphatidylinositol (PI) and its derivatives by phospholipase C 
produces a family of second messengers. In the best-understood pathway, 
successive phosphorylations of PI produce phosphatidylinositol-4-P (PIP) and 
phosphatidylinositol-4,5-bisphosphate (PIP). Four isozymes of phospholipase C (de- 
noted a, 8, y, and 6) hydrolyze PI, PIP, and PIP». Hydrolysis of PIP by phospholipase 
C yields the second messenger inositol-1,4,5-trisphosphate (IP), as well as another 
second messenger, diacylglycerol (DAG) (Figure 32.27). IPs is water soluble and dif- 
fuses to intracellular organelles where release of Ca?* is activated. DAG, on the other 
hand, is lipophilic and remains in the plasma membrane, where it activates a Ca**- 
dependent protein kinase known as protein kinase C (see following discussion). 
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FIGURE 32.27 The family of second messengers pro- 
duced by phosphorylation and breakdown of phos- 
phatidylinositol. PLC action instigates a bifurcating path- 
way culminating in two distinct and independent 
second messengers: DAG and IP3. 


FIGURE 32.28 Phospholipase C-G is activated specifically 
by Gg, a GTP-binding protein, and also by Ca**. 


FIGURE 32.29 Phospholipase C-y is activated upon phos- 
phorylation by receptor tyrosine kinases and by Ca?*. 


PI —_——_—_——+> PI4-P ———_——_> PH4,5-P, 





Activation of Phospholipase C Is Mediated by G Proteins 

or by Tyrosine Kinases 

Phospholipase C-8, C-y, and C-ô are all Ca?*-dependent, but the different phos- 
pholipase C isozymes are activated by different intracellular events. Phospholipase 


C- is stimulated by G proteins (Figure 32.28). On the other hand, phospholipase 
C-y is activated by receptor tyrosine kinases (Figure 32.29). The domain organiza- 
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tion of phospholipase C-6 and C-y is shown in Figure 32.30. The X and Y domains 
of phospholipase C-6 and C-y are highly homologous, and both of these domains 
are required for phospholipase C activation. The other domains of these isozymes 
confer specificity for G-protein activation or tyrosine kinase activation. 


Phosphatidylcholine, Sphingomyelin, and Glycosphingolipids 
Also Generate Second Messengers 


In addition to PI, other phospholipids serve as sources of second messengers. 
Breakdown of phosphatidylcholine by phospholipases yields a variety of second 
messengers, including DAG, phosphatidic acid, and prostaglandins. The action 
of sphingomyelinase on sphingomyelin produces ceramide, which stimulates 
ceramide-activated protein kinase. Similarly, gangliosides (such as ganglioside 
Gms; see Chapter 8) and their breakdown products modulate the activity of pro- 
tein kinases and GPCRs. 


Calcium Is a Second Messenger 


Calcium ion is an important intracellular signal. Binding of certain hormones and 
signal molecules to plasma membrane receptors can cause transient increases in 
cytoplasmic Ca?* levels, which in turn can activate a wide variety of enzymatic 
processes, including smooth muscle contraction, exocytosis, and glycogen metabo- 
lism. (Most of these activation processes depend on special Ca?*-binding proteins 
discussed in the following section.) Cytoplasmic [Ca?*] can be increased in two ways 
(Figure 32.31). As mentioned briefly earlier, cAMP can activate the opening of 
plasma membrane Ca?* channels, allowing extracellular Ca?* to stream in. On the 
other hand, cells also contain intracellular reservoirs of Ca2*, within the endoplas- 
mic reticulum and calciosomes, small membrane vesicles that are similar in some 
ways to muscle sarcoplasmic reticulum. These special intracellular Ca? stores are 
not released by cAMP. They respond to IPs, a second messenger derived from PI. 


Intracellular Calcium-Binding Proteins Mediate the Calcium Signal 


Given the central importance of Ca?* as an intracellular messenger, it should not be 
surprising that complex mechanisms exist in cells to manage and control Ca**. When 
Ca?* signals are generated by cAMP, IPs, and other agents, these signals are translated 
into the desired intracellular responses by calcium-binding proteins, which in turn 
regulate many cellular processes (Figure 32.32). One of these, protein kinase C, is de- 
scribed in Section 32.5. The other important Ca?*-binding proteins can, for the most 
part, be divided into two groups on the basis of structure and function: (1) the 
calcium-modulated proteins, including calmodulin, parvalbumin, troponin C, and 
many others, all of which have in common a structural feature called the EF hand 
(Figure 32.33), and (2) the annexin proteins, a family of homologous proteins that 
interact with membranes and phospholipids in a Ca**-dependent manner. 
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FIGURE 32.30 The amino acid sequences of phospho- 
lipase C isozymes B, y, and 6 share two homologous 
domains, denoted X and Y. The sequence of y-isozyme 
contains src homology domains, denoted SH2 and SH3. 
SH2 domains (approximately 100 residues in length) in- 
teract with phosphotyrosine-containing proteins (such 
as RTKs), whereas SH3 domains mediate interactions 
with Pro-rich sequences. (Adapted from Dennis, E., Rhee, S, 
Gillah, M., and Hannun, E., 1991. Role of phospholipases in gener- 
ating lipid second messengers in signal transduction. The FASEB 
Journal 5:2068-2077.) 





Calciosome 


Endoplasmic 
reticulum 


CENGAGENOW” ANIMATED FIGURE 32.31 Cytosolic 
[Ca?*] increases occur via the opening of Ca?* channels 
in the membranes of calciosomes, the endoplasmic 
reticulum, and the plasma membrane. See this figure 
animated at www.cengage.com/login. 





PI Metabolism and the Pharmacology of Li+ 





An intriguing aspect of the phosphoinositide story is the specific ac- 
tion of lithium ion, Li*, on several steps of PI metabolism. Lithium 
salts have been used in the treatment of manic-depressive illnesses for 
more than 30 years, but the mechanism of lithium’s therapeutic ef- 
fects had been unclear. Recently, however, several reactions in the 
phosphatidylinositol degradation pathway have been shown to be 


sensitive to Lit ion. For example, Li* is an uncompetitive inhibitor 
of myo-inositol monophosphatase (see Chapter 13). Lit levels simi- 
lar to those used in treatment of manic illness thus lead to the ac- 
cumulation of several key intermediates. This story is far from com- 
plete, and many new insights into phosphoinositide metabolism 
and the effects of Li* can be anticipated. 
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CENGAGENOW” ACTIVE FIGURE 32.32 |P3-mediated signal transduction pathways. Increased [Ca*] activates 
protein kinases, which phosphorylate target proteins. Ca2*/CaM represents calci-calmodulin (Ca?* complexed 
with the regulatory protein calmodulin). Test yourself on the concepts in this figure at www.cengage.com/ 
login. 


(a) (b) 





FIGURE 32.33 (a) Structure of uncomplexed calmodulin (pdb id = 1LKJ). Calmodulin, with four Ca?*-binding 
domains, forms a dumbbell-shaped structure with two globular domains joined by an extended, central helix. 
Each globular domain juxtaposes two Ca?+-binding EF-hand domains. An intriguing feature of these EF-hand 
domains is their nearly identical three-dimensional structure despite a relatively low degree of sequence homol- 
ogy (only 25% in some cases). (b, c) Complex of calmodulin (red) with a peptide from myosin light chain kinase 
(blue); (b) side view; (c) top view (pdb id = 1QTX). 
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More than 170 calcium-modulated proteins are known. All possess a characteris- 
tic peptide domain consisting of a short a-helix, a loop of 12 amino acids, and a sec- 
ond a-helix (Figure 32.33). Robert Kretsinger at the University of Virginia initially 
discovered this pattern in parvalbumin, a protein first identified in the carp fish and 
later in neurons possessing a high firing rate and a high oxidative metabolism. 
Kretsinger lettered the six helices of parvalbumin A through F. He noticed that the 
E and F helices, joined by a loop, resembled the thumb and forefinger of a right 
hand and named this structure the EF hand, a name in common use today to iden- 
tify the helix-loop-helix motif in calcium-binding proteins. In the EF hand, Ca?" is 
coordinated by six carboxyl oxygens contributed by a glutamate and three aspar- 
tates, by a carbonyl oxygen from a peptide bond, and by the oxygen of a coordi- 
nated water molecule. The EF hand was subsequently identified in calmodulin, tro- 
ponin C, and calbindin-9K. Most of the known EF-hand proteins possess two or 
more (as many as eight) EF-hand domains, usually arranged so that two EF-hand do- 
mains may directly contact each other. 


Calmodulin Target Proteins Possess a Basic Amphiphilic Helix 


The conformations of EF-hand proteins change dramatically upon binding of Ca?* 
ions. This change promotes binding of the EF-hand protein with its target pro- 
tein(s). For example, calmodulin (CaM), a 148-residue protein found in many 
cell types, modulates the activities of a large number of target proteins, including 
Ca**-ATPases, protein kinases, phosphodiesterases, and NAD* kinase. CaM binds to 
these and to many other proteins with extremely high affinities (Kp values typically 
in the high picomolar to low nanomolar range). All CaM target proteins possess a 
basic amphiphilic alpha helix (a Baa helix), to which CaM binds specifically and 
with high affinity. Viewed end-on, in the so-called helical wheel representation (Fig- 
ure 32.34), a Baa helix has mostly hydrophobic residues on one face; basic residues 
are collected on the opposite face. However, the Baa helices of CaM target proteins, 
although conforming to the model, show extreme variability in sequence. How does 
CaM, itself a highly conserved protein, accommodate such variety of sequence and 
structure? Each globular domain consists of a large hydrophobic surface flanked by 
regions of highly negative electrostatic potential—a surface suitable for interacting 
with a Baa helix. The long central helix joining the two globular regions behaves as 


(a) (b) 
Leu Lys 





Leu 


FIGURE 32.34 Helical wheel representations of (a) a model calmodulin-binding peptide, Ac-WKKLLKLLKKLLKL- 
CONH,;, and (b) the calmodulin-binding domain of spectrin. Positively charged and polar residues are indi- 
cated in green, and hydrophobic residues are orange. (Adapted from O'Neil, K, and DeGrado, W, 1990. How calmodulin 
binds its targets: Sequence independent recognition of amphiphilic a-helices. Trends in Biochemical Sciences 15:59-64.) 
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a long, flexible tether. When the target protein is bound, the two globular domains 
fold together (Figure 32.33b). The flexible nature of the tethering helix allows the 
two globular domains to adjust their orientation synergistically for maximal binding 
of the target protein or peptide. 


3225» How Do Effectors Convert the Signals 
to Actions in the Cell? 


Transduction of the hormonal signal leads to activation of effectors—usually pro- 
tein kinases and protein phosphatases—that elicit a variety of actions that regulate 
discrete cellular functions. Of the thousands of mammalian kinases and phos- 


phatases, the structures and functions of a few are representative. 


A DEEPER LOOK 





Mitogen-Activated Protein Kinases and Phosphorelay Systems 


In multicellular organisms, many physiological processes, includ- 
ing mitosis, gene expression, metabolism, and programmed death 
of cells, are regulated by a family of mitogen-activated protein 
kinases (MAPKs). (A mitogen is any agent that induces cell divi- 
sion, that is, mitosis.) MAPKs phosphorylate specific serines and 
threonines of target protein substrates, and these phosphorylation 
events function as switches to turn on or off the activity of the sub- 
strate proteins. These “substrates” may be other protein kinases, 
phospholipases, transcription factors, and cytoskeletal proteins. 
Protein phosphatases reverse the process, removing the phos- 
phates that were added by MAPKs. 

MAPKs are part of a phosphorelay system composed of three 
kinases that are activated in sequence (see accompanying figure). 
In such systems, the MAPK itself is phosphorylated by a MAPK 


Stimulus 
Activator 





> MAPK phosphorelay systems. The left column is a 
general model. The four columns to the right show the 
four known MAPK phosphorelay families. 


MKK 


kinase (denoted MKK), which is itself phosphorylated by a MAPK 
kinase kinase (denoted MKKK). MKKKs have distinct domains 
and motifs that respond to different cellular stimuli, and they have 
other domains that recognize specific MKKs. The same kinds of 
specificities regulate the action of MKKs. These specificities are 
accounted for in the classification of four subfamilies of MAPKs: 
One group is that of the extracellular signal-regulated kinases, 
notably ERK] and ERK2; another is the c-Jun-amino-terminal 
kinases, including JNK, JNK1, and JNK2; a third group depends 
on the ERK5 kinase; and the fourth group involves the p38 ki- 
nases, including p38a, p386, p38y, and p386. There are un- 
doubtedly other MAPK families yet to be discovered. As shown in 
the figure, these phosphorelay systems link a variety of stimuli to 
substrates that affect many cellular functions. 
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From Johnson, G. L., and Lapadat, R., 2002. Mitogen-activated protein ki- 
nase pathways mediated by ERK, JNK, and p38 protein kinases. Science 298: 
1911-1912. 











32.5 How Do Effectors Convert the Signals to Actions in the Cell? 1035 





Protein Kinase A Is a Paradigm of Kinases 


Most protein kinases share a common catalytic core first characterized in protein ki- 
nase A (PKA), the enzyme that phosphorylates phosphorylase kinase (see Figure 
15.9 and Figure 15.17). The active site of the catalytic subunit of PKA in a ternary 
complex with MnAMP-PNP and a pseudosubstrate inhibitor peptide, as shown in 
Figure 32.35, includes a glycine-rich £-strand that acts as a flap over the triphosphate 
moiety of the bound nucleotide. A conserved residue, Asp, is the catalytic base that 
deprotonates the Ser/Thr-OH during phosphorylation, and Lys!® stabilizes the tran- 
sition state of the reaction. Three Glu residues on the enzyme are involved in recog- 
nition of the pseudosubstrate inhibitor peptide. 


Protein Kinase C Is a Family of lsozymes 


The enzymes called protein kinase C are actually a family of similar enzymes— 
isozymes—that encompass three subclasses. The “conventional PKCs,” a, BI, BII, 
and y, are regulated by Ca?*, diacylglycerol (DAG), and phosphatidylserine (PS). 
Because Ca?" levels increase in the cell in response to IPs, the activation of conven- 
tional PKCs depends on both of the second messengers released by the hydrolysis 
of PIP». The “novel PKCs,” ô, €, 0, and n, are Ca**-independent but are regulated by 
DAG and PS. PKCs ¢@, v, and A are termed “atypical” and are activated by PS alone. 
These various cofactor requirements are imparted by subdomains represented in 
the conventional PKC polypeptide sequence. Conventional PKCs are comprised 
(Figure 32.36) of four conserved domains (C1—C4) and five variable regions 
(V1-V5). Domain Cl is a pseudosubstrate sequence that regulates the kinase by intra- 
steric control (see page 461), C2 is a Ca?*-binding domain, C3 is the ATP-binding do- 
main, and C4 binds peptide substrates. 


Conventional a BI BIL y C1lA C1B 12 c3 C4 





pseudosubstrate hinge 





Apia gaa m 


FIGURE 32.35 The structure of the catalytic subunit of 
PKA in a ternary complex with MnAMP-PNP and a 
pseudosubstrate inhibitor peptide (pink). A glycine-rich 
B-strand acts as a flap over the triphosphate moiety of 
the bound nucleotide. The glycine-rich flap that covers 
the ATP-binding site is shown in white, AMP-PNP is 
bound in the ATP site, Asp'® is shown in beige, and 
Lys!® is shown below in white (pdb id = 1ATP). 





CENGAGENOW” Go to CengageNOW at 
www.cengage.com/login and click 
Biochemistrylnteractive to explore the 
structure and function of protein kinase C. 


FIGURE 32.36 The primary structures of the PKC 
isozymes. Conserved domains C1-C4 are indicated. 
Variable regions are shown as simple lines. 
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12-O-Tetradecanoylphorbol-13-acetate 


FIGURE 32.37 The structure of a phorbol ester. Long- 
chain fatty acids predominate at the 12-position, whereas 
acetate is usually found at the 13-position. 


FIGURE 32.38 A ribbon diagram showing the struc- 
ture of protein tyrosine kinase pp60°** with bound 
AMP-PNP. (Image kindly provided by Stephen C. Harrison.) 


SH3 


PKC phosphorylates serine and threonine residues on a wide range of protein 
substrates. A role for protein kinase C in cellular growth and division is demon- 
strated through its strong activation by phorbol esters (Figure 32.37). These com- 
pounds, from the seeds of Croton tighum, are tumor promoters—agents that do not 
themselves cause tumorigenesis but that potentiate the effects of carcinogens. The 
phorbol esters mimic DAG, bind to the regulatory pseudosubstrate domain of the 
enzyme, and activate protein kinase C. 


Protein Tyrosine Kinase pp60°* Is Regulated 
by Phosphorylation/Dephosphorylation 


The structure of protein tyrosine kinase pp60" (see Figure 32.20) consists of an 
N-terminal “unique domain,” an SH2 domain, an SH3 domain, and a kinase do- 
main that includes a small lobe comprised mainly of a twisted 6-sheet and a large 
lobe that is predominantly a-helical (Figure 32.38; see also Section 32.2). Phospho- 
rylation of Tyr”? in the SH2 domain inhibits tyrosine kinase activity by drawing an 
“activation loop” into the active site, blocking ATP and/or substrate binding. De- 
phosphorylation of Tyr°?’ induces a conformation change that removes the activa- 
tion loop from the active site, permitting autophosphorylation of Tyr#!®, which stim- 
ulates tyrosine kinase activity. 


Protein Tyrosine Phosphatase SHP-2 Is a Nonreceptor 
Tyrosine Phosphatase 


The human phosphatase SHP-2 is a cytosolic nonreceptor tyrosine phosphatase. It 
comprises two SH2 domains, a catalytic phosphatase domain, and a C-terminal do- 
main. The SH2 domains enable the enzyme to bind to its target substrates, and they 
also regulate the phosphatase activity. The catalytic domain of SHP-2 consists of nine 
a-helices and a ten-stranded mixed -sheet that wraps around one of the helices (Fig- 
ure 32.39). The other eight helices pack together on the opposite side of the B-sheet. 

The N-terminal SH2 domain regulates phosphatase activity by binding to the 
phosphatase domain and directly blocking the active site. When a target peptide 
containing a phosphotyrosine group binds to the SH2 domain, a conformation 
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change causes this domain to dissociate from the catalytic domain, exposing the 
active site and allowing peptide substrate to bind. Binding of phosphotyrosine- 
containing peptide to the second SH2 domain provides additional activation. 
Target peptides with two phosphotyrosines (one to bind to each SH2 domain) 
provide maximal activation of the phosphatase activity. 


32:6) How Are Signaling Pathways Organized 
and Integrated? 


All signaling pathways are organized in time and space in the cell, they are care- 
fully regulated, and they are integrated with one another. Remarkably, these com- 
plex features of signaling depend on the simple concepts already covered in this 
chapter: PIDs (see Figures 32.5 and 32.6) modulate and control the association of 
signaling molecules with one another, often in large signalsomes; signaling mole- 
cules are switched on and off by covalent modifications such as phosphorylations; 
and signaling often involves amplification and cooperative effects. The multifac- 
eted behavior of GPCRs serves as a paradigm for the organization and integration 
of signaling pathways and is the focus of this section. 


GPCRs Can Signal Through G-Protein—Independent Pathways 


The classic GPCR signaling pathway involves coupling to heterotrimeric G proteins, 
but GPCRs can interact with other effector molecules as well. The cellular src kinase 
(page 1023) can be activated directly by the o-adrenergic receptor (Figure 32.40), 
leading to activation of a MAPK pathway (see A Deeper Look box, page 1034). Ac- 
tivated src phosphorylates Tyr®!” on the Shc adaptor protein, promoting binding by 
an SH2 domain of Grb2. Grb2 in turn binds to and activates Sosl, which activates 
Ras. Ras then activates a kinase cascade (Figure 32.40). 

The Janus protein kinase (JAK) and its associated transducers and activators of 
transcription (STAT) constitute the JAK/STAT signaling pathway (Figure 32.41). 
For example, Jana Stankova and her colleagues have shown that binding of the 
platelet-activating factor (see Figure 8.8b) to the platelet-activating factor receptor 
(a GPCR) directly stimulates phosphorylation of tyk2, a Janus kinase. The activa- 
tions of src (described earlier) and tyk2 raise the intriguing possibility that GPCR 
action may represent an alternate means of activating these signaling pathways (Fig- 
ures 32.40 and 32.41). 








(MAPKK) MEK1 ~<— Raf I (MAPKKK) 


| 


ERK1 (MAPK) 


AINN 


1037 





FIGURE 32.39 A ribbon diagram showing the structure 
of protein tyrosine phosphatase SHP-2 in its auto- 
inhibited, closed conformation. The N- and C-terminal 
SH2 domains are orange and purple, respectively. The 
catalytic domain is blue (pdb id = 2SHP). 


FIGURE 32.40 GPCRs can initiate cellular signaling path- 
ways without involvement of G proteins. Under certain 
conditions, binding to GPCRs can induce autophospho- 
rylation of the src kinase. Activated src can then bind Shc 
and phosphorylate Tyr?!”, which promotes binding of 
Grb2 and initiation of a Ras-dependent kinase cascade. 
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FIGURE 32.41 Platelet-activating factor receptor (a GPCR) 
can trigger autophosphorylation of the tyk2 kinase (a 
Janus protein kinase). Tyk2-induced phosphorylation of 
STAT essentially mimics the activation of the JAK/STAT 
signaling pathway by hormones such as erythropoietin 
(EPO). 
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G-Protein Signaling Is Modulated by RGS/GAPs 


The signal-transducing effects of G proteins persist as long as the bound GTP is 
not hydrolyzed. However, G proteins, such as Ras p21 and Gsa, are themselves weak 
GTPases. For example, Ras p21 hydrolyzes GTP with a rate constant of only 
0.02 min™!. If Ras p21 and G,, were efficient enzymes, the GTP-bound state would 
be short-lived and G-protein—mediated signaling would be ineffective. 

How can G proteins be switched off if they are inherently poor GTPases? The an- 
swer is provided by regulators of G-protein signaling (RGS), which act as GTPase- 
activating proteins (GAPs). RGS/GAPs elicit dramatic increases in GTPase activity 
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(b) 





when bound to G proteins. For example, RGS/GAPs increase the GTPase activity 
of Ras p21 by a factor of 100,000 and accelerate G,,-catalyzed GTP hydrolysis 
nearly 100-fold. RGS/GAPs induce conformation changes in the switch domains 
of their G protein targets and stabilize the transition state of the GTPase reaction 
(Figure 32.42). 

Interestingly, however, RGS proteins are more than GAPs. All RGS proteins con- 
tain several other signaling modules in addition to the 120-residue RGS module 
(Figure 32.43). These additional modules enable RGS proteins to bind to a variety 
of signaling proteins, to behave as effector molecules themselves, and to act as scaf- 
folding proteins in the formation of signalsome complexes. RGS proteins interact 
directly, for example, with adenylyl cyclase, phospholipase C-6, cGMP phospho- 
diesterase, guanylyl cyclase, Ca** channels, and potassium channels (Figure 32.44). 
These RGS-mediated interactions enable “cross-talk” between many signaling 
pathways. 


GPCR Desensitization Leads to New Signaling Pathways 


Activation of GPCRs by hormones and other extracellular signals leads to hetero- 
trimeric G protein binding, which triggers a variety of intracellular signals as shown. 
Importantly, the activated GPCR also binds two other classes of molecules: a family 
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CENGAGENOW” ANIMATED FIGURE 32.42 (a) A 
fragment of an RGS/GAP (brown) bound to Ras p21 
(blue). GAPs increase the GTPase activity of Ras p21 
by a factor of 100,000 (pdb id = 1WQ1). (b) An 
RGS/GAP (pink) bound to Gig (blue). GDP is shown 
in yellow (pdb id = 2IK8). See this figure ani- 
mated at www.cengage.com/login. 


FIGURE 32.43 The RGS proteins (mammalian cells 
contain more than 30) are classified in subfamilies. 
Representative members of six of those subfamilies 
are shown. In addition to the RGS module of 121 
residues, each RGS protein contains other motifs and 
modules that define its functionality. Many of those 
shown are discussed elsewhere in Chapter 32. 
(Adapted from Bansal, G., Druey, K. M., and Xie, Z., 2007. R4 RGS 
proteins: Regulation of G-protein signaling and beyond. 
Pharmacology and Therapeutics 116:473-495.) 
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FIGURE 32.44 Several interactions between RGS proteins and G-protein effectors. RGS proteins interact directly 
with adenylyl cyclase, phospholipase C-B, CGMP phosphodiesterase, and retinyl guanylyl cyclase, as well as 
with the potassium channel known as GIRK and certain calcium channels. (Adapted from Abramow-Newelly, M., 
Roy, A.A., Nunn, C., and Chidiac, P, 2006. RGS proteins have a signaling complex: Interactions between RGS proteins and GPCRs, 
effectors, and auxiliary proteins. Cellular Signaling 18:579-591.) 
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Whimsical Names for Proteins and Genes 


The study of cell signaling and the identification of hundreds of new 
signaling proteins provided an unprecedented creative opportunity 
for cell biologists and geneticists in the naming of these proteins. In 
the early days of molecular biology, such names were typically arcane 
abbreviations and acronyms. One such case is the family of 14-3-3 
proteins, named for the migration patterns of these proteins on 
DEAE-cellulose chromatography and starch-gel electrophoresis. In 
the 1970s, a few creative scientists suggested whimsical names for 
newly discovered genes, such as sevenless, named in reference to R7, 
one of the eight photoreceptor cells in the compound eye of 
Drosophila, the common fruit fly. What began as a trickle became a 
torrent of whimsical names for proteins and genes. Sevenless was fol- 
lowed by bride of sevenless (boss, a ligand of sevenless), and son of 
sevenless (sos, first isolated in a genetic screen of the sevenless 
receptor tyrosine kinase pathway in Drosophila). The hedgehog (hh) 
genes, including sonic hedgehog (Shh), play critical roles in the devel- 
opment and patterning of vertebrate embryonic tissues but were 
named for a popular video game. 

The accompanying table lists a few notable examples of whim- 
sically named genes and gene products, many of which were first 
identified in Drosophila. 











Name Role or Function 

Armadillo Plakoglobin = -catenin 

Bag of marbles Novel protein involved in oogenesis and 
spermatogenesis 

Bullwinkle Oocyte protein 

Cactus Signaling protein—IkB homolog 

Cheap date Alcohol sensitivity 

Chickadee Profilin homolog—regulation of actin 
cytoskeleton 

Corkscrew A protein tyrosine phosphatase 

Dachshund Novel nuclear protein of unknown function 

Dishevelled Novel cytoplasmic protein in the wingless 
pathway 

Dunce A cAMP phosphodiesterase 

Hopscotch A Janus family tyrosine kinase 

Naked A segment polarity gene 

Reaper A death-domain protein functioning in 
apoptosis 

Rutabaga A Ca?*/calmodulin-dependent protein 
kinase 

Shark A tyrosine kinase SH2-nonreceptor 

Yak Literally “yet another kinase” 
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of protein kinases known as known as GPCR kinases (GRKs) and a family of adap- 
tor and scaffolding proteins known as f-arrestins. On one level, these two protein 
families work together to desensitize the GPCRs, “arresting” the G-protein activation 
by GPCRs. On another level, the GRKs and f-arrestins act together as a molecular 
switch, directing GPCRs to a distinctly different role in cell signaling. GRK phos- 
phorylation of several sites on the C-terminal sequence of the GPCR promotes bind- 
ing of -arrestin (Figure 32.45) to form a signalsome assembly. Binding of -arrestin 
has two effects: The GPCR is no longer able to interact with and activate G pro- 
teins, and the GPCR is targeted for clathrin-mediated endocytosis (Figure 32.45). 
Following formation of the endosome, the GPCR-(-arrestin signalsome enters an 
arrestin signaling mode, recruiting a variety of catalytically active proteins, such as 
components of the Raf-MEK-ERK signaling cascade. Arrestin-mediated signaling 
continues until the GPCR returns to the plasma membrane, where it waits for a 
new extracellular signal. It is important to appreciate that G-protein signaling oc- 
curs within seconds of extracellular GPCR activation and typically lasts for 10 min- 
utes or less, whereas signaling by the GPCR-f-arrestin complex becomes maximal 
around 10 minutes and typically persists for 30 minutes. 


Receptor Responses Can Be Coordinated by Transactivation 


Hormone and signaling receptors and their pathways do not operate in isolation. 
Rather, one or more of the second messengers and effectors of a given pathway can 
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FIGURE 32.45 GPCRs have two signaling modes. Signal initiation by ligand binding first activates the G-protein- 
mediated signaling pathways at the plasma membrane. At the same time, G-protein receptor kinases (GRKs) 
begin to phosphorylate the receptor, creating high-affinity binding sites for arrestin. Arrestin binding uncouples 
the receptor from G proteins and targets the receptor for endocytosis. The GPCR-arrestin complex serves as a sig- 
nalsome scaffold, recruiting catalytically active proteins such as components of the Raf-MEK-ERK cascade. This ini- 
tiates a distinct set of signals from the endosome-bound signalsome. Endosomes are eventually processed and 
degraded, returning the GPCRs to the plasma membrane. (Adapted from Figure 1 of Gesty-Palmer, D, and Luttrell, L.M., 
2008. Heptahelical terpsichory.Who calls the tune? Journal of Receptors and Signal Transduction 28:39-58.) 
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(a) Transcriptional regulation of GPCR ligand synthesis 
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FIGURE 32.46 (a) An example of transactivation of cell signaling pathways. Binding of IGF-1 to RTK (in human 
breast cancer cells) initiates a signaling pathway that leads to an enhanced synthesis of RANTES (a peptide). 
RANTES is secreted into the extracellular medium, where it can bind a GPCR known as CCR5. CCR5 activation 
triggers G-protein-mediated signaling pathways. (b) Transactivation that is the reverse of that in panel a. 
Activation of the a@;-adrenergic receptor activates phospholipase C-B, which triggers metalloproteinase (MMP) 
activation. MPP cleaves a precursor of heparin-binding EGF-like growth factor (HB-EGF). Binding of HB-EGF to 
the EGF receptor initiates an RTK signaling pathway. 


often activate (or inhibit) another pathway. For example, binding of insulin-like 
growth factor 1 (IGF-1) to its own RTK signals enhanced synthesis of a peptide 
known as RANTES. RANTES is secreted into the extracellular medium, where it 
binds to a GPCR known as the CCR5 receptor (Figure 32.46a). As an extracellular 
signal, RANTES initiates a signaling pathway that induces cell migration, a part of 
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the cell’s response to inflammation caused by infection. This is an example of trans- 
activation of a GPCR by a RTK. 

The reverse can happen as well. GPCRs can transactivate RTKs by a variety of 
mechanisms. For example, in certain neurons in the hypothalamus, stimulation of 
q@,-adrenergic receptors triggers a G-protein—mediated pathway that activates a ma- 
trix metalloproteinase. Metalloproteinase action releases heparin-binding EGF-like 
growth factor (HB-EGF) in the extracellular matrix. Binding of HB-EGF to the EGF 
receptor initiates a classic RTK-activated signaling pathway (Figure 32.46b). 


Signals from Multiple Pathways Can Be Integrated 


A cell can be exposed simultaneously to multiple, potentially contradictory signals in 
the form of soluble hormones and ligands anchored to adjacent cells or the extracel- 
lular matrix. Cells must have mechanisms for sorting these various signals into a de- 
fined response. The Rsk] protein serine/threonine kinases exhibit such behavior, in- 
tegrating several signals to achieve full activation. Rskl has two protein kinase 
domains (Figure 32.47), of which the N-terminal domain phosphorylates downstream 
targets. This N-terminal kinase domain is controlled by multiple inputs, including 
from the C-terminal domain. The Erk MAPK binds to a docking site at the C-termi- 
nus of Rskl, phosphorylating sites in the linker region between the two kinase do- 
mains and in the C-terminal domain, all of which are essential for activation. Full 
activation, however, also requires phosphorylation of the N-terminal kinase domain 
by the PIP;-stimulated PDK1 protein kinase. Rsk] activation thus requires inputs from 
both the Erk MAPK pathway and the PIP; pathway (Figure 32.47). 


32/8 How Do Neurotransmission Pathways Control the 
Function of Sensory Systems? 


The survival of higher organisms is predicated on the ability to respond rapidly to sen- 
sory input from physical signals (sights, sounds) and chemical cues (smells). The re- 
sponses to such stimuli may include muscle movements and many forms of intercel- 
lular communication. Hormones (as described earlier in this chapter) can move 
through an organism only at speeds determined by the circulatory system. In most 
higher organisms, a faster means of communication is crucial. Nerve impulses, which 
can be propagated at speeds up to 100 m/sec, provide a means of intercellular signal- 
ing that is fast enough to encompass sensory recognition, movement, and other phys- 
iological functions and behaviors in higher animals. The generation and transmission 
of nerve impulses in vertebrates is mediated by an extremely complicated neural net- 
work that connects every part of the organism with the brain—itself an interconnected 
array of as many as 10! cells. 

Despite their complexity and diversity, the nervous systems of animals all possess 
common features and common mechanisms. Physical or chemical stimuli are rec- 
ognized by specialized receptor proteins in the membranes of excitable cells. Con- 
formational changes in the receptor protein result in a change in enzyme activity or 
a change in the permeability of the membrane. These changes are then propagated 
throughout the cell or from cell to cell in specific and reversible ways to carry infor- 
mation through the organism. This section describes the characteristics of excitable 
cells and the mechanisms by which these cells carry information at high speeds 
through an organism. 


Nerve Impulses Are Carried by Neurons 


Neurons and neuroglia (or glial cells) are cell types unique to nervous systems. The 
reception and transmission of nerve impulses are carried out by neurons (Figure 
32.48), whereas glial cells serve protective and supportive functions. (Neuroglia could 
be translated as “nerve glue.”) Glial cells differ from neurons in several ways. Glial 
cells do not possess axons or synapses, and they retain the ability to divide through- 
out their life spans. Glial cells outnumber neurons by at least 10 to 1 in most animals. 
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< FIGURE 32.48 The structure of a mammalian motor neuron.The nucleus and most other organelles are 
contained in the cell body. One long axon and many shorter dendrites project from the body. The dendrites 
receive signals from other neurons and conduct them to the cell body. The axon transmits signals from this 
cell to other cells via the synaptic knobs. Glial cells called Schwann cells envelop the axon in layers of an in- 
sulating myelin membrane. Although glial cells lie in proximity to neurons in most cases, no specific con- 
nections (such as gap junctions, for example) connect glial cells and neurons. However, gap junctions can 
exist between adjacent glial cells. 


Neurons are distinguished from other cell types by their long cytoplasmic exten- 
sions or projections, called processes. Most neurons consist of three distinct regions 
(see Figure 32.48): the cell body (called the soma), the axon, and the dendrites. 
The axon ends in small structures called synaptic terminals, synaptic knobs, or 
synaptic bulbs. Dendrites are short, highly branched structures emanating from the 
cell body that receive neural impulses and transmit them to the cell body. The space 
between a synaptic knob on one neuron and a dendrite ending of an adjacent neu- 
ron is the synapse or synaptic cleft. 

Three kinds of neurons are found in higher organisms: sensory neurons, inter- 
neurons, and motor neurons. Sensory neurons acquire sensory signals, either 
directly or from specific receptor cells, and pass this information along to either 
interneurons or motor neurons. Interneurons simply pass signals from one neuron 
to another, whereas motor neurons pass signals from other neurons to muscle cells, 
thereby inducing muscle movement (motor activity). 


lon Gradients Are the Source of Electrical Potentials in Neurons 


The impulses that are carried along axons, as signals pass from neuron to neuron, are 
electrical in nature. These electrical signals occur as transient changes in the electrical potential 
differences (voltages) across the membranes of neurons (and other cells). Such potentials are gen- 
erated by ion gradients. The cytoplasm of a neuron at rest is low in Na‘ and Cl and high 
in K*, relative to the extracellular fluid (Figure 32.49). These gradients are generated 
by the Na*,K*-ATPase (see Chapter 9). A resting neuron exhibits a potential differ- 
ence of approximately —60 mV (that is, negative inside). 


Action Potentials Carry the Neural Message 


Nerve impulses, also called action potentials, are transient changes in the membrane 
potential that move rapidly along nerve cells. Action potentials are created when the 
membrane is locally depolarized by approximately 20 mV—from the resting value of 
about —60 mV to a new value of approximately —40 mV. This small change is 
enough to have a dramatic effect on specific proteins in the axon membrane called 
voltage-gated ion channels. These proteins are ion channels that are specific either 
for Na* or K*. These ion channels are normally closed at the resting potential of 
—60 mV. When the potential difference rises to —40 mV, the “gates” of the Na” chan- 
nels are opened and Na‘ ions begin to flow into the cell. As Na* enters the cell, the 
membrane potential continues to increase and additional Na* channels are opened 
(Figure 32.49). The potential rises to more than +30 mV. At this point, Na* influx 
slows and stops. As the Na* channels close, K* channels begin to open and K* ions 
stream out of the cell, returning the membrane potential to negative values. The po- 
tential eventually overshoots its resting value a bit. At this point, K* channels close 
and the resting potential is eventually restored by action of the Na*,K*-ATPase and 
the other channels. Alan Hodgkin and Andrew Huxley originally observed these 
transient increases and decreases, first in Na* permeability and then in K* perme- 
ability. For this and related work, Hodgkin and Huxley, along with J. C. Eccles, won 
the Nobel Prize in Physiology or Medicine in 1963. 


The Action Potential Is Mediated by the Flow of Na* and K+ lons 


These changes in potential in one part of the axon are rapidly passed along the ax- 
onal membrane (Figure 32.50). The sodium ions that rush into the cell in one 
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FIGURE 32.49 (a) The concentrations of Nat, K+, and CIT ions inside 
and outside of a typical resting mammalian axon are shown. Assum- 
ing relative permeabilities for Kt, Nat, and CIT are 1, 0.04, and 0.45, (c) 
respectively, the Goldman equation yields a membrane potential of 
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—60 mV. (See problem 14, page 1058.) (b and c) The time depen- 
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CENGAGENOW” ACTIVE FIGURE 32.50 The propagation of action potentials along an axon. Figure 32.49 
shows the time dependence of an action potential at a discrete point on the axon. This figure shows how the 
membrane potential varies along the axon as an action potential is propagated. (For this reason, the shape of 
the action potential is the apparent reverse of that shown in Figure 32.49.) At the leading edge of the action 
potential, membrane depolarization causes Nat channels to open briefly. As the potential moves along the 
axon, the Na* channels close and Kt channels open, leading to a drop in potential and the onset of hyperpo- 
larization. When the resting potential is restored, another action potential can be initiated. Test yourself on 
the concepts in this figure at www.cengage.com/login. 


localized region actually diffuse farther along the axon, raising the Na* concentra- 
tion and depolarizing the membrane, causing Na* gates to open in that adjacent re- 
gion of the axon. In this way, the action potential moves down the axon in wavelike 
fashion. This simple process has several very dramatic properties: 


1. Action potentials propagate very rapidly—up to and sometimes exceeding 
100 m/sec. 
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FIGURE 32.51 Nat channels are infrequently and ran- 
domly distributed in unmyelinated nerve. In myelinated 
axons, Nat channels are clustered in large numbers in 
the nodes of Ranvier, between the regions surrounded 
by myelin sheath structures. 


2. The action potential is not attenuated (diminished in intensity) as a function of 
distance transmitted. 


The input of energy all the way along an axon—in the form of ion gradients main- 
tained by Na*,K*-ATPase—ensures that the shape and intensity of the action po- 
tential are maintained over long distances. The action potential has an all-or-none 
character. There are no gradations of amplitude; a given neuron is either at rest 
(with a polarized membrane) or is conducting a nerve impulse (with a reversed po- 
larization). Because nerve impulses display no variation in amplitude, the size of the 
action potential is not important in processing signals in the nervous system. In- 
stead, it is the number of action potential firings and the frequency of firing that carry specific 
information. 

The action potential is a delicately orchestrated interplay between the Na‘,K*- 
ATPase and the voltage-gated Na* and K* channels that is initiated by a stimulus 
at the postsynaptic membrane. The density and distribution of Na* channels along 
the axon are different for myelinated and unmyelinated axons (Figure 32.51). In 
unmyelinated axons, Na* channels are uniformly distributed, although they are 
few in number—approximately 20 channels per um. On the other hand, in myeli- 
nated axons, Na* channels are clustered at the nodes of Ranvier. In these latter re- 
gions, they occur with a density of approximately 10,000 per wm?. (Ion channel 
structure and function were discussed in Chapter 9.) 


Neurons Communicate at the Synapse 


How are neuronal signals passed from one neuron to the next? Neurons are juxta- 
posed at the synapse. The space between the two neurons is called the synaptic 
cleft. The number of synapses in which any given neuron is involved varies greatly. 
There may be as few as one synapse per postsynaptic cell (in the midbrain) to many 
thousands per cell. Typically, 10,000 synaptic knobs may impinge on a single spinal 
motor neuron, with 8000 on the dendrites and 2000 on the soma or cell body. The 
ratio of synapses to neurons in the human forebrain is approximately 40,000 to 1! 
Synapses are actually quite specialized structures, and several different types 
exist. A minority of synapses in mammals, termed electrical synapses, are characterized by a 
very small gap—approximately 2 nm—between the presynaptic cell (which delivers the sig- 
nal) and the postsynaptic cell (which receives the signal). At electrical synapses, the ar- 
rival of an action potential on the presynaptic membrane leads directly to depo- 
larization of the postsynaptic membrane, initiating a new action potential in the 
postsynaptic cell. However, most synaptic clefts are much wider—on the order of 
20 to 50 nm. In these, an action potential in the presynaptic membrane causes se- 
cretion of a chemical substance—called a neurotransmitter—by the presynaptic 
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cell. This substance binds to receptors on the postsynaptic cell, initiating a new ac- 
tion potential. Synapses of this type are thus chemical synapses. 

Different synapses utilize specific neurotransmitters. The cholinergic synapse, a 
paradigm for chemical transmission mechanisms at synapses, employs acetyl- 
choline as a neurotransmitter. Other important neurotransmitters and receptors 
have been discovered and characterized. These all fall into one of several major 
classes, including amino acids (and their derivatives), catecholamines, peptides, 
and gaseous neurotransmitters. Table 32.3 lists some, but not all, of the known 
neurotransmitters. 


Communication at Cholinergic Synapses Depends upon Acetylcholine 


In cholinergic synapses, small synaptic vesicles inside the synaptic knobs contain 
large amounts of acetylcholine (approximately 10,000 molecules per vesicle; Figure 
32.52). When the membrane of the synaptic knob is stimulated by an arriving ac- 
tion potential, special voltage-gated Ca?*+ channels open and Ca** ions stream into 
the synaptic knob, causing the acetylcholine-containing vesicles to attach to and 
fuse with the knob membrane. The vesicles open, spilling acetylcholine into the 
synaptic cleft. Binding of acetylcholine to specific acetylcholine receptors in the 
postsynaptic membrane causes opening of ion channels and the creation of a new 
action potential in the postsynaptic neuron. 

A variety of toxins can alter or affect this process. The anaerobic bacterium 
Clostridium botulinum, which causes botulism poisoning, produces several toxic pro- 
teins that strongly inhibit acetylcholine release. The black widow spider, Lactrodec- 
tus mactans, produces a venom protein, a@-latrotoxin, that stimulates abnormal re- 
lease of acetylcholine at the neuromuscular junction. The bite of the black widow 
causes pain, nausea, and mild paralysis of the diaphragm but is rarely fatal. 


There Are Two Classes of Acetylcholine Receptors 


Two different acetylcholine receptors are found in postsynaptic membranes. 
They were originally distinguished by their responses to muscarine, a toxic alka- 
loid in toadstools, and nicotine (Figure 32.53). The nicotinic receptors are cation 
channels in postsynaptic membranes, and the muscarinic receptors are trans- 
membrane proteins that interact with G proteins. The receptors in sympathetic 
ganglia and those in motor endplates of skeletal muscle are nicotinic receptors. 
Nicotine locks the ion channels of these receptors in their open conformation. 
Muscarinic receptors are found in smooth muscle and in glands. Muscarine mim- 
ics the effect of acetylcholine on these latter receptors. 

The nicotinic acetylcholine receptor is a 290-kD transmembrane glycoprotein con- 
sisting of a ring of four homologous subunits (a, 6, y, and ô) in the order ayapô (Fig- 
ure 32.54a). The receptor is shaped like an elongated (160 A) funnel, with a large 
extracellular ligand-binding domain, a membrane-spanning pore, and a smaller 
intracellular domain. Acetylcholine binds to the two a-subunits at sites that lie 40 A 
from the membrane surface. 


The Nicotinic Acetylcholine Receptor Is a Ligand-Gated lon Channel 


The nicotinic acetylcholine receptor functions as a ligand-gated ion channel, and on 
the basis of its structure, it is also an oligomeric ion channel. When acetylcholine (the 
ligand) binds to this receptor, a conformational change opens the channel, which is 
equally permeable to Na‘ and K*. Na‘ rushes in while K* streams out, but because 
the Na* gradient across this membrane is steeper than that of K+, the Na* influx 
greatly exceeds the K* efflux. The influx of Na‘ depolarizes the postsynaptic mem- 
brane, initiating an action potential in the adjacent membrane. After a few millisec- 
onds, the channel closes, even though acetylcholine remains bound to the receptor. 
At this point, the channel will remain closed until the concentration of acetylcholine 
in the synaptic cleft drops to about 10 nM. 
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Cholinergic Agents 
Acetylcholine 





Catecholamines 

Norepinephrine (noradrenaline) 
Epinephrine (adrenaline) 
L-Dopa 

Dopamine 

Octopamine 


Amino Acids (and Derivatives) 
y-Aminobutyric acid (GABA) 
Alanine 

Aspartate 

Cystathione 

Glycine 

Glutamate 

Histamine 

Proline 

Serotonin 

Taurine 


Tyrosine 


Peptide Neurotransmitters 
Cholecystokinin 
Enkephalins and endorphins 
Gastrin 

Gonadotropin 

Neurotensin 

Oxytocin 

Secretin 

Somatostatin 

Substance P 

Thyrotropin releasing factor 
Vasopressin 


Vasoactive intestinal peptide (VIP) 


Gaseous Neurotransmitters 
Carbon monoxide (CO) 
Nitric oxide (NO) 
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FIGURE 32.52 Cell-cell communication at the synapse (a) is mediated by neurotransmitters such as acetyl- 
choline, produced from choline by choline acetyltransferase. The arrival of an action potential at the synaptic 
knob (b) opens Ca?* channels in the presynaptic membrane. Influx of Ca?* induces the fusion of acetylcholine- 
containing vesicles with the plasma membrane and release of acetylcholine into the synaptic cleft (c). Binding 
of acetylcholine to receptors in the postsynaptic membrane opens Nat channels (d). The influx of Na* depolar- 
izes the postsynaptic membrane, generating a new action potential. 
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FIGURE 32.53 Two types of acetylcholine receptors are known. Nicotinic acetylcholine receptors are locked in 
their open conformation by nicotine. Obtained from tobacco plants, nicotine is named for Jean Nicot, French 
ambassador to Portugal, who sent tobacco seeds to France in 1550 for cultivation. Muscarinic acetylcholine re- 
ceptors are stimulated by muscarine, obtained from the intensely poisonous mushroom, Amanita muscaria. 


Acetylcholinesterase Degrades Acetylcholine in the Synaptic Cleft 


Following every synaptic signal transmission, the synapse must be readied for the 
arrival of another action potential. Several things must happen very quickly. First, 
the acetylcholine left in the synaptic cleft must be rapidly degraded to resensitize 
the acetylcholine receptor and to restore the excitability of the postsynaptic 
membrane. This reaction is catalyzed by acetylcholinesterase (Figure 32.55). 
When [acetylcholine] has decreased to low levels, acetylcholine dissociates from 
the receptor, which thereby regains its ability to open in a ligand-dependent man- 
ner. Second, the synaptic vesicles must be reformed from the presynaptic mem- 
brane by endocytosis (Figure 32.56) and then must be restocked with acetylcholine. 
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A DEEPER LOOK 
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Tetrodotoxin and saxitoxin are highly specific blockers of Na* 
channels and bind with very high affinity (Kp <1 nM). This 
unique specificity and affinity have made it possible to use radio- 
active forms of these toxins to purify Na* channels and map their 
distribution on axons. Tetrodotoxin is found in the skin and several 
internal organs of puffer fish, also known as blowfish or swellfish, 
members of the family Tetraodontidae, which react to danger by in- 
flating themselves with water or air to nearly spherical (and often 
comical) shapes (see accompanying figure). Although tetrodo- 
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Tetrodotoxin and Saxitoxin Are Na* Channel Toxins 


toxin poisoning can easily be fatal, puffer fish are delicacies in 
Japan, where they are served in a dish called fugu. For this purpose, 
the puffer fish must be cleaned and prepared by specially trained 
chefs. Saxitoxin is made by Gonyaulax catenella and G. tamarensis, 
two species of marine dinoflagellates or plankton that are respon- 
sible for “red tides” that cause massive fish kills. Saxitoxin is con- 
centrated by certain species of mussels, scallops, and other shellfish 
that are exposed to red tides. Consumption of these shellfish by an- 
imals, including humans, can be fatal. 


© Stephen Frink/CORBIS 


< (a) Tetrodotoxin is found in puffer fish, which are prepared and served in 
Japan as fugu. The puffer fish on the left is unexpanded; the one on the right 
is inflated. (b) Structures of tetrodotoxin and saxitoxin. 
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FIGURE 32.54 The nicotinic acetylcholine receptor is an 
elongated funnel constructed from homologous sub- 
units named a, B, y, and 6. The pentameric channel 
includes two copies of the a-subunit. The extracellular 
domain of each subunit is a B-barrel, whereas the trans- 
membrane and intracellular domains are a-helical. 

(a) Top view; (b) side view (pdb id = 2BG9). 
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FIGURE 32.55 Acetylcholine is degraded to acetate and choline by acetylcholinesterase, a serine protease. 


This occurs through the action of an ATP-driven Ht pump and an acetylcholine 
transport protein. The H* pump in this case is a member of the family of V-type 
ATPases. It uses the free energy of ATP hydrolysis to create an H* gradient across 
the vesicle membrane. This gradient is used by the acetylcholine transport protein 
to drive acetylcholine into the vesicle, as shown in Figure 32.56. 

Antagonists of the nicotinic acetylcholine receptor are particularly potent neuro- 
toxins. These agents, which bind to the receptor and prevent opening of the ion 
channel, include d-tubocurarine, the active agent in the South American arrow poi- 
son curare, and several small proteins from poisonous snakes. These latter agents in- 
clude cobratoxin from cobra venom, and a-bungarotoxin, from Bungarus multicinctus, 
a snake common in Taiwan (Figure 32.57). 


Muscarinic Receptor Function Is Mediated by G Proteins 


There are several different types of muscarinic acetylcholine receptors, with differ- 
ent structures and different apparent functions in synaptic transmission. However, 
certain structural and functional features are shared by this class of receptors. Mus- 
carinic receptors are 70-kD glycoproteins and are members of the GPCR family. 


<“ CENGAGENOW” ANIMATED FIGURE 32.56 Following a synaptic transmission event, acetylcholine is repack- 
aged in vesicles in a multistep process. Synaptic vesicles are formed by endocytosis, and acetylcholine is syn- 
thesized by choline acetyltransferase. A proton gradient is established across the vesicle membrane by an 
H*-transport ATPase, and a proton-acetylcholine transport protein transports acetylcholine into the synaptic 
vesicles, exchanging acetylcholine for protons in an electrically neutral antiport process. See this figure ani- 
mated at www.cengage.com/login. 
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FIGURE 32.57 Tubocurarine, obtained from the plant Chondrodendron tomentosum, is the active agent in “tube 
curare,” named for the bamboo tubes in which it is kept by South American tribal hunters. Atropine is pro- 
duced by Atropa belladonna, the poisonous deadly nightshade. The species name, which means “beautiful 
woman," derives from the use of atropine in years past by Italian women to dilate their pupils. Atropine is still 
used for pupil dilation in eye exams by ophthalmologists. Cobratoxin and a-bungarotoxin are produced by the 
cobra (Naja naja) and the banded krait snake (Bungarus multicinctus), respectively. 


Activation of muscarinic receptors (by binding of acetylcholine) results in several 
G-protein—mediated effects, including the inhibition of adenylyl cyclase, the stimu- 
lation of phospholipase C, and the opening of K* channels. Many antagonists for 
muscarinic acetylcholine receptors are known, including atropine from Atropa bella- 
donna, the deadly nightshade plant whose berries are sweet and tasty but highly poi- 
sonous (Figure 32.57). 

Both the nicotinic and muscarinic acetylcholine receptors are sensitive to certain 
agents that inactivate acetylcholinesterase itself. Acetylcholinesterase is a serine 
esterase similar to trypsin and chymotrypsin (see Chapter 14). The reactive serine 
at the active site of such enzymes is a vulnerable target for organophosphorus in- 
hibitors (Figure 32.58). DIPF and related agents form stable covalent complexes 





FIGURE 32.58 Covalent inhibitors of acetylcholinesterase include DIFP, the nerve gases tabun and sarin, and 
the insecticide malathion. 
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FIGURE 32.59 NMDA receptors assemble as tetramers, 
with two NR1 subunits and two NR2 subunits. (For 
clarity, only one of the NR1-NR2 pairs is shown.) The 
extracellular portion of each subunit consists of an 
N-terminal domain (NTD) and an agonist-binding 
domain (ABD). Red lines indicate that stabilizing interac- 
tions occur between these domains. NMDA receptors 
are Na* and Ca** channels. They are stimulated by 
NMDA, inhibited by phencyclidine, and regulated by 
Zn?* and glycine. 


with the active-site serine, irreversibly blocking the enzyme. Malathion is a com- 
monly used insecticide, and sarin and tabun are nerve gases used in chemical war- 
fare. All these agents effectively block nerve impulses, stop breathing, and cause 
death by suffocation. 


Other Neurotransmitters Can Act Within Synaptic Junctions 


Synaptic junctions that use amino acids, catecholamines, and peptides (see Table 
32.3) appear to operate much the way the cholinergic synapses do. Presynaptic 
vesicles release their contents into the synaptic cleft, where the neurotransmitter 
substance can bind to specific receptors on the postsynaptic membrane to induce 
a conformational change and elicit a particular response. Some of these neuro- 
transmitters are excitatory in nature and stimulate postsynaptic neurons to trans- 
mit impulses, whereas others are inhibitory and prevent the postsynaptic neuron 
from carrying other signals. Just as acetylcholine acts on both nicotinic and mus- 
carinic receptors, so most of the known neurotransmitters act on several (and in 
some cases, many) different kinds of receptors. Biochemists are just beginning to 
understand the sophistication and complexity of neuronal signal transmission. 


Glutamate and Aspartate Are Excitatory Amino Acid Neurotransmitters 


The common amino acids glutamate and aspartate act as neurotransmitters. Like 
acetylcholine, glutamate and aspartate are excitatory and stimulate receptors on the 
postsynaptic membrane to transmit a nerve impulse. No enzymes that degrade 
glutamate exist in the extracellular space, so glutamate must be cleared by the high- 
affinity presynaptic and glial transporters—a process called reuptake. 

At least five subclasses of glutamate receptors are known. The best understood of 
these excitatory receptors is the N-methyl-D-aspartate (NMDA) receptor, a ligand- 
gated channel that, when open, allows Ca** and Na‘ to flow into the cell and K* to 
flow out of the cell. Phencyclidine (PCP) is a specific antagonist of the NMDA re- 
ceptor (Figure 32.59). Phencyclidine was once used as an anesthetic agent, but le- 
gitimate human use was quickly discontinued when it was found to be responsible 
for bizarre psychotic reactions and behavior in its users. Since this time, PCP has 
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(a) (b) 


FIGURE 32.60 GABA (y-aminobutyric acid) and 
glycine are inhibitory neurotransmitters that activate 
chloride channels. Influx of CI7 causes a hyperpolar- 
ization of the postsynaptic membrane. GABA recep- 
tors are similar in many respects to nicotinic acetyl- 
choline receptors, with an a2By6 stoichiometry. 

(a) Top view; (b) side view. (Images courtesy of Donald 
Weaver, University of Nova Scotia, and Valerie Campagna- 





Slater, University of Toronto.) 
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FIGURE 32.61 Glutamate is converted to GABA by glutamate decarboxylase. GABA is degraded by the action 
of GABA-glutamate transaminase and succinate semialdehyde dehydrogenase to produce succinate. 


been used illegally as a hallucinogenic drug, under the street name of angel dust. 
Sadly, it has caused many serious, long-term psychological problems in its users. 


y-Aminobutyric Acid and Glycine Are Inhibitory Neurotransmitters 


Certain neurotransmitters, acting through their conjugate postsynaptic receptors, 
inhibit the postsynaptic neuron from propagating nerve impulses from other neu- 
rons. Two such inhibitory neurotransmitters are y-aminobutyric acid (GABA) and 
glycine. These agents make postsynaptic membranes permeable to chloride ions 
and cause a net influx of Cl”, which in turn causes hyperpolarization of the post- 
synaptic membrane (making the membrane potential more negative). Hyperpolar- 
ization of a neuron effectively raises the threshold for the onset of action potentials 
in that neuron, making the neuron resistant to stimulation by excitatory neuro- 
transmitters. These effects are mediated by the GABA and glycine receptors, which 
are ligand-gated chloride channels (Figure 32.60). GABA is derived by a decar- 
boxylation of glutamate (Figure 32.61) and appears to operate mainly in the brain, 
whereas glycine acts primarily in the spinal cord. The glycine receptor has a specific 
affinity for the convulsive alkaloid strychnine (Figure 32.62). The effects of ethanol FIGURE 32.62 Glycine receptors are distinguished by 
on the brain arise in part from the opening of GABA receptor Cl” channels. their unique affinity for strychnine. 





Strychnine 
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HUMAN BIOCHEMISTRY 


The Biochemistry of Neurological Disorders 


Defects in catecholamine processing are responsible for the symp- 
toms of many neurological disorders, including clinical depression 
(which involves norepinephrine [NE]) and parkinsonism (involv- 
ing dopamine [DA]). Once these neurotransmitters have bound to 
and elicited responses from postsynaptic membranes, they must be 
efficiently cleared from the synaptic cleft (see accompanying figure, 
part a). Clearing can occur by several mechanisms. NE and DA 
transport or reuptake proteins exist both in the presynaptic mem- 
brane and in nearby glial cell membranes. On the other hand, cat- 
echolamine neurotransmitters can be metabolized and inactivated 
by two enzymes: catechol-O-methyl-transferase in the synaptic cleft 
and monoamine oxidase in the mitochondria (see figure, part b). 
Catecholamines transported back into the presynaptic neuron are 
accumulated in synaptic vesicles by the same H*-ATPase/H*-ligand 
exchange mechanism described for glutamate. Clinical depression 
has been treated by two different strategies. Monoamine oxidase 
inhibitors act as antidepressants by increasing levels of cate- 
cholamines in the brain. Another class of antidepressants, the tri- 
cyclics, such as desipramine (see figure, part c), act on several 
classes of neurotransmitter reuptake transporters and facilitate 


(a) 


Norepinephrine © 


> (a) The pathway for reuptake and vesicular 
repackaging of the catecholamine neurotransmit- 
ters. The sites of action of desipramine, tranyl- 
cypromine, and reserpine are indicated. 





Mitochondrion 


more prolonged stimulation of postsynaptic receptors. Prozac is a 
more specific reuptake inhibitor and acts only on serotonin reup- 
take transporters. 

Parkinsonism is characterized by degeneration of dopaminergic 
neurons, as well as consequent overproduction of postsynaptic 
dopamine receptors. In recent years, Parkinson’s patients have been 
treated with dopamine agonists such as bromocriptine (see figure, 
part d) to counter the degeneration of dopamine neurons. 

Catecholaminergic neurons are involved in many other interest- 
ing pharmacological phenomena. For example, reserpine (see fig- 
ure, part e), an alkaloid from a climbing shrub of India, is a power- 
ful sedative that depletes the level of brain monoamines by 
inhibiting the H*-monoamine exchange protein in the membranes 
of synaptic vesicles. 

Cocaine (see figure, part f), a highly addictive drug, binds with 
high affinity and specificity to reuptake transporters for the mono- 
amine neurotransmitters in presynaptic membranes. Thus, at least 
one of the pharmacological effects of cocaine is to prolong the 
synaptic effects of these neurotransmitters. 
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A (b) Norepinephrine can be degraded in the synaptic cleft by catechol-O-methyltransferase 
or in the mitochondria of presynaptic neurons by monoamine oxidase. (c) The structures of 
tranylcypromine, desipramine, and Prozac. (d) The structure of bromocriptine. (e) The struc- 
ture of reserpine. (f) The structure of cocaine. 
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< FIGURE 32.63 The pathway for the synthesis of catecholamine neurotransmitters. Dopa, dopamine, noradren- 
aline,and adrenaline are synthesized sequentially from tyrosine. 


The Catecholamine Neurotransmitters Are Derived from Tyrosine 


Epinephrine, norepinephrine, dopamine, and L-dopa are collectively known as the 
catecholamine neurotransmitters. These compounds are synthesized from tyrosine 
(Figure 32.63), both in sympathetic neurons and in the adrenal glands. They func- 
tion as neurotransmitters in the brain and as hormones in the circulatory system. 
However, these two pools operate independently, thanks to the blood-brain barrier, 
which permits only very hydrophobic species in the circulatory system to cross 
over into the brain. Hydroxylation of tyrosine (by tyrosine hydroxylase) to form 
3,4-dihydroxyphenylalanine (L-dopa) is the rate-limiting step in this pathway. Dopa- 
mine, a crucial catecholamine involved in several neurological diseases, is synthe- 
sized from L-dopa by a pyridoxal phosphate-dependent enzyme, dopa decarboxy- 
lase. Subsequent hydroxylation and methylation produce norepinephrine and 
epinephrine (Figure 32.63). The methyl group in the final reaction is supplied by 
S-adenosylmethionine. 

Each of these catecholamine neurotransmitters is known to play a unique role in 
synaptic transmission. The neurotransmitter in junctions between sympathetic 
nerves and smooth muscle is norepinephrine. On the other hand, dopamine is in- 
volved in other processes. Either excessive brain production of dopamine or hy- 
persensitivity of dopamine receptors is responsible for psychotic symptoms and 
schizophrenia, whereas lowered production of dopamine and the loss of dopamine 
neurons are important factors in Parkinson’s disease. 


Various Peptides Also Act as Neurotransmitters 


Many relatively small peptides have been shown to possess neurotransmitter activity 
(see Table 32.3). One of the challenges of this field is that the known neuropeptides 
may represent a very small subset of the neuropeptides that exist. Another chal- 
lenge arises from the small in vivo concentrations of these agents and the small 
number of receptors that are present in neural tissue. Physiological roles for most 
of these peptides are complex. For example, the endorphins and enkephalins are 
natural opioid substances and potent pain relievers. The endothelins are a family of 
homologous regulatory peptides, synthesized by certain endothelial and epithelial 
cells that act on nearby smooth muscle and connective tissue cells. They induce or 
affect smooth muscle contraction; vasoconstriction; heart, lung, and kidney func- 
tion; and mitogenesis and tissue remodeling. Vasoactive intestinal peptide (VIP) 
produces a G protein—adenylyl cyclase—mediated increase in cAMP, which in turn 
triggers a variety of protein phosphorylation cascades, one of which leads to con- 
version of phosphorylase b to phosphorylase a, stimulating glycogenolysis. More- 
over, VIP has synergistic effects with other neurotransmitters, such as norepineph- 
rine. In addition to increasing cAMP levels through f-adrenergic receptors, 
norepinephrine acting at aj-adrenergic receptors markedly stimulates the increases 
in cAMP elicited by VIP. Many other effects have also been observed. For example, 
injection of VIP increases rapid eye movement (REM) sleep and decreases waking 
time in rats. VIP receptors exist in regions of the central nervous system involved in 
sleep modulation. 


SUMMARY 


32.1 What AreHormones? Many different chemical species act as hor- 
mones. Steroid hormones, all derived from cholesterol, regulate metab- 
olism, salt and water balances, inflammatory processes, and sexual func- 
tion. Several hormones are amino acid derivatives. Among these are 
epinephrine and norepinephrine (which regulate smooth muscle con- 


traction and relaxation, blood pressure, cardiac rate, and the processes 
of lipolysis and glycogenolysis) and the thyroid hormones (which stimu- 
late metabolism). Peptide hormones are a large group of hormones that 
appear to regulate processes in all body tissues, including the release of 
yet other hormones. Hormones and other signal molecules in biological 


systems bind with very high affinities to their receptors, displaying Kp val- 
ues in the range of 107? to 1078 M. Hormones are produced at concen- 
trations equivalent to or slightly above these Kp values. Once hormonal 
effects have been induced, the hormone is usually rapidly metabolized. 


32.2 What Is Signal Transduction? Hormonal regulation depends 
upon the transduction of the hormonal signal across the plasma mem- 
brane to specific intracellular sites, particularly the nucleus. Signal trans- 
duction pathways consist of a stepwise progression of signaling stages: 
receptor—transducer—effector. The receptor perceives the signal, trans- 
ducers relay the signal, and the effectors convert the signal into an intra- 
cellular response. Often, effector action involves a series of steps, each of 
which is mediated by an enzyme, and each of these enzymes can be con- 
sidered as an amplifier in a pathway connecting the hormonal signal to 
its intracellular targets. 


32.3 How Do Signal-Transducing Receptors Respond to the Hormonal 
Message? Steroid hormones may either bind to plasma membrane 
receptors or exert their effects within target cells, entering the cell and 
migrating to their sites of action via specific cytoplasmic receptor pro- 
teins. The nonsteroid hormones, which act by binding to outward- 
facing plasma membrane receptors, activate signal transduction path- 
ways that mobilize various second messengers—cyclic nucleotides, Ca2* 
ions, and other substances—that activate or inhibit enzymes or cas- 
cades of enzymes in very specific ways. 


32.4 How Are Receptor Signals Transduced? Receptor signals are 
transduced in one of three ways, to initiate actions inside the cell: 
1. Exchange of GDP for GTP on GTP-binding proteins (G proteins), 
which in turn leads to generation of second messengers, including 
2+ 


cAMP, phospholipid breakdown products, and Ca-*. 

2. Receptor-mediated activation of phosphorylation cascades that in 
turn trigger activation of various enzymes. 

3. Conformation changes that open ion channels or recruit proteins 
into nuclear transcription complexes. 


Problems 1057 


32.5 How Do Effectors Convert the Signals to Actions in the Cell? 
Transduction of the hormonal signal leads to activation of effectors— 
usually protein kinases and protein phosphatases—that elicit a variety 
of actions that regulate discrete cellular functions. Of the thousands 
of mammalian kinases and phosphatases, the structures and functions 
of a few are representative, including protein kinase A (PKA), protein 
kinase C (PKC), and protein tyrosine phosphatase SHP-2. 


32.6 How Are Signaling Pathways Organized and Integrated? All sig- 
naling pathways are organized in time and space in the cell, they are care- 
fully regulated, and they are integrated with one another. PIDs modulate 
and control the association of signaling molecules with one another, of- 
ten in large signalsomes; signaling molecules are switched on and off by 
covalent modifications such as phosphorylations; and signaling often in- 
volves amplification and cooperative effects. GPCR signaling can occur 
through G-protein-independent pathways and is sometimes modulated by 
RGS/GAPs. Responses of signaling receptors can be coordinated by trans- 
activation, and signals from multiple pathways can be integrated. 


32.7 How Do Neurotransmission Pathways Control the Function of 
Sensory Systems? Nerve impulses, which can be propagated at speeds 
up to 100 m/sec, provide a means of intercellular signaling that is fast 
enough to encompass sensory recognition, movement, and other phys- 
iological functions and behaviors in higher animals. The generation 
and transmission of nerve impulses in vertebrates is mediated by an in- 
credibly complicated neural network that connects every part of the or- 
ganism with the brain—itself an interconnected array of as many as 10'* 
cells. Despite their complexity and diversity, the nervous systems of ani- 
mals all possess common features and common mechanisms. Physical or 
chemical stimuli are recognized by specialized receptor proteins in the 
membranes of excitable cells. Conformational changes in the receptor 
protein result in a change in enzyme activity or a change in the perme- 
ability of the membrane. These changes are then propagated through- 
out the cell or from cell to cell in specific and reversible ways to carry in- 
formation through the organism. 


PROBLEMS 


CENGAGENOW” Preparing for an exam? Create your own study path for this 
chapter at www.cengage.com/login. 


1. Compare and contrast the features and physiological advantages 
of each of the major classes of hormones, including the steroid 
hormones, polypeptide hormones, and the amino acid—derived 
hormones. 

2. Compare and contrast the features and physiological advantages of 
each of the known classes of second messengers. 

3. Nitric oxide may be merely the first of a new class of gaseous second 
messenger/neurotransmitter molecules. Based on your knowledge 
of the molecular action of nitric oxide, suggest another gaseous mol- 
ecule that might act as a second messenger and propose a molecular 
function for it. 

4. Herbimycin A is an antibiotic that inhibits tyrosine kinase activity by 
binding to SH groups of cysteine in the src gene tyrosine kinase and 
other similar tyrosine kinases. What effect might it have on normal 
rat kidney cells that have been transformed by Rous sarcoma virus? 
Can you think of other effects you might expect for this interesting 
antibiotic? 

5. Monoclonal antibodies that recognize phosphotyrosine are com- 
mercially available. How could such an antibody be used in studies 
of cell signaling pathways and mechanisms? 

6. Explain and comment on this statement: The main function of hor- 
mone receptors is that of signal amplification. 

7. Synaptic vesicles are approximately 40 nm in outside diameter, and 
each vesicle contains about 10,000 acetylcholine molecules. Calcu- 
late the concentration of acetylcholine in a synaptic vesicle. 

8. GTPyS is a nonhydrolyzable analog of GTP. Experiments with squid 
giant axon synapses reveal that injection of GTyS into the pre- 


synaptic end (terminal) of the neuron inhibits neurotransmitter re- 
lease (slowly and irreversibly). The calcium signals produced by 
presynaptic action potentials and the number of synaptic vesicles 
docking on the presynaptic membrane are unchanged by GTPyS. 
Propose a model for neurotransmitter release that accounts for all 
of these observations. 

9. A typical hormone binds to its receptor with an affinity (Kp) of ap- 
proximately 1 X 10~° M. Consider an in vitro (test-tube) system in 
which the total hormone concentration is approximately 1 nM and 
the total concentration of receptor sites is 0.1 nM. What fraction of 
the receptor sites is bound with hormone? If the concentration of 
receptors is decreased to 0.033 nM, what fraction of receptor is 
bound with the hormone? 

10. (Integrates with Chapter 24.) All steroid hormones are synthesized 
in the human body from cholesterol. What is the consequence for 
steroid hormones and their action from taking a “statin” drug, such 
as Zocor, which blocks the synthesis of cholesterol in the body? Are 
steroid hormone functions compromised by statin action? 

11. Given that 6-strands provide a more genetically economical way 
for a polypeptide to cross a membrane, why has nature chosen 
a-helices as the membrane-spanning motif for G-protein—coupled 
receptors? That is, what other reasons can you imagine to justify 
the use of a-helices? 

12. Write simple reaction mechanisms for the formation of cAMP from 
ATP by adenylyl cyclase and for the breakdown of cAMP to 5’-AMP 
by phosphodiesterases. 

13. (Integrates with Chapter 9.) Consider the data in Figure 32.49a. 
Recast Equation 9.2 to derive a form from which you could cal- 
culate the equilibrium electrochemical potential at which no net 
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flow of potassium would occur. This is the Nernst equation. Cal- 
culate the equilibrium potential for K* and also for Na‘, assum- 
ing T= 37°C. 

14. The calculation of the actual transmembrane potential difference 
for a neuron is accomplished with the Goldman equation: 


Aw = RT r 2 PolC]outside + ZPA [A] inside 
F È PolC]inside + SPALA] outside 





where [C] and [A] are the cation and anion concentrations, re- 
spectively, and Po and P, are the respective permeability coefficients 
of cations and anions. 

Assume relative permeabilities for K*, Na*, and Cl of 1, 0.04, 
and 0.45, respectively, and use this equation to calculate the actual 
transmembrane potential difference for the neuron whose ionic 
concentrations are those given in Figure 32.49a. 

15. Use the information in problems 13 and 14, together with Fig- 
ure 32.50, to discuss the behavior of potassium, sodium, and 
chloride ions as an action potential propagates along an axon. 

16. Review the cell signaling pathway shown in Figure 32.4. With the 
rest of the chapter as context, discuss all the steps of this pathway 
that involve signal amplification. 

17. In the cell signaling pathway shown in Figure 32.4, what would be 
the effect if Ras were mutated so that it had no GTPase activity? 


18. One of the topics discussed in this chapter is the ability of GPCRs to 
exert signaling effects without the involvement of G proteins. Using 


the pathway shown in Figure 32.40, and considering everything you 
have learned in this chapter, suggest some reasons that would ex- 
plain why this G-protein—independent signaling was difficult to ver- 
ify experimentally. 


Preparing for the MCAT Exam 


19. Malathion (Figure 32.58) is one of the secrets behind the near- 
complete eradication of the boll weevil from cotton fields in the 
United States. For most of the 20th century, boll weevils wreaked 
havoc on the economy of states from Texas to the Carolinas. When 
boll weevils attacked cotton fields in a farming community, the de- 
struction of cotton plants meant loss of jobs for farm workers, 
bankruptcies for farm owners, and resulting hardship for the en- 
tire community. Relentless application of malathion to cotton 
crops and fields has turned the tide, however, and agriculture ex- 
perts expect that boll weevils will be completely gone from cotton 
fields within a few years. Remarkably, malathion-resistant boll wee- 
vils have not emerged despite years of this pesticide’s use. Consider 
the structure and chemistry of malathion and suggest what you 
would expect to be the ecological consequences of chronic 
malathion application to cotton fields. 


20. Consult the excellent review article “Assembly of Cell Regulatory Sys- 
tems Through Protein Interaction Domains” (Science 300:445—452, 
2003, by Pawson and Nash) and discuss the structural requirements 
for a regulatory protein operating in a signaling network. 
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Abbreviated Answers to Problems 





For detailed answers to the end-of-chapter problems as well as addi- 
tional problems to solve, see The Student Solutions Manual, Study 
Guide and Problems Book by David Jemiolo and Steven Theg that 
accompanies this textbook. 


Chapter 1 


1. Because bacteria (compared with humans) have simple nutri- 
tional requirements, their cells obviously contain enzyme 
systems that allow them to convert rudimentary precursors 
(even inorganic substances such as NH,*, NOs, No, and COo) 
into complex biomolecules—proteins, nucleic acids, polysaccha- 
rides, and complex lipids. On the other hand, animals have an 
assortment of different cell types designed for specific physiolog- 
ical functions; these cells possess a correspondingly greater 
repertoire of complex biomolecules to accomplish their intri- 
cate physiology. 

2. Consult Figures 1.20 to 1.22 to confirm your answer. 

3. a. Laid end to end, 250 E. coli cells would span the head of a 

pin. 

b. The volume of an E. coli cell is about 107" L. 

c. The surface area of an E. coli cell is about 6.3 X 107? m°. Its 
surface-to-volume ratio is 6.3 X 10° m7!. 

d. 600,000 molecules. 

1.7 nM. 

f. Because we can calculate the volume of one ribosome to be 
4.2 X 107%% m? (or 4.2 X 107% L), 15,000 ribosomes would 
occupy 6.3 X 1071 L, or 6.3% of the total cell volume. 

g. Because the E. coli chromosome contains 4600 kilobase pairs 
(4.6 X 10° bp) of DNA, its total length would be 1.6 mm— 
approximately 800 times the length of an £. coli cell. This 
DNA would encode 4300 different proteins, each 360 amino 
acids long. 


P 


4. a. The volume of a single mitochondrion is about 4.2 X 107!° L 
(about 40% the volume calculated for an E. coli cell in 
problem 3). 

b. A mitochondrion would contain on average fewer than eight 
molecules of oxaloacetate. 


5. a. Laid end to end, 25 liver cells would span the head of a pin. 

b. The volume of a liver cell is about 8 X 107!? L (8000 times 
the volume of an £. coli cell). 

c. The surface area of a liver cell is 2.4 X 107° m?; its surface-to- 
volume ratio is 3 X 10° m™!, or about 0.05 (1/20) that of an 
E. coli cell. Cells with lower surface-to-volume ratios are 
limited in their exchange of materials with the environment. 

d. The number of base pairs in the DNA of a liver cell is 6 X 
10° bp, which would amount to a total DNA length of 2 m 
(or 6 feet of DNA!) contained within a cell that is only 


6. 


10. 


Ti; 


12; 


13. 


20 um on a side. Maximal information content of liver-cell 
DNA = 3 X 109 bp, which, expressed in proteins 400 amino 
acids in length, could encode 2.5 X 10° proteins. 


The amino acid side chains of proteins provide a range of 
shapes, polarity, and chemical features that allow a protein to be 
tailored to fit almost any possible molecular surface in a comple- 
mentary way. 


. Biopolymers may be informational molecules because they are 


constructed of different monomeric units (“letters”) joined 
head to tail in a particular order (“words, sentences”). Polysac- 
charides are often linear polymers composed of only one (or 
two repeating) monosaccharide unit(s) and thus display little 
information content. Polysaccharides with a variety of monosac- 
charide units may convey information through specific recogni- 
tion by other biomolecules. Also, most monosaccharide units 
are typically capable of forming branched polysaccharide struc- 
tures that are potentially very rich in information content (as in 
cell surface molecules that act as the unique labels displayed by 
different cell types in multicellular organisms). 


. Molecular recognition is based on structural complementarity. If 


complementary interactions involved covalent bonds (strong 
forces), stable structures would be formed that would be less re- 
sponsive to the continually changing dynamic interactions that 
characterize living processes. 


. Two carbon atoms interacting through van der Waals forces are 


0.34 nm apart; two carbon atoms joined in a covalent bond are 
0.154 nm apart. 


Slight changes in temperature, pH, ionic concentrations, and so 
forth may be sufficient to disrupt weak forces (H bonds, ionic 
bonds, van der Waals interactions, hydrophobic interactions). 


Living systems are maintained by a continuous flow of matter 
and energy through them. Despite the ongoing transformations 
of matter and energy by these highly organized, dynamic 
systems, no overt changes seem to occur in them: They are in a 
steady state. 


The fraction of the M. genitaliwm genes encoding proteins = 
0.925. Genes not encoding proteins encode RNA molecules. 
(0.925) (580,074 base pairs) = 536,820 base pairs devoted to 
protein-coding genes. Since 3 base pairs specify an amino 

acid in a protein, 369 amino acids are found in the average 

M. genitalium protein. If each amino acid contributes on average 
120 D to the mass of a protein, the mass of an average 

M. genitalium protein is 44,280 D. 


(0.925) (206) = 191 proteins. Assuming its genes are the same 
size as M. genitalium, the minimal genome would be 228,480 
base pairs. 
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14. Given 1109 nucleotides (or base pairs) per gene, the minimal 


15. 


16. 


AZ: 


virus, with a 3500-nucleotide genome, would have only 3 genes; 
the maximal virus, with a 280,000-bp genome, would have 252 
genes. 


Fate of proteins synthesized by the rough ER: 

a. Membrane proteins would enter the ER membrane, and, as 
part of the membrane, be passed to the Golgi, from which 
vesicles depart and fuse with the plasma membrane. 

b. A secreted protein would enter the ER lumen and be trans- 
ferred as a luminal protein to the Golgi, from which vesicles 
depart. When the vesicle fuses with the plasma membrane, 
the protein would be deposited outside the cell. 


Increasing kinetic energy increases the motions of molecules 
and raises their average energy, which means that the difference 
between the energy to disrupt a weak force between two mole- 
cules and the energy of the weak force is smaller. Thus, in- 
creases in kinetic energy may break the weak forces between 
molecules. 


Informational polymers must have “sense” or direction, and they 
must be composed of more than one kind of monomer unit. 


Chapter 2 


l. 
2; 


a. 3.3; b. 9.85; c. 5.7; d. 12.5; e. 4.4; f. 6.97. 


a. 1.26 mM; b. 0.25 mM: c. 4 X 10-2 M; d. 2 X 1074 M; e. 3.16 X 
10710 M; f. 1.26 X 1077 M (0.126 uM). 


. a. [H+] = 2.51 X 10-5 M; b. K, = 3.13 X 1078 M; pK, = 7.5. 


4. a. pH = 2.38; b. pH = 4.23. 


10. 


11. 


12; 


13. 


14. 


15; 


. Combine 187 mL of 0.1 M acetic acid with 813 mL of 0.1 M 


sodium acetate. 


. [HPO, *]/ [HPO] = 0.398. 
. Combine 555.7 mL of 0.1 M NasPO, with 444.3 mL of 0.1 M 


H3PQ,,. Final concentrations of ions will be [HPO; ] = 0.0333 
M; [HPO}?-] = 0.0667 M; [Nat] = 0.1667 M; [H*] = 3.16 x 
1078 M. 


. Add 432 mL of 0.1 N HCI to 1 L 0.05 MBICINE. [BICINE] joa = 


0.05 M/1.432 L = 0.0349 M [protonated form] = 0.0302 M. 


. a. Fraction of HsPO4: @pH 0 = 0.993; @pH 2 = 0.58; @pH 4 = 


0.01; negligible @pH 6. 
b. Fraction of HePO,~: @pH 0 = 0.007; @pH 2 = 0.41; @pH 
4 = 0.986; @pH 6 = 0.94; @pH 8 = 0.14; negligible @pH 10. 
c. Fraction of HPO, : negligible @pH 0, 2, and 4; @pH 6 = 
0.06; @pH 8 = 0.86; @pH 10 ~ 1.0; @pH 12 = 0.72. 
d. Fraction of PO,*-: negligible at any pH <10; @pH 12 = 0.28. 
At pH 5.2, [HsA] = 4.33 xX 107° M; [HA7] = 0.0051 M; 
[HA?~] = 0.014 M; [A*"] = 0.0009 M. 
a. pH = 7.02; [HPO] = 0.0200 M; [HPO] = 0.0133 M. 
b. pH = 7.38; [HsPO4 ] = 0.0133 M; [HPO] = 0.0200 M. 
[HeCOs3] = 2.2 uM; [COg@)] = 0.75 mM. When [HCO3 7] = 
15 mM and [CO2] = 3 mM, pH = 6.8. 
Titration of the fully protonated form of anserine will require 
the addition of three equivalents of OH~. The pK, values lie at 
2.64 (COOH); 7.04 (imidazole-N‘H); and 9.49 (NHs3"). Its iso- 
electric point lies midway between pK and pKs, so pH, = 8.265. 
To prepare 1 L of 0.04 M anserine buffer, add 164 mL of 0.1 M 
HCI to 400 mL of 0.1 M anserine at its isoelectric point, and 
make up to 1 L final volume. 
Add 410 mL of 0.1 M NaOH to 250 mL of 0.1 M HEPES in its 
fully protonated form and make up to 1 L final volume. 


166.7 g/mole. 


16. Add 193 mL of water and 307 mL of 0.1 M HCI to 500 mL of 
0.1 M triethanolamine. 

17. Combine 200 mL of 0.1 M Tris-H* with 732 mL water and 
68 mL of 0.1 M NaOH. 
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c. The relevant pK, for this calculation is 8.3. Combine 400 mL 
of 0.1 M bicine at its pH; (pH 5.3) with 155 mL of 0.1 M 
NaOH and 345 mL of water. 
d. The relevant pK, for this calculation is 2.3. The concentra- 
tion of fully protonated form of bicine at pH 7.5 is 2.18 X 
1077 M. 
19. 0.063 M 
20. 5.2 mM 


21. 




















9 0.2 0.4 0.6 0.8 1.0 
equivalents OHT 


22. 0.0153 nmol/mL - sec 
23. A drop in blood pH would occur. 
24. c. 


Chapter 3 

1. Keg = 613 M; AG® = —15.9 kJ/mol. 

2. AG? = 1.69 kJ/mol at 20°C; A G° = —5.80 kJ/mol at 30°C. 
AS = 0.75 kJ/mol - K. 

3. AG= —24.8 kJ/mol. 

4. State functions are quantities that depend on the state of the 
system and not on the path or process taken to reach that state. 
Volume, pressure, and temperature are state functions. Heat 
and all forms of work, such as mechanical work and electrical 
work, are not state functions. 

5. AG” = AG — 39.5 n (in kJ/mol), where nis the number of 
H* produced in any process. So AG? = AG” + 39.5 n= 
—30.5 kJ/mol + 39.5(1) kJ/mol. 

AGC = 9.0 kJ/mol at 1 M [B+]. 


10. 
T. 


12; 


13. 


14. 


15. 


16. 


ge 


a. Ke (AC) = (0.02 X 1000) = 20. 
b. AG?(AB) = 10.1 kJ/mol. 
AG (BC) = —17.8 kJ/mol. 
AG (AC) = —7.7 kJ/mol. 
Keq = 20. 


. See The Student Solutions Manual, Study Guide and Problems Book 


for resonance structures. 


< Keq = [Cr] [Pi]/[CrP] [H20]. 


Keq = 3.89 X 107. 


. CrP in the amount of 135.3 moles would be required per day to 


provide 5860 kJ energy. This corresponds to 17,730 g of CrP per 
day. With a body content of 20 g CrP, each molecule would 
recycle 886 times per day. 


Similarly, 637 moles of glycerol-3-P, or 108,300 g of glycerol-3-P, 
would be required. Each molecule would recycle 5410 times/day. 


AG= —46.1 kJ/mol. 


The hexokinase reaction is a sum of the reactions for hydrolysis 
of ATP and phosphorylation of glucose: 


ATP + HO == ADP + P; 
Glucose + P; == G-6-P + HO 
Glucose + ATP == G-6-P + ADP 
The free energy change for the hexokinase reaction can thus be 
obtained by summing the free energy changes for the first two 
reactions listed here. 


AG” for hexokinase = —30.5 kJ/mol + 13.9 kJ/mol = 
—16.6 kJ/mol 


Comparing the acetyl group of acetoacetyl-CoA and the methyl 
group of acetyl-CoA, it is reasonable to suggest that the acetyl 
group is more electron-withdrawing in nature. For this reason, it 
tends to destabilize the thiol ester of acetoacetyl-CoA, and the 
free energy of hydrolysis of acetoacetyl-CoA should be somewhat 
larger than that of acetyl-CoA. In fact, AG?” = —43.9 kJ/mol, 
compared with —31.5 kJ/mol for acetyl-CoA. 














Carbamoyl phosphate should have a somewhat larger free 
energy of hydrolysis than acetyl phosphate, at least in part 
because of greater opportunities for resonance stabilization in 
the products. In fact, the free energy of hydrolysis of carbamoyl 
phosphate is —51.5 kJ/mol, compared with —43.3 kJ/mol for 
acetyl phosphate. 

The denaturation of chymotrypsinogen is spontaneous at 

58°C, because the A G° at this temperature is negative (at ap- 
proximately —2.8 kJ/mol). The native and denatured forms are 
in equilibrium at approximately 56.6°C. 

The positive values for AC, for the protein denaturations de- 
scribed in Table 3.1 reflect an increase in motion of the peptide 
chain in the denatured state. This increased motion provides 
new ways to store heat energy. 

Keg = 2.04 X 10 at 298K; Keg = 9 at 320K. 

AH” = 219 kJ/mol 

AG” = 9.64 kJ/mol 

AS” = 702 J/mol: K 

The value of A G is determined at standard state, which in- 
cludes 1M [H*]. Using Equation 3.23, a value for A G° of 
—3.36 kJ/mol can be calculated. This value applies at pH 2, 7, 
and 12 because, of course, A G° is pH-independent. However, 
Equation 3.23 can also be used to determine that AG°’ is 


—14.77 kJ/mol at pH 2 and —71.82 kJ/mol at pH 12. Finally, 
AG” for enolase is not pH-dependent, because the enolase re- 


18. 


19. 
20. 
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action (see Figure 3.11) neither consumes nor produces 
protons. AG? = AG” = 1.8 kJ/mol. 


The magnitude of A G°’ for ATP hydrolysis is sufficiently great 
that it provides ample energy to drive the conversion of A to B, 
even though the reaction is unfavorable. The result is that the 
equilibrium ratio of B to A is more than 10° greater when the 
reaction is coupled to ATP hydrolysis. If A+B were coupled 
instead to 1,3-bisphosphoglycerate hydrolysis, the ratio of B 

to A would be even greater, since the A G” for hydrolysis of 
1,3-bisphosphoglycerate is substantially greater than that of 
ATP. Using the concentrations for 1,3-BPG and 3-PG in Table 
18.2, and repeating the calculations on page 67 yields a ratio 
of [Beq]/ [Acq] = 4.14 X 10°, an even greater ratio than that 
calculated on page 67. 


This exercise is left to the student. Use Figure 3.8 as a guide. 


Without pyrophosphate cleavage, the acyl-CoA synthetase reac- 
tion is only slightly favorable, with a A G” of 0.8 kJ/mol. With 
pyrophosphate cleavage included, the net AG” for the reac- 
tion is —33.6 kJ/mol, a far more favorable value. 


Chapter 4 


Li 


Structures for glycine, aspartate, leucine, isoleucine, methio- 
nine, and threonine are presented in Figure 4.3. 


. Asparagine = Asn = N. 


Arginine = Arg = R. 
Cysteine = Cys = C. 
Lysine = Lys = K. 
Proline = Pro = P. 
Tyrosine = Tyr = Y. 
Tryptophan = Trp = W. 

















. Alanine dissociation: 
COOH COO COO- 
H,Nt—C—H = H,N‘—C—H = H,N—C—H 
CH; CH; i 
Glutamate dissociation: 
COOH COO — COO- Coo 
H,N*—C—H = H;N*—C-H ==> A — ne ee 
CH CH, C li 
CH CHo ti a 
COOH COOH Doo loo 
Histidine dissociation: 
COOH Coo COO Coo 
H,N*—C—H => aeda = H.N é H > aN- 
CHo bie ai eg 
HESH ne=¢u ne=¢u ne=¢u 
H' Dee H* Se Noge NS er 
Lysine dissociation: 
COOH COO - COO — CoO 
H,N*—C—H = H,N*—C—H = H,N—-C—H ==> ee 
(CHo)4 (CHo) 4 (CHo) 4 (ct), 
NH,” NH,* NH,* JH 
Phenylalanine dissociation: 
COOH Coo - COO © 
H,Nt'—C—H = H,N*'—C—H = H,N—C—H 
CH CHo CH 
A a Za 
I ~~ A 
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4. The proximity of the a-carboxyl group lowers the pK, of the 
a-amino group. 
































6. Denoting the four histidine species as His?*, His*, His®, and 
His”, the concentrations are: 











5. i Equivalence point pH 2: [His** ] = 0.097 M, [His*] = 0.153 M, [His] = 
ai 1.53 X 10-5 M, [His~] = 9.6 X 10-3 M. 
T2: ‘ 
: pH 6.4: [His?*] = 1.78 X 1074 M, [His*] = 0.071 M, [His®] = 
E OTE CE 0.179 M, [His] = 2.8 x 10-4 M. 
bs Q =J. 
i : apaa pH 9.3: [His?*] = 1.75 x 10-!2 M, [Hist] — 5.5 x 10-5 M, 
9 [His] = 0.111 M, [His] = 0.139 M. 
s 7. pH = pK, + log (2/1) = 4.3 + 0.3 = 4.6. 
T 1 The y-carboxyl group of glutamic acid is 2/3 dissociated at pH 
6 = 4.6. 
E 8. pH = pK, + log (1/4) = 10.5 + (—0.6) = 9.9. 
i R group pK, = 3.9 9. a. The pH of a 0.3 M leucine hydrochloride solution is approxi- 
3 , a mately 1.46. 
2 COOH PK Tal b. The pH of a 0.3 M sodium leucinate solution is approxi- 
1 mately 11.5. 
or4 J c. The pH of a 0.3 M solution of isoelectric leucine is approxi- 
0 1 2 3 mately 6.05. 
Egaivalenis 10. The sequence of reactions shown would demonstrate that 
L(—)-serine is related stereochemically to L(—)-glyceraldehyde: 
COOH COOH COOH 
HN | H > HN y—ļ qy Br 
CHOH CH, CH, 


L-(—)-Serine 


L-(—)-Alanine 


2-Bromopropanoic acid 











CHO COOH COOH 
HO | H > HO | H > HO H 
CHOH CHOH CH, 


L-(—)-Glyceraldehyde L-Glyceric acid 2-Hydroxypropanoic acid 


Straight arrows indicate reactions that occur with retention of configuration, 
Looped arrows indicate inversion of configuration. (From Kopple, K. D., 1966. 
Peptides and Amino Acids. New York: Benjamin Co.) 


11. Cystine (disulfide-linked cysteine) has two chiral carbons, the 
two a-carbons of the cysteine moieties. Each chiral center can 
exist in two forms, so there are four stereoisomers of cystine. 


COO- 
| + | 
C—H H;N—C—H 


COO — COO - 


7 
HN 





However, it is impossible to distinguish the difference between 
L-cysteine/D-cysteine and D-cysteine/L-cysteine conjugates. So 
three distinct isomers are formed: 


COO” 
| o4 + | 
C—NH, H;N—C—H 


COO — COO — 





I 
I 

i | 4 
: ? | H ¢ NH; H 
I 
CH,—S—S—CH, | 
I 


Mirror plane 
L-Cystine 








12. NHo 
O 
I i C=O 
I—CH, —C—NH, 
CHo 
S $ 
| > 
CH, CH, 
+ | B + E 
AA -G00 HsN—C—COO 


H 


CH, —S—S—CHy 


p-Cystine 





| | ! 
CHo —S+S—CH> 
I 
Mirror plane 


meso-Cystine 


13. 
14. 


There are eight Tyr residues in the protein. 


A water is removed from each amino acid when it is incorpo- 
rated into a protein, so the “molecular weight” of a residue is 
lower by 18 units. Also, most proteins have relatively more small 
side chains (Gly, Ala) and fewer Trp side chains than a statistical 
average would predict. 


15. Aspartame is composed of aspartic acid and phenylalanine, with 


a carboxymethyl cap. These amino acids are linked by a peptide 


16. 


17. 


18. 


19. 


20. 


(amide) bond in aspartame. Heating can cleave amide linkages. 
For this reason, drinks such as coffee and hot chocolate must be 
consumed relatively quickly after preparation. Aspartame kept 
hot for several hours is quite bitter tasting (based on the experi- 
ence of one of the authors). 


Phenylketonuria is an autosomal recessive genetic disease 
caused by a deficiency or absence of the enzyme phenylalanine 
hydroxylase (PAH), an essential enzyme that converts phenylala- 
nine to tyrosine. Without sufficient PAH activity, phenylalanine 
accumulates and is converted to phenylpyruvate (which can be 
detected in the urine). Without treatment, phenylketonurics 
eventually experience progressive mental retardation and 
seizures. Phenylketonuria can be controlled by eliminating 
phenylalanine from the diet, and phenylketonurics should be 
advised not to use aspartame. 


The process for distinguishing R- and S-configurations of chiral 
molecules is described in the text on page 84. Enzymes discrimi- 
nate between isomers of chiral molecules thanks to the asym- 
metric arrangement of amino acid residues in the enzyme active 
site. 


Appropriate ranges for buffering: 


Alanine—1.4—3.4, 8.7-10.7 
Histidine—0.8-2.8, 5.0-7.0, 8.2-10.2 
Aspartic acid—1.1—4.9, 8.8-10.8 
Lysine—1.2-3.2, 8-11.5 


With pK, of 8.3, cysteine would make a useful buffer except that 
cystine, the disulfide of cysteine, can form readily in this pH 
range. For every cystine formed, two cysteine-SH groups are 
eliminated, making the buffering capacity of cysteine of limited 
usefulness. Also, the cysteine sulfhydryl group is the most potent 
nucleophile among the side chains of the 20 common amino 
acids. 


L-threonine is (2S, 3R)-threonine. 
b-threonine is (2R, 3S)-threonine. 
t-allothreonine is (2S, 3S)-threonine. 
p-allothreonine is (2R, 3R)-threonine. 


Chapter 5 


1. 
2. 
3: 


Nitrate reductase is a dimer (2 Mo/240,000 M,). 
Phe-Asp-Tyr-Met-Leu-Met-Lys. 


Tyr-Asn-Trp-Met-(Glu-Leu)-Lys. Parentheses indicate that the rel- 
ative positions of Glu and Leu cannot be assigned from the in- 
formation provided. 


4. Ser-Glu-Tyr-Arg-Lys-Lys-Phe-Met-Asn-Pro. 


. Ala-Arg-Met-Tyr-Asn-Ala-Val-Tyr or Asn-Ala-Val-Tyr-Ala-Arg-Met- 


Tyr sequences both fit the results. (That is, in one-letter code, 
either ARMYNAVY or NAVYARMY.) 


6. Gly-Arg-Lys-Trp-Met-Tyr-Arg-Phe. 


. Actually, there are four possible sequences: NIGIRVIA, 


GINIRVIA, VIRNIGIA, and of course, VIRGINIA. 


8. Gly-Trp-Arg-Met-Tyr-Lys-Gly-Pro. 


10. 


Leu-Met-Cys-Val-Tyr-Arg-Cys-Gly-Pro. 


S— S$ 
Alanine, attached to a solid-phase matrix via its a-carboxyl 
group, is reacted with diisopropylcarbodiimide-activated lysine. 
Both the a-amino and e-amino groups of the lysine must be 
blocked with 9-fluorenyl-methoxycarbonyl (Fmoc) groups. To 


11; 


12. 


13: 


14. 


15. 


16. 
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add leucine to Lys-Ala to form a linear tripeptide, precautions 
must be taken to prevent the incoming Leu a-carboxyl group 
from reacting inappropriately with the Lys e-amino group 
instead of the Lys a-amino group. 

The mass of the myoglobin chain is calculated to be 16,947 + 

1 daltons. 


Unlike any amino acid side chain, the phosphate group (or 
more appropriately, the phosphoryl group) bears two equiva- 
lents of negative charge at physiological pH. Furthermore, re- 
placing an H atom on an S, T, or Y side chain with a phosphoryl 
group introduces a very bulky substituent into the protein struc- 
ture where none existed before. 


A graph of v versus [L] reveals that at v = 0.5, [L] = Kp = 
2.4 mM. 


IRS-1 has 1242 amino acids. Its average molecular mass is 
131,590.97. The amino acid sequence of the tryptic peptide of 
IRS-1 of mass of 1741.9629 is LVSEAAAVVLOLMNIR. The se- 
quence of the tryptic fragment containing the SHPTP-2 site is 
LCGAAGGLENGLNYIDLDLVK. 


Nucleophilic attack by the hydroxyl O of the active-site serine on 
the carbonyl carbon of a peptide bond. 


a. Amino acid changes in mutant hemoglobins that appear on 
the surface of the folded globin chains may affect quaternary 
structure. 

b. Amino acid substitutions on the surface on the quaternary 
hemoglobin structure that create hydrophobic patches might 
lead to polymerization. Such amino acids would include all 
of the hydrophobic amino acids. 


Chapter 6 
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The central rod domain of keratin is composed of distorted 
a-helices, with 3.6 residues per turn, but a pitch of 0.51 nm, 
compared with 0.54 nm for a true a-helix. 


(0.51 nm/turn) (312 residues) / (3.6 residues/turn) = 

44.2 nm = 442 A. 
For an a-helix, the length would be: 
(0.54 nm/turn) (312 residues) / (3.6 residues/turn) = 

46.8 nm = 468 A. 
The distance between residues is 0.347 nm for antiparallel 
B-sheets and 0.325 nm for parallel B-sheets. So 312 residues of 


antiparallel B-sheet amount to 1083 A and 312 residues of paral- 
lel B-sheet amount to 1014 A. 


. The collagen helix has 3.3 residues per turn and 0.29 nm per 


residue, or 0.96 nm/turn. Then: 


(4 in/year) (2.54 cm/in) (107 nm/cm)/ (0.96 nm/turn) = 
1.06 X 10° turns/year. 


(1.06 X 108 turns/year) (1 year/365 days) (1 day/24 hours) 

(1 hour/60 minutes) = 201 turns/minute. 
Asp: The ionizable carboxyl can participate in ionic and hydro- 
gen bonds. Hydrophobic and van der Waals interactions are 
negligible. 
Leu: The leucine side chain does not participate in hydrogen 
bonds or ionic bonds, but it will participate in hydrophobic and 
van der Waals interactions. 
Tyr: The phenolic hydroxyl of tyrosine, with a relatively high 
pX,, will participate in ionic bonds only at high pH but can 
both donate and accept hydrogen bonds. Uncharged tyrosine is 
capable of hydrophobic interactions. The relatively large size of 
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the tyrosine side chain will permit substantial van der Waals 
interactions. 


His: The imidazole side chain of histidine can act as both an ac- 
ceptor and donor of hydrogen bonds and, when protonated, 
can participate in ionic bonds. Van der Waals interactions are 
expected, but hydrophobic interactions are less likely in most 
cases. 


. As an imino acid, proline has a secondary nitrogen with only 


one hydrogen. In a peptide bond, this nitrogen possesses no 
hydrogens and thus cannot function as a hydrogen-bond 
donor in a-helices. On the other hand, proline stabilizes the 
cis-configuration of a peptide bond and is thus well suited to 
B-turns, which require the cis-configuration. 


. For a right-handed crossover, moving in the N-terminal to 


C-terminal direction, the crossover moves in a clockwise direc- 
tion when viewed from the C-terminal side toward the 
N-terminal side. The reverse is true for a left-handed crossover; 
that is, movement from N-terminus to C-terminus is accompa- 
nied by counterclockwise rotation. 


. The Ramachandran plot reveals allowable values of @ and y for 


a-helix and -sheet formation. The plots consider steric hin- 
drance and will be somewhat specific for individual amino acids. 
For example, peptide bonds containing glycine can adopt a 
much wider range of ¢ and w angles than can peptide bonds 
containing tryptophan. 


. The protein appears to be a tetramer of four 60-kD subunits. 


Each of the 60-kD subunits in turn is a heterodimer of two pep- 
tides, one of 34 kD and one of 26 kD, joined by at least one 
disulfide bond. 


. Hydrophobic interactions frequently play a major role in 


subunit-subunit interactions. The surfaces that participate in 
subunit—subunit interactions in the B4 tetramer are likely to 
possess larger numbers of hydrophobic residues than the corre- 
sponding surfaces of protein A. 


. The length is given by (53 residues) X (0.15 nm run/residue) = 


7.95 nm. The number of turns in the helix is given by 
(53 residues) / (3.6 residues/turn) = 14.7 turns. There are 
49 hydrogen bonds in this helix. 


. Glycines are essential components of tight turns (6-turns) and 


thus are often essential for maintenance of protein structure. 


. Asp, Glu, Ser, Thr, His, and perhaps also Asn, Gln, Cys, Arg, Lys. 
. The ability of poly-Glu to form a-helices requires that the gluta- 


mate carboxyls be protonated. Deprotonation produces a 
polyanionic peptide that is not amenable to helix formation. 


A coiled-coil formed from a-helices with 3.5 residues per turn 
would form a symmetrical seven-residue-repeating structure that 
would place the first and fourth residues of the seven-residue 
repeat at the same positions about the helix axis in every seven- 
residue repeat. This would allow the two helices of a coiled-coil 
structure to lie side by side with no twist about the coiled-coil 
axis. Such a structure would probably not be as stable as the 
twisted structure of coiled coils formed from a-helices with 

3.6 residues per turn. 


a. The third sequence would place hydrophobic residues on 
both sides of a B-strand and could thus be found in a parallel 
B-sheet. 

b. The second sequence would place hydrophobic residues on 
just one side of a B-strand and could thus be found in an 
anti-parallel 6-sheet. 


15. 
16. 


c. The sixth sequence consists of GPX repeats (where X is any 
amino acid) and could thus be part of a tropocollagen 
molecule. 

d. The first sequence consists of seven-residue repeats, with first 
and fourth residues hydrophobic, and could thus be part of a 
coiled coil structure. 


The solution to this exercise is to be completed by the student. 


AG” = —34.23 kJ/mol, a number that corresponds to one to 
two H bonds. 
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See structures in The Student Solutions Manual, Study Guide and 
Problems Book. 


. See structures and titration curve in The Student Solutions 


Manual, Study Guide and Problems Book. 


. The systematic name for stachyose is 6-D-fructofuranosyl-O-a-D- 


galactopyranosyl-(1 — 6)-O-a-D-galactopyranosyl-(1 — 6)-a-D- 
glucopyranoside. 


. Glycated hemoglobin can be separated from ordinary hemoglo- 


bin on the basis of charge difference (by ion-exchange chro- 
matography, high-performance liquid chromatography [HPLC] 
electrophoresis, and isoelectric focusing) or on the basis of 
structural difference (by affinity chromatography). 


. The systematic name for trehalose is a-D-glucopyranosyl- 


(1 > 1)-a-p-glucopyranoside. 


Trehalose is not a reducing sugar. Both anomeric carbons are 
occupied in the disaccharide linkage. 


. See structures in The Student Solutions Manual, Study Guide and 


Problems Book. 


. A sample that is 0.69 g a-p-glucose/mL and 0.31 g 


B-b-glucose/mL will produce a specific rotation of 83°. 


. A 0.2 g sample of amylopectin corresponds to 0.2 g/162 g/mole 


or 1.23 X 1073 mole glucose residues; 50 umole is 0.04 of the 
total sample or 4% of the residues. Methylation of such a 
sample should yield 1,2,3,6-tetramethylglucose for the glucose 
residues on the reducing ends of the sample. The amylopectin 
sample contains 1.2 X 10!* reducing ends. 


There are several target sites for trypsin and chymotrypsin in the 
extracellular sequence of glycophorin, and it would be reason- 
able to expect that access to these sites by trypsin and chymo- 
trypsin would be restricted by the presence of oligosaccharides 
in the extracellular domain of glycophorin. 


Energy yield upon combustion (whether by metabolic pathways 
or other reactions) depends on the oxidation level. Carbohy- 
drate and protein are at approximately the same oxidation level, 
and both of these are significantly less than that of fat. 


This mechanism could involve either an Syl or Sy2 mechanism. 
In the former, protonation of the bridging oxygen would result 
in dissociation to produce a carbo-cation intermediate, which 
could be attacked by the phosphate nucleophile. The observa- 
tion of retention of configuration at the anomeric carbon favors 
this mechanism. An Sy2 mechanism would presumably involve 
water attack at the anomeric carbon, with dissociation of the 
oxygen of the carbohydrate chain. This would be followed by 
Sn2 attack by phosphate. Two Sy2 attacks would result in reten- 
tion of configuration at the anomeric carbon atom. 


Laetrile contains a cyanide group. Breakdown of laetrile and 
release of this cyanide function in the body would be highly 
toxic. 
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Chondroitin and glucosamine are amino sugar components of 
cartilage and connective tissue. Dietary supplement with these 
substances could help replenish the cartilage matrix proteo- 
glycan, relieving pain and restoring the proper function of con- 
nective tissue structures. 


Two of the sugar units in stachyose are glucose and fructose, and 
the bond joining them is easily cleaved by stomach enzymes. 
However, the other two sugar units in stachyose are galactoses 

in B(1— 6)-linkages, which are not broken down by human 
enzymes. The result is that stachyose loses a fructose in the 
stomach but the resulting trisaccharide passes into the intestines, 
where bacterial enzymes degrade it, producing intestinal gas in 
the process. Beano contains an enzyme that hydrolyzes 

B(1— 6)-galactose linkages. Taking several Beano tablets before 
a meal of beans or legumes facilitates complete breakdown of 
stachyose and other related oligosaccharides in the stomach, 
avoiding the production of intestinal gas. The active enzyme in 
Beano is referred to as a B(1— 6)-galactosidase. 
6-D-Fructofuranosyl-O-a-D-galactopyranosyl-(1— 6)-O-a-D- 
galactopyranosyl-(1— 6)-a-b-glucopyranoside 

Starch phosphorylase cleaves glucose units, one at a time, 

from starch chains until a (1— 6)-branch is encountered. Thus, 
limit dextrins are amylopectin fragments with a glucose in 

(1— 6)-linkage at each nonreducing end. The mechanism of 
the reaction that cleaves these glucose units is essentially the 
same as that for starch phosphorylase, except that it occurs at a 
glucose unit that is (1— 6)-linked. 


In the beer-making process, the mash fermented by yeast con- 
tains starch, which is partially broken down by the amylase from 
the malt to produce limit dextrins (see problem 16). These limit 
dextrins add significant caloric content to regular beers. Joseph 
Owades used the enzyme amyloglucosidase, which hydrolyzes 
both (1— 4) and (1— 6) linkages in starch, thus breaking 
down the limit dextrins to simple glucose, which is fermented 
normally. (This raises the alcohol content of the beer above 
what would normally be produced, and water is typically added 
to adjust the alcohol content.) 


As described in problem 14, Beano is a B(1— 6)-galactosidase. 
This enzyme can also hydrolyze (1— 6)-glucose linkages, 
although somewhat more slowly than for its natural substrate. 
Thus, Beano can slowly convert the limit dextrins produced in 
the beer-brewing process into simple glucose units, much like 
the enzyme used by Joseph Owades (see problem 17). Amateur 
brewers have learned to use Beano to make their own light beer, 
often referred to as Beano beer. 


Assuming each glucose unit contributes 0.55 nm to the growing 
cellulose polymer (and the plant length), a growth rate of 1 foot 
per day corresponds to a rate for cellulose synthase of 6,400 
glucose units per second. 


Basic amino acid side chains (Arg, His, Lys) in antithrombin III 
present positively charged side chains for ionic interactions with 
sulfate functions on heparin; H-bond donating amino acid side 
chains could form H bonds with O atoms in —OH groups on 
the heparin carbohydrate residues; H-bond accepting amino 
acid side chains could form H bonds with H atoms in —OH 
groups of heparin. 


Because these glycosaminoglycans are rich in hydroxyl groups, 
amine groups, and anionic functions (carboxylates, sulfates), 
they interact strongly with water. The heavily hydrated proteogly- 
cans formed from these glycosaminoglycans are reversibly dehy- 
drated in response to the pressure imposed on a joint during 
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normal body movements. This dehydration has a cushioning 
effect on the joint; when the pressure is relieved, the glycos- 
aminoglycans are spontaneously rehydrated due to their affinity 
for water. 
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Because the question specifically asks for triacylglycerols that 
contain stearic acid and arachidonic acid, we can discount the 
triacylglycerols that contain only stearic or only arachidonic 
acid. In this case, there are six possibilities: 
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. See The Student Solutions Manual, Study Guide and Problems Book 


for a discussion. 


. a. Phosphatidylethanolamine and phosphatidylserine have a 


net positive charge at low pH. 

b. Phosphatidic acid, phosphatidylglycerol, phosphatidylinosi- 
tol, phosphatidylserine, and diphosphatidylglycerol normally 
carry a net negative charge. 

c. Phosphatidylethanolamine and phosphatidylcholine carry a 
net zero charge at neutral pH. 


. Diets high in cholesterol contribute to heart disease and stroke. 


On the other hand, plant sterols bind to cholesterol receptors 
in the intestines but are not taken up by the cells containing 
these receptors. There is substantial evidence that a diet that 
includes plant sterols can reduce serum cholesterol levels 
significantly. 


. Former Interior Secretary James Watt was well known during the 


Reagan administration for his comments on several occasions 
that trees caused and produced air pollution. As noted in this 
chapter, it is of course true that trees emit isoprenes that are the 
cause of the blue-gray haze that is common in still air in mid- to 
late-summer in the eastern United States. However, these iso- 
prene compounds are not significantly toxic to living things, 
and it would be misleading to call them pollutants. 


. Louis L'Amour clearly knew his biochemistry. His protagonist 


knew that fat carries a higher energy content than protein or 
carbohydrate. If meals are going to be scarce (as for a person 
living in the wild and on the run), it is wise to consume fat 
rather than protein or carbohydrate. The same reasoning 
applies for migratory birds in the weeks preceding their long 
flights. 


. Phospholipase Ay from snake bites operates without regulation 


or control, progressively breaking down cell membranes. Phos- 
pholipase A», action to produce cell signals is under precise 
control and is carefully regulated to produce just the right 
amounts of required cell signals. 


. The lethal dose for 50% of animals is referred to as the LD;o. 


The LDso for dogs is approximately 3 mg/kg of body weight. 
Thus, for a 40-lb dog (18.2 kg), consumption of approximately 
55 mg of warfarin would be lethal for 50% of animals. 


. See the Student Solution Guide and www.cengage.com/login for the 


solutions to this problem. 


Humans require approximately 2000 kcal per day. Seal blubber 
is predominantly triglycerides, which yield approximately 9 kcal 
per gram. This means that a typical human would need to 
consume 222 grams of seal blubber per day (about half a 
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pound) in order to obtain all of his or her calories from this 
energy source. 


Results for this problem will depend on the particular cookies 
chosen by the student. 


The only structural differences between cholesterol and stigmas- 
terol are the double bond of stigmasterol at Co.—C»s, and the 
ethyl group at Co4. 

Androgens mediate the development of sexual characteristics 
and sexual function in animals. Glucocorticoids participate in 
the control of carbohydrate, protein, and lipid metabolism. 
Mineralocorticoids regulate salt (Nat, K*, and C17) balances in 
tissues. 


The answers to this question will depend on the household 
products chosen and the isoprene substances identified. 


Hydroxide ions (in lye) catalyze the breakdown of triglycerides 
to produce fatty acid salts such as sodium stearate and sodium 
palmitate and leaving glycerol as a byproduct. Micelle formation 
by these “soaps” leads to emulsification of fats and other non- 
polar substances. 


Amphipathic phosphatidylcholine from egg yolk exerts a deter- 
gentlike action on mixtures of vegetable oil and water. Micelles 
made primarily of egg phosphatidylcholine emulsify the veg- 
etable oil, forming a stable suspension that persists indefinitely. 
(Thus, mayonnaise does not need to be mixed or shaken before 
use, whereas oil and vinegar mixtures do.) The micellar parti- 
cles of mayonnaise are large enough to scatter visible light, so 
mayonnaise appears milky white, whereas the separated layers of 
oil and water in containers of oil and vinegar appear clear. 


Stanol esters function in cholesterol reduction by binding to 
cholesterol receptors in the intestines. However, each serving of 
stanol esters consumed only blocks a fraction of all intestinal re- 
ceptors. Regular consumption of stanol esters eventually blocks 
all or most available receptors. Binding of stanol esters is tight 
and long lasting, but not indefinite, so stanol ester consumption 
must be continued to maintain the beneficial results. The graph 
on page 236 shows that reductions of serum cholesterol levels 
can approach 15%; given that the typical diet accounts for about 
15% of total serum cholesterol, it may be concluded that stanol 
esters are highly effective in preventing uptake of dietary 
cholesterol. 


Serum cholesterol is partly derived from diet and partly from 
synthesis in the liver. Stanol esters prevent uptake of dietary cho- 
lesterol but do not affect synthesis in the liver. Cholesterol- 
lowering drugs, on the other hand, block cholesterol synthesis 
but have no effect on uptake from diet. Thus, the effects of 
these two classes of cholesterol-lowering agents are additive. 


Research has shown that consumption of substantial quantities 
of plant fats can lead to significant reduction of serum choles- 
terol. The content of so-called phytosterols varies depending on 
the source. However, a sterol- or stanol-fortified spread like 
Benecol probably provides the highest concentration of dietary 
agents for cholesterol lowering. 


Tetrahydrogestrinone (THG) is synthesized by the catalytic hy- 
drogenation of gestrinone (which has an acetylene group in 
place of the ethyl group on the D-ring of tetrahydrogestrinone). 
Patrick Arnold, known as the “father of prohormones,” is cred- 
ited with the synthesis of THG from gestrinone and the promo- 
tion of THG as an anabolic steroid in preparations such as The 
Clear and The Cream. These substances were marketed aggres- 
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sively to top athletes in several sports by Arnold and Victor 
Conte of the Bay Area Laboratory Co-Operative. THG (at that 
time) was undetectable by existing laboratory analyses, and its 
use was not specifically prohibited in athletic competitions by 
the World Anti-Doping Code. Use of THG has since been pro- 
hibited by nearly all sports regulatory agencies, and effective 
methods for its detection have been developed. Numerous ath- 
letes have by now admitted to use of THG or have been con- 
victed for perjury in denial of its use, and many have been 
stripped of medals and awards for athletic accomplishments 
aided by THG. 


Most obviously, a diet of triglycerides (from the blubber of seals, 
the polar bears’ favorite food) provides high energy and mate- 
rial that can be reprocessed into other triglycerides and used as 
insulation under the skin. Less obvious is the need of the polar 
bear to stay warm and conserve water. (Polar bears cannot 
afford to eat snow or ice [they are too cold], and they cannot 
drink seawater [it is too salty].) The polar bear is adapted to 
conserve body water and stay warm, and thus it does not urinate 
for months at a time. (Urination would give up both water and 
heat.) To achieve this, it must consume little or no nitrogen, 
because a diet rich in nitrogen would require urination and 
defecation. Triglycerides contain no nitrogen and are thus ideal 
food for the adult polar bear. Juvenile polar bears, which have 
not yet reached their full adult body size, must consume protein 
in order to make their own proteins. Once the bear reaches its 
full size, it changes its diet and no longer consumes protein! 


Snake venom phospholipase As cleaves fatty acids from phos- 
pholipids at the C-2 position. The fatty acids behave as deter- 
gents and form micelles (see Chapter 9) that can remove lipids 
and proteins from the membrane and disrupt membrane struc- 
ture, causing pores and eventually rupturing the cell itself. 


Chapter 9 
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Glycerophospholipids with an unsaturated chain at the C-1 posi- 
tion and a saturated chain at the C-2 position are rare to nonex- 
istent. Glycerophospholipids with two unsaturated chains, or 
with a saturated chain at C-] and an unsaturated chain at C-2, 
are commonly found in biomembranes. 


. The phospholipid/ protein molar ratio in purple patches of 


H. halobium is 10.8. 


. See The Student Solutions Manual, Study Guide and Problems Book 


for plots of sucrose solution density versus percent by weight 
and by volume. The plot in terms of percent by weight exhibits 
a greater curve, because less water is required to form a solution 
that is, for example, 10% by weight (10 g sucrose/100 g total) 
than to form a solution that is 10% by volume (10 g sucrose/ 
100 mL solution). 


. r= (4D) "?. 


According to this equation, a phospholipid with D = 1 X 
1078 cm? will move approximately 200 nm in 10 milliseconds. 


. Fibronectin: For t = 10 msec, r= 1.67 X 1077 cm = 1.67 nm. 


Rhodopsin: r= 110 nm. 


All else being equal, the value of D is roughly proportional to 
(M) 12. Molecular weights of rhodopsin and fibronectin are 
40,000 and 460,000, respectively. The ratio of diffusion coeff- 
cients is thus expected to be (40,000) ~!/3/ (460,000) ~1/3 = 2.3. 


On the other hand, the values given for rhodopsin and fibro- 
nectin give an actual ratio of 4286.The explanation is that fibro- 
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nectin is anchored in the membrane via interactions with cyto- 
skeletal proteins, and its diffusion is severely restricted com- 
pared with that of rhodopsin. 


a. Divalent cations increase Tm- 

b. Cholesterol broadens the phase transition without signifi- 
cantly changing Tn. 

c. Distearoylphosphatidylserine should increase Tm, due to in- 
creased chain length and also to the favorable interactions 
between the more negative PS head group and the more pos- 
itive PC head groups. 

d. Dioleoylphosphatidylcholine, with unsaturated fatty acid 
chains, will decrease Tn. 

e. Integral proteins will broaden the transition and could raise 
or lower Ta, depending on the nature of the protein. 


. AG= RT In([Cy]/[C)]). 


AG= +4.0 kJ/mol. 


. AG= RT In([ Gud /[Cin]) + ZFAY. 


AG= +4.08 kJ/mol —2.89 kJ/mol. 
AG= +1.19 kJ/mol. 


The unfavorable concentration gradient thus overcomes the fa- 
vorable electrical potential and the outward movement of Na* is 
not thermodynamically favored. 





. One could solve this problem by going to the trouble of plotting 


the data in vvs. [S], 1/v vs. 1/[S], or [S]/v vs. [S] plots, but it is 
simpler to examine the value of [S]/v at each value of [S]. The 
Hanes—Woolf plot makes clear that [S]/v should be constant for 
all [S] for the case of passive diffusion. In the present case, 
[S]/vis a constant value of 0.0588 (L/min) ~!. It is thus easy to 
recognize that this problem describes a system that permits 
passive diffusion of histidine. 


This is a two-part problem. First calculate the energy available 
from ATP hydrolysis under the stated conditions; then use the 
answer to calculate the maximal internal fructose concentration 
against which fructose can be transported by coupling to ATP 
hydrolysis. Using a value of —30.5 kJ/mol for the AG®°’ of ATP 
and the indicated concentrations of ATP, ADP, and P;, one finds 
that the AG for ATP hydrolysis under these conditions (and at 
298 K) is —52.0 kJ/mol. Putting the value of +52.0 kJ/mol into 
Equation 9.1 and solving for Cg yields a value for the maximum 
possible internal fructose concentration of 1300 M! Thus, ATP 
hydrolysis could (theoretically) drive fructose transport against 
internal fructose concentrations up to this value. (In fact, this 
value is vastly in excess of the limit of fructose solubility.) 


Each of the transport systems described can be inhibited (with 
varying degrees of specificity). Inhibition of the rhamnose trans- 
port system by one or more of these agents would be consistent 
with involvement of one of these transport systems with rham- 
nose transport. Thus, nonhydrolyzable ATP analogs should 
inhibit ATP-dependent transport systems, ouabain should specif- 
ically block Na* (and K*) transport, uncouplers should inhibit 
proton gradient—dependent systems, and fluoride should inhibit 
the PTS system (via inhibition of enolase). 

N-myristoyl lipid anchors are found linked only to N-terminal 
Gly residues. Only the peptide in (e) of this problem contains 
an N-terminal Gly residue. 

Only the peptide in (a) possesses a CAAX sequence (where C = 
Cys, A = aliphatic (Ala, Val, Leu, Ile), and X = any amino acid). 
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The hydropathy plot of a soluble protein should show no sub- 
stantial stretches of hydrophobic residues, except for the signal 
sequence. 


Prolines are destabilizing to a-helices, and a short helix with a 
proline would not be likely to be stable. Where a proline occurs 
in an a-helix, the helix is bent or kinked. Helices with a proline 
kink tend to be longer than average, presumably because longer 
helices are more stable and more able to tolerate the loss of 

H bonds in the kink region. 


The structural consequences of a proline-induced kink in an 
a-helix are described in detail in the references provided in this 
problem. 


Porin proteins typically consist of 18-stranded f-barrels. With 

9 to 11 residues per strand, about 180-200 residues would be 
required to form the barrel, with a roughly equivalent number 
required to form the loops between strands. (The maltoporin 
chains from FE. coli consists of about 420 residues.) The trans- 
membrane domain of Wza consists of eight a-helical segments 
with about 25 residues per helix, about 200 residues in all. Thus, 
the number of residues needed to create the transmembrane 
pore in these two proteins is about the same, even though the 
number of residues per membrane spanning segment is less for 
a B-strand (9-11) than for an a-helix (21-25). 

From Figure 9.29, the area within a typical “fenced” area is 
about 0.3 um X 0.3 um, or 9 X 10° A®. Dividing by 60 AY, it 
appears that there are about 150,000 phospholipids in a 
monolayer-fenced area in the membrane (assuming a mem- 
brane of pure phospholipid, with no cholesterol, etc.). 


The lysine side chain N is about 5.5 A from the a-carbon (about 
1.1 A per bond). If a Lys side chain was reoriented toward the 
membrane center, the maximum change in position relative to 
the membrane center would be 10-11 A (assuming the position 
of the a-carbon did not change). Figure 9.15 indicates that the 
energy of a Lys side chain would change by 4kT over the 15-A 
distance from the membrane surface to the membrane center. 
A movement of 10 A would thus correspond to 2/3 of 4kT, or 
8kT/3—almost twice the average translation kinetic energy of a 
molecule in the gas phase (3kT/2). 


In a hydrophobic environment, the aspartate carboxyl group 
will be more stable in the protonated (uncharged) form than in 
the deprotonated (charged) form. 


In a hydrophobic environment, the side chains of lysine and 
arginine would be more stable in their deprotonated (un- 
charged) forms than in the protonated (charged) forms. As a 
result, the pK, values of these residues would be lowered signifi- 
cantly in a hydrophobic environment. 


Based on the discussion in problems 20 and 21 in this chapter, it 
would be reasonable to imagine that light-induced conformation 
changes could alter the pK, values of the proton-transferring 
moieties in the protein. For example, a conformation change 
that made the environment around an Asp carboxyl more polar 
would reduce the pK, of that group, promoting dissociation (i.e., 
proton release). An appropriate sequence of such changes could 
accomplish transmembrane proton transport. 


Point (b)—that proteins can be anchored to the membrane via 
covalent links to lipid molecules—was not part of the Singer- 
Nicolson fluid mosaic model. 


A-10 Abbreviated Answers to Problems 


Chapter 10 access, for example, by binding in the major groove, or the 
1. H o restriction endonuclease must be able to “read” the base se- 
| quence by indirect means, for example, if the base sequence 
+N UH : ; g ee : A 
o j O imparts some local variation in the cylindrical surface of the 
Il 4 | ak DNA that the enzyme might recognize. 
OCH, N NH c. The restriction endonuclease must be able to cleave both 
Si DNA strands, often in a staggered fashion. 
70 H H d. An obvious solution to the requirements listed in (a) and (c) 
H H would be a homodimeric subunit organization for restriction 
OH OH endonucleases. 


HO—P— 


2. See Figure 10.15. 13. a. The ribose group of nucleosides greatly increases the water 


= = = lubility of the base. 
3. fa = 0.29; fo = 0.22; fo = 0.25; fr = 0.28. 9 
à : PE b. Ribose has fewer hydroxyl groups and thus less likelihood to 
. The number of A and T residues = 3.14 X 10° each; the 


; T A undergo unwanted side reactions. 
nuniber OE ADE Ore 2.08 2 10" Sach, c. The absence of the 2-OH in 2-deoxyribose leads to a poly- 


aN 


5. 5'-TAGTGACAGTTGCGAT-3’. nucleotide sugar—phosphate backbone that is more stable 
6. 5'-ATCGCAACTGTCACTA-3’. because it is not susceptible to alkaline hydrolysis. 

7. 5!-TACGGTCTAAGCTGA-3’. 14. a. Phosphate groups bear a negative electrical charge at 

8. There are two possibilities, a and b. (E = EcoRI site; B = BamHI neutral pH. 


b. Cleavage of phosphoric anhydride bonds is strongly exer- 


site.) Note that b is the reverse of a. : x ET ee 
) gonic and can provide the thermodynamic driving force for 








a diverse metabolic reactions. 
B EB c. Cleavage of phosphoric anhydride bonds is strongly exer- 
m j gonic and can provide the thermodynamic driving force for 
1 3 0.5 5.5 phosphodiester bond formation in polynucleotide synthesis. 
5 15. Once in every 4.4 X 10"? nucleotides. 
l BE B 16. A DNA sequence 16 nucleotides long. 
a l 17. The strategy for protein sequencing by Edman degradation, as 
i a i i described on pages 80 and 102, could be adapted, replacing 
5.5 0.5 3 l Edman’s reagent with snake venom phosphodiesterase acting on 
9. a. GGATCCCGGGTCGACTGCAG; an immobilized nucleotide sequence. 
b. GITCGACCCGGGATCCTGCAG. 18. 3.59 X 10" D. 
Smal products: a. GGATCCC and GGGTCGACTGCAG. 19. The use of bases as “information symbols” in metabolism allows 
b. GICGACCC and GGGATCCTGCAG. the cell to allocate portions of its phosphorylation potential (as 


total NTP) to dedicated tasks, as in GTP for protein synthesis, 
CTP for phospholipid synthesis, UTP for carbohydrate synthesis, 
with ATP serving the central role. Enzymes in these pathways se- 
lectively bind the proper nucleotide through recognition of the 


10. Synthesis of a polynucleotide 100 residues long requires forma- 
tion of 99 phosphodiester bonds. A G°’ for phosphodiester syn- 
thesis (assuming it is the same magnitude but opposite sign as 
that for phosphoric anhydride cleavage of an NTP to give ; 
NMP + PP;) is +32.3 kJ/mol. AG" wera = (99) (+32.3) = particular base. 

3198 kJ/mol. 20. The most prominent structural feature of the DNA double helix 
in terms of structural complementarity is found in the canonical 
base-pairing of A with T and G with C. Base pairing is essential 
to DNA replication (preservation of the genetic information) 
and transcription (expression of genetic information). 


11. a. Hydrogen bonding and van der Waals interactions between 
amino acid side chains and DNA, and ionic interactions of 
amino acid side chains with the nucleic acid backbone phos- 
phate groups. Double-helical DNA does not present hydro- 
phobic regions for interaction with proteins because of 
base-pair stacking. 

b. Proteins can recognize specific base sequences if they can fit 1. 
within the major or minor groove of DNA and “read” the 
H-bonding pattern presented by the edges of the bases in the 
groove. The dimensions of an a-helix are such that it fits 
snugly within the major groove of B-DNA; then, depending 
on the amino acid sequence of the protein, the side chains 
displayed on the circumference of the a-helix have the po- 
tential to form H bonds with H-bonding functions provided 
by the bases. 


Chapter 11 


Qar Aa 


12. a. The restriction endonuclease must be able to recognize a 
specific nucleotide sequence in the DNA that has twofold 
rotational symmetry. 

b. In order to read a base sequence within DNA, either the re- 
striction endonuclease must interact with the bases by direct 


LHETTETTTiTeit 0 
rPOaHaAPA 








2. Original nucleotide: 5'-GATAGCGCAAAGATCAACCTT. 


. a. 10.5 base pairs per turn; b. Ad = 34.3°; c (true repeat) = 


6.72 nm. 


4. 27.3 nm; 122 base pairs. 
5. 4.35 um. 


LMA 


0 


10. 


Ti, 


12. 


13. 


14. 


15. 


16. 


17. 
18. 


19. 








12,L=T+W 
12/160 = —0.075. 


160 + (—12) = 148. 





ao = AL/ Lo = 


. For 1 turn of B-DNA (10 base pairs): Lg = 1.0 + Wp. 


For Z-DNA, 10 base pairs can only form 10/12 turn (0.833 turn), 
and Lz = 0.833 + Wz. 


anticodon 


Abbreviated Answers to Problems A-11 
For the transition B-DNA to Z-DNA, strands are not broken, so 
Ly = Lz; that is, 1.0 + Wy = 0.833 = W;, or W, — Wy = +0.167. 
(In going from B-DNA to Z-DNA, the change in W, the number 
of supercoils, is positive. This result means that, if B-DNA con- 
tains negative supercoils, their number will be reduced in 
Z-DNA. Thus, all else being equal, negative supercoils favor the 
B— Z transition.) 


. 6 X 10° bp/200 bp = 3 X 107 nucleosomes. 


The length of B-DNA 6 X 10° bp long = (0.34 nm) (6 X 10°) = 
2.04 X 10° nm (more than 2 meters!). The height of 3 X 107 
nucleosomes = (6 nm) (3 X 107) = 18 X 107 nm (0.18 meter). 


. From Figure 11.33: Similarly shaded regions indicate comple- 


mentary sequences joined via intrastrand hydrogen bonds: 


CCA end 


4 























B77 ee 


3 


nucleotide no. 


Increasing order of Ta: yeast < human < salmon < wheat < 
E. coli. 


In 0.2 M Na+, Ta (°C) = 69.3 + 0.41 (%G + C): 
Rats (%G + C) = 40%, Ta = 69.3 + 0.41 (40) = 85.7°C 
Mice (%G + C) = 44%, Tm = 69.3 + 0.41 (44) = 87.3°C 


Because mouse DNA differs in GC content from rat DNA, they 
could be separated by isopycnic centrifugation in a CsCl 
gradient. 

GC content = 0.714 (from Table 10.1 and equations used in 
problem 3, Chapter 10). p = 1.660 + 0.098(GC) = 1.730 g/mL. 


See Figure 11.35 and compare your structure with the base pair 
formed between G18 and 55 in yeast phenylalanine tRNA, 
which has a single H bond between the 2-amino group of 

G (2-NH» --- O=) and the O atom at position 4 in w. 


Assuming the plasmid is in the B-DNA conformation, where 
each pair contributes 0.34 nm to the length of the molecule, the 
circumference of a perfect circle formed from pBR322 would be 
(0.34 nm/bp) (4363 bp) = 1483 nm = 1.48 um. 


The E. coli K12 chromosome laid out as a perfect circle would 
have a circumference of (0.34 nm/bp) (4,639,000 bp) = 
1,577,260 nm = 1.58 mm. 


The diameter of a B-DNA molecule is about 2.4 nm (see 

Table 11.1). For pBR322, the length/diameter ratio would be 
1480/2.4 = 616.7. For the E. coli K12 chromosome, the 
length/diameter ratio would be 1,577,260/2.4 = 657,192. 

(a) Z-DNA; (b) cruciform; (c) triplex DNA; (d) tRNA; 

(e) type-II restriction endonuclease site. 

Rely on the features in Figure 11.33 to draw a tRNA cloverleaf 
with the anticodon positioned at nucleotides 34-36: CUG. 


Erythromycin is within the palm of the mitten. 

(a) B-globin; (b) hexosaminidase A (a-polypeptide); (c) insulin 
receptor; (d) neutral amino acid transporter. 

The DNA in such a thermophilic organism would be subjected 
to very high temperatures that might denature the DNA. 
Because G:C base pairs are more heat stable than A:T base 
pairs, one might expect DNA from a thermophilic organism to 
have a high G + C content. 


4 
O Ry 7 A 


20. 











60 70 


DNA is the material of heredity; that is, genetic information. 
This information is encoded in the sequence of bases in DNA, 
so this is its most important structural feature. The double- 
stranded nature of DNA and the complementary base sequence 
of the two strands are also crucial structural features in the 
transmission of genetic information through DNA replication. 
Beyond these points, one might cite the structural features of 
DNA that impart stability to the double helix and structural 
aspects that render DNA less susceptible to degradation. 


Chapter 12 


l. 


Linear and circular DNA molecules consisting of one or more 
copies of just the genomic DNA fragment; linear and circular 
DNA molecules consisting of one or more copies of just the 
vector DNA; linear and circular DNA molecules containing one 
or more copies of both the genomic DNA fragment and plasmid 
DNA. 


2, 
-GAATTCCCGGGGATCCTCTAGAGTCGACCTGCAGGCATGC- 
GAATTC GGATCC GTCGAC GCATGC 
EcoRI BamHI Sall SphI 

CCCGGG TCTAGA CTGCAG 
Smal Xbal PstI 
3, a. AAGCTTGAGCTCGAGATCTAGATCGAT 
Hindi Xhol Xbal 
Sacl Bell Clal 
b. 

Vector: HindIII: 5'-A........ -gap-. -ooo CGAT-3': Clal 

3'-TTCGA. .. gap- o. oeenn TA-5' 


Fragment: HindIII: 5'-AGCTT(NNNN-etc-NNNN)AT-3' 


IO ow A 


3'-A(NNNN-etc-NNNN) TAGC-5' 


. N = 3480. 

. N = 10.4 million. 

. 5'-ATGCCGTAGTCGATCAT and 5'-ATGCTATCTGTCCTATG. 

. -Thr-Met-Ile-Thr-Asn-Ser-Pro-Asp-Pro-Phe-Ile-His-Arg-Arg-Ala-Gly-le- 


Pro-Lys-Arg-Arg-Pro. .. 
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10. 


ll. 


12. 


13. 


14. 


The junction between -galactosidase and the insert amino acid 
sequence is between Pro and Asp, so the first amino acid 
encoded by the insert is Asp. (The polylinker itself codes for Asp 
just at the BamHI site, but in constructing the fusion, this Asp 
and all of this downstream section of polylinker DNA is dis- 
placed to a position after the end of the insert.) 


. 5'(G)AATTCNGGNATGCAYCCNGGNAAR¢! Ty*GCNAGY TGG- 


TTYGTNGGGAATTCN- 


(Note: The underlined triplet AGY represents the middle Ser 
residue. Ser codons are either AGY or TCN [where Y = pyrimi- 
dine and N = any base]; AGY was selected here so that the 
mutagenesis of this codon to a Cys codon [TGY] would involve 
only an A —> T change in the nucleotide sequence.) 


Because the middle Ser residue lies nearer to the 3'-end of this 
EcoRI fragment, the mutant primer for PCR amplification 
should encompass this end. That is, it should be the primer for 
the 3— 5’ strand of the EcoRI fragment: 


5'-NNNGAATTCCCN,ACR,AACCAR,.CAN,.GC-3' 


where the mutated Ser — Cys triplet is underlined, NNN = 
several extra bases at the 5'-end of the primer to place the EcoRI 
site internal, N,= the nucleotide complementary to the nucleo- 
tide at this position in the 5’—3’ strand, and R,= the pyrimi- 
dine complementary to the purine at this position in the 5’3’ 
strand. 


. The number of sequence possibilities for a polymer is given by 


x, where x is the number of different monomer types and y is 
the number of monomers in the oligomers. Thus, for RNA 
oligomers 15 nucleotides long: x) = 4!° = 1,073,741,824. For 
pentapeptides, x? = 20° = 3,200,000. 


See Figure 12.17. You would need a GA/4-deficient yeast strain 
expressing a fusion protein composed of the GAL4 DB domain 
fused with a cytoskeletal protein (such as actin or tubulin) to 
serve as the bait. These GAL4~ cells would be transformed with 
a cDNA library of human epithelial proteins constructed so that 
the human proteins were expressed as GAL4 TA fusion proteins; 
these fused proteins are the target. Interaction of a target 
protein with the cytoskeletal protein fused to the GAL4 DB 
domain will lead to GAL4-driven expression of the lacZ gene, 
whose presence can be revealed by testing for B-galactosidase 
activity. 

See Figure 12.9 for isolation of mRNA and Figure 12.10 for 
preparation of a cDNA library from mRNA. The mRNA would 
be isolated, and the cDNA libraries would be prepared from two 
batches of yeast cells, one grown aerobically and one grown 
anaerobically. 


See Figure 12.11. Differently labeled single-stranded cDNA from 
separate libraries prepared from aerobically versus anaerobically 
grown yeast cells would be hybridized with a DNA microarray 
(gene chip) of yeast genes. 


An antibody against hexokinase A could be used to screen the 
yeast cDNA library to identify a yeast colony expressing this 
protein (see discussion on page 369). Once a sample of the pu- 
tative yeast hexokinase A protein was isolated, its identity could 
be confirmed by peptide mass fingerprinting using mass spec- 
trometry (see page 108). 


The experiment in problem 12 identified the cDNA clone for 
the protein, but this clone will not have the regulatory elements 
(promoter) necessary for the experiment at hand. Using the 
cDNA clone as a probe, the genomic clone for this nox gene 


could be identified in a yeast genomic library. Cloning and se- 
quencing a genomic clone would identify putative promoter 
regions that could be fused to the coding region of green fluo- 
rescent protein (GFP) to create a reporter gene construct (see 
pages 371-372) that would “light up” more and more as oxygen 
levels declined. 


15. Go to the NCBI site map at hitp://www.nchi.nlm.nih.gou/Sitemap/ 


index.html. In the alphabetical index, click on Genomes and 
Maps to go to the Entrez Genome site at hitp://www.ncbi.nlm.nih 
.gou/Sitemap/index. himl#Genomes. 


Under Organism Collections, Entrez Genomes section, click on 
the links for bacteria, archea, and eukaryotes to see: 


Number of bacterial genomes completed at 
hitp://www.ncbi.nlm.nih.gov/PMGifs/Genomes/eub_g.html 


Number of archeal genomes completed at 
hitp://www.ncbi.nlm.nih.gov/PMGifs/Genomes/a_g.himl 


Completed eukaryotic genomes accessible at 
hitp://www.ncbi.nlm.nih. gov/PMGifs/Genomes/euk_g.html 


16. Insertion of DNA at any of the following restriction sites would 


render cells harboring the recombinant plasmid ampicillin- 
sensitive: Sspl, Scal, Puul, PstI, and Ppal. 


17. The respective codons are Trp = UGG; Cys = UGC and UGU. 


Thus, changing only the third base G in the Trp codon to a 
pyrimidine (either C or U) would yield a Cys codon. In this 
case, there are now 6 differences in codon possibilities, so 
2° = 64 different oligonucleotides must be synthesized. 


Chapter 13 


l. v/Vinax = 0.8. 
2. v= 91 wmol/mL - sec. 


3. K, = 1.43 X 10° M; Km = 3 X 1074 M. Because ky is 20 times 
greater than k_,, the system behaves like a steady-state system. 


4. If the data are graphed as double-reciprocal Lineweaver—Burk 
plots: 
a. Vinx = 51 wmol/mL - sec and K,, = 3.2 mM. 
b. Inhibitor (2) shows competitive inhibition with a Kı = 
2.13 mM. Inhibitor (3) shows noncompetitive inhibition 
with a Kj = 4 mM. 








[S] 








[S] 








[S] 








[S] 











[S] 


In (a), Vinax doubles, but K„ is constant. 


In (b), Vnax halves, but K,,is constant. 
In (c), Vinax is constant, but the apparent Kn increases. 
In (d), Vinax decreases, but K,, is constant. 


In (e), Vinx decreases, K,, decreases, but the ratio K,,/Vmax is 
constant. 
a. The slope is given by K„P/ Vinax(Ks‘/[A] + 1). 
b. y-intercept = ((K,,°/[A]) + 1)1/Vinax- 
c. The horizontal and vertical coordinates of the point of 
intersection are 1/[B] = —K,A/Ks‘K,2 and 
l/v = 1/Vinax (1 > (K,,\/Ks)). 


. Top left: (1) Competitive inhibition (I competes with S for 


binding to E). (2) I binds to and forms a complex with S. 


Top right: (1) Pure noncompetitive inhibition. (2) Random, 
single-displacement bisubstrate reaction, where A doesn’t affect 
B binding, and vice versa. (Other possibilities include [3] Irre- 
versible inhibition of E by I; [4] 1/vvs. 1/[S] plot at two differ- 
ent concentrations of enzyme, E.) 


Bottom left: (1) Mixed noncompetitive inhibition. (2) Ordered 
single-displacement bisubstrate mechanism. 


Bottom right: (1) Uncompetitive inhibition. (2) Double- 
displacement (ping-pong) bisubstrate mechanism. 


. Clancy must drink 694 mL of wine, or about one 750-mL bottle. 


9. a. Ks = 5 uM b. Km = 30 uM; c. Rea = 5 X 10° sec; d. Reat/ Km = 


10. 


TI, 


1.67 X 10° M` 'sec™!; e. Yes, because keat/ Km approaches the 
limiting value of 10° M~ 'sec™!; f. Vinax = 107° mol/mL : sec; 

g. [S] = 90 uM; h. Vnax would equal 2 X 107° mol/mL : sec, but 
Ky = 30 uM, as before. 

a. Vinx = 132 umol mL! sec}; b. Kear = 44,000 sec}; c. Rear/ Km = 
24.4 X 108 M`! sec7}; d. Yes! Reat/ Km actually exceeds the theoreti- 
cal limit of 10° M~! sec”! in this problem; e. The rate at which E 
encounters S; the ultimate limit is the rate of diffusion of S. 

a. Vmax = 1.6 umol mL! sec™!; b. v = 1.45 umol mL“! sec7}; 

c. Rear / Km = 1.6 X 108 M7! sec7!; d. Yes. 


12. 


13. 


14. 


15. 


16. 


17. 


Abbreviated Answers to Problems A-13 
a. Vinx = 6 pmol mL™! sec™}; b. ken = 1.2 X 10° sec}; 

c. Rea/ Km = 1 X 108 M~! sec7!; d. Yes! Reat/ Km approaches the 
theoretical limit of 10? M~! sec™!; e. The rate at which E en- 
counters S; the ultimate limit is the rate of diffusion of S. 

a. Vinx = 64 umol mL“! sec}; b. Rea, = 1.28 X 104 sec}; 

C. Rea/ Ky = 1.4 X 108 M`! sec}; d. Yes! kea/ Km approaches 

the theoretical limit of 10? M~! sec™!. 


a. Vinx = 120 mmol mL”! sec™!; b. v = 104.7 mmol mL”! sec” !; 
C. Rea/ Ky = 3.64 X 108 M~! sec™}; d. Yes! kea/ Km approaches the 
theoretical limit of 10° M~! sec7!. 


a. Starting from Vmax = ko[E7] and Vina’ = ki [Ex] for the 
maximal rates of the forward and reverse reactions respec- 
tively, and the Michaelis constants K,°=(k-, + ko) /k, and 
K,?=(k_, + ko)/k-o for S and P, respectively, 


v= (Voa [S]/ Kn) — Vna [P]/KuP)/(1 + [S]/ Kn + [P]/ Kn’) 

b. At equilibrium, v = 0 and Keq = [P]/[S] = Va 
Ka S/V Kno. 

a. Sis the preferred substrate; its K„ is smaller than the K, for 
T, so a lower [S] will give v = Vinax/2, compared with [T]. 

b. ka/ Kn defines catalytic efficiency. kea/K,, for S = 
2 X 107 M7!sec7!s keat/ Kn for T = 4 X 107 M~'sec™!, so the 
enzyme is a more efficient catalysis with T as substrate. 

a. Because the enzyme shows maximal activity at or below 40°C, 
it seems more like a mammalian enzyme than a plant 
enzyme, which would be expected to have a broader temper- 
ature optimum because plants experience a broader range of 
temperatures. 

b. An enzyme from a thermophilic bacterium growing at 80°C 
would show an activity versus temperature profile similar to 
this one but shifted much farther to the right. 


Chapter 14 


1. 


2; 


a. Nucleophilic attack by an imidazole nitrogen of His*” on the 
—CH,— carbon of the chloromethyl group of TPCK cova- 
lently inactivates chymotrypsin. (See The Student Solutions 
Manual, Study Guide and Problems Book for structures.) 

b. TPCK is specific for chymotrypsin because the phenyl ring of 
the phenylalanine residue interacts effectively with the 
binding pocket of the chymotrypsin active site. This positions 
the chloromethyl group to react with His”. 

c. Replacement of the phenylalanine residue of TPCK with 
arginine or lysine produces reagents that are specific for 
trypsin. 

a. The structures proposed by Craik et al., 1987 (Science 
237:905-907) are shown here. (If you look up this reference, 
note that the letters A and B of the figure legend for Figure 
3 of this article actually refer to parts B and A, respectively. 
Reverse either the letters in the figure or the letters in the 
figure legend and it will make sense.) 


A \ 357 Wild type 
N— 
| / 
56 No H 
Zz H WA 
>9 
i= 
Ne 
Asp!02 On His”” 
A 
` N 
195 
sexttt ( H Ser 
oN 
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B \ 57 D102N mutant 
N— 
| / 
56 No H 
Zz H VA 
>o 
A H His?” 
Za 
Asn??? N oN N 


b. Asn™™ of the mutant enzyme can serve only as a hydrogen- 
bond donor to His”. It is unable to act as a hydrogen-bond 
acceptor, as aspartate does in native trypsin. As a result, His” 
is unable to act as a general base in transferring a proton 
from Ser'”. This presumably accounts for the diminished ac- 
tivity of the mutant trypsin. 


. a. The usual explanation for the inhibitory properties of pep- 
statin is that the central amino acid, statine, mimics the tetra- 
hedral amide hydrate transition state of a good pepsin 
substrate with its unique hydroxyl group. 

b. Pepsin and other aspartic proteases prefer to cleave peptide 
chains between a pair of hydrophobic residues, whereas 
HIV-1 protease preferentially cleaves a Tyr-Pro amide bond. 
Because pepstatin more closely fits the profile of a pepsin 
substrate, we would surmise that it is a better inhibitor of 
pepsin than of HIV-1 protease. In fact, pepstatin is a potent 
inhibitor of pepsin (K; < 1 nm) but only a moderately good 
inhibitor of HIV-1 protease, with a Kj of about 1 uM. 


. The enzyme-catalyzed rate is given by: 


v = k [ES] = ke [EX] 


gi- EX! 
: [ES] 
AGS = —RTINK? 


K t= e7 AGE/RT 
[EX}] = KŠ[ES] = ¢-4¢-'/RT[ ES] 


So ke [ES] = ke 4@'/F" [ES] 


= 1 —AG.+/RT 
or ko = kee 


Similarly, for the uncatalyzed reaction: 
k= ky e AGS! /RT 


Assuming that ky = ke 


k ep AGE/RT 
Then = ~=AGE/RT 
k e AGu 


u 


Re (AGt-AG,.2)/RT 
ky 
. This problem is solved best by using the equation derived in 
problem 4. 

ke 


= pAGul—AGel)/RT 


u 


8. 


9. 


10. 


TI, 


12. 


13. 


Using this equation, we can show that the difference in activa- 
tion energies for the uncatalyzed and catalyzed hydrolysis reac- 
tions (AG, — AG) is 92 kJ/mol. 


. Trypsin catalyzes the conversion of chymotrypsinogen to 


a-chymotrypsin, and chymotrypsin itself catalyzes the conversion 
of m-chymotrypsin to a-chymotrypsin. 


. The mechanism suggested by Lipscomb is a general base 


pathway in which Glu?” promotes the attack of water on the 


carbonyl carbon of the substrate: 


Carboxypeptidase 


`g CHR’ 
| “y= 
OH H 
Using the equation derived in problem 4, it is possible to calcu- 
late the ratio ke/ ku as 1.86 X 10”. 


If the concentration of free enzyme is equal to the concentra- 
tion of enzyme-ligand complex, the concentration of ligand 
would be 1 X 107°” M. This corresponds to 1.67 X 10° liters per 
molecule. 


AG = —154 kJ/mol. This value is intermediate between nonco- 
valent forces (H bonds are typically 10-30 kJ/mol) and covalent 
bonds (300-400 kJ/mol). 


Assuming that the rate of gluconeogenesis would be equal to 
the slowest step in the pathway, with k = 2 X 107°°/sec, and as- 
suming a cellular concentration of fructose-1,6-bisphosphatase 
of 0.031 (see Table 18.2), the rate of sugar synthesis would be 

2 X 10°°/sec X 0.031 mM, or 6.2 X 107? mM/sec. Assuming 
a total human cell volume of 40 L, this corresponds to 2.48 Xx 
107* moles glucose/sec. Assuming 30 ATP per glucose (under 
cellular conditions) and 50 kJ/mole of ATP hydrolyzed, we 

find that the rate of energy production is (2.48 X 107%) x 

(30 ATP/glucose) X (30.5 kJ/mole), or 2.269 X 107% kJ/sec. 
Converting to kilocalories and years, we find that the time to 
synthesize the needed 480 kilocalories with an uncatalyzed reac- 
tion would be 2.8 X 10” years, or roughly 200,000 times the life- 
time of the universe so far. 


The correct answer is c. The stomach is a very acidic environ- 
ment, whereas the small intestine is slightly alkaline. The lower 
part of the figure shows that enzyme X has optimal activity near 
pH 2, whereas enzyme Y works best at a pH near 8. 


The correct answer is a. The two enzymes have nonoverlapping 
pH ranges, so it is highly unlikely that they could operate in the 
same place at the same time. 


14. 


15, 


16. 


17. 


18. 


The correct answer is b. Only enzyme A has a temperature 
range that encompasses human body temperature (37°C). 


The correct answer is d. The activities of the two enzymes 
overlap between 40° and 50°C. 


The correct answer is c. We have no information on the pH be- 
havior of enzymes A and B, nor on the behavior of X and Yas a 
function of temperature. The only answer that is appropriate to 
the data shown is c. 


The only possible answer is b, because a “leveling off” implies 
that all the enzyme is saturated with S. 


The correct answer is c. In order to bring the substrate into the 
transition state, an enzyme must enjoy environmental conditions 
that favor catalysis. There is no activity apparent for enzyme Y 
below pH 5.5. 


Chapter 15 


1. 


3. 


a. As [P] rises, the rate of P formation shows an apparent 
decline, as enzyme-catalyzed conversion of P — S becomes 
more likely. 

b. Availability of substrates and cofactors. 

c. Changes in [enzyme] due to enzyme synthesis and 
degradation. 

d. Covalent modification. 

. Allosteric regulation. 

f. Specialized controls, such as zymogen activation, isozyme 
variability, and modulator protein influences. 


oO 


. Proteolytic enzymes have the potential to degrade the proteins 


of the cell in which they are synthesized. Synthesis of these 
enzymes as zymogens is a way of delaying expression of their ac- 
tivity to the appropriate time and place. 

Monod, Wyman, Changeux allosteric system: 


Lineweaver—Burk plot 
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4. Using the curves for negative cooperativity as shown in Figure 


15.8 as a guide: 
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5. For n = 2.8, Yungs = 0.98 and Yeapittaries = 0.77. 


For n = 1.0, Yiungs = 0.79 and Yoapitaries = 0.61. 


Thus, with an n of 2.8 and a Ps of 26 torr, hemoglobin becomes 
almost fully saturated with Oy in the lungs and drops to 77% 
saturation in resting tissue, a change of 21%. If n of 1.0 anda 
Ps of 26 torr, hemoglobin would become only 79% saturated 
with Oy in the lungs and would drop to 61% in resting tissue, a 
change of 18%. The difference in hemoglobin Oy, saturation 
conditions between the values for n = 2.8 and 1.0 (21% — 18%, 
or 3% saturation) seems small, but note that the potential for 
Os» delivery (98% saturation versus 79% saturation) is large and 
becomes crucial when pO» in actively metabolizing tissue falls 
below 40 torr. 


. More glycogen phosphorylase will be in the glycogen phos- 


phorylase a (more active) form, but caffeine promotes the less 
active T conformation of glycogen phosphorylase. 


. Over time, stored erythrocytes will metabolize 2,3-BPG via the 


pathway of glycolysis. If [BPG] drops, hemoglobin may bind O% 
with such great affinity that it will not be released to the tissues 
(see Figure 15.29). The patient receiving a transfusion of 
[BPG]-depleted blood may actually suffocate. 


. a. By definition, when [P;] = Kos, v = 0.5 Vmax 


b. In the presence of AMP, at [P;] = Kos, v = 0.85 Vmax (from 
Figure 15.14c). 

c. In the presence of ATP, at [Pi] = Kos, v = 0.12 Vinax (from 
Figure 15.14b). 


. IfG,-GTPase activity is inactivated, the interaction between G, 


and adenylyl cyclase will be persistent, adenylyl cyclase will be 
active, [CAMP] will rise, and glycogen levels will fall because 
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10. 


11. 


12. 


I3: 


14. 


15. 


glycogen phosphorylase will be predominantly in the active, 
phosphorylated a form. 


An excess of a negatively cooperative allosteric inhibitor could 
never completely shut down the enzyme. Because the enzyme 
leads to several essential products, inhibition by one product 
might starve the cell for the others. 

a. -R(R/K) X(S*/T*)- 

b. -KRKQIAVRGL- 

Ligand binding is the basis of allosteric regulation, and allosteric 
effectors are common metabolites whose concentrations reflect 
prevailing cellular conditions. Through reversible binding of 
such ligands, enzymatic activity can be adjusted to the momen- 
tary needs of the cell. On the other hand, allosteric regulation is 
inevitably determined by the amounts of allosteric effectors at 
any moment, which can be disadvantageous. Covalent modifica- 
tion, like allosteric regulation, is also rapid and reversible, 
because the converter enzymes act catalytically. Furthermore, 
covalent modification allows cells to escape allosteric regulation 
by covalently locking the modified enzyme in an active (or inac- 
tive) state, regardless of effector concentrations. One disadvan- 
tage is that covalent modification systems are often elaborate 
cascades that require many participants. 


Sickle-cell anemia is the consequence of Hb S polymerization 
through hydrophobic contacts between the side chain of ValB6 
and a pocket in the EF corner of 6-subunits. Potential drugs 
might target this interaction directly or indirectly. For example, 
a drug might compete with Val86 side chain for binding in the 
EF corner. Alternatively, a useful drug might alter the conforma- 
tion of Hb S such that the EF corner was no longer accessible or 
accommodating to the Val@6 side chain. Another possibility 
might be to create drugs that deter the polymerization process 
in other ways through alterations in the surface properties 
between Hb S molecules. 


Nitric oxide is covalently attached to Cys938. Thus, the interac- 
tion is not reversible binding, as in allosteric regulation. On the 
other hand, the reaction of NO: with this cysteine residue is ap- 
parently spontaneous, and no converter enzyme is needed to 
add or remove it. Thus, the regulation of covalent modification 
that is afforded by converter enzyme involvement is obviated. 
The Hb:NO: interaction illustrates that nature does not always 
neatly fit the definitions that we create. 

Pro: Lactate is a metabolic indicator of the need for oxygen; it 
binds to Mb at a distinct site (the allosteric site?) and it lowers 
Mb’s affinity for O». 


Con: Mb is a monomeric protein. Traditionally, allosteric phe- 
nomena have been considered the realm of oligomeric proteins. 


16. 


17. 


18. 


How then is Mb “allosteric”? Since a ligand-induced conforma- 
tional change in a monomeric proteins can affect binding of 
“substrate” (i.e., O2), the definition of allostery may need to be 


broadened. 


nf \¥ 







The wedge-shaped protein monomers (red) assemble into 
trimers, but the alternative conformation for the monomer 
(square, green) forms tetramers. The substrate or allosteric reg- 
ulator (yellow) binds only to the square conformation, and its 
binding prevents the square from adopting the wedge confor- 
mation. Thus, if S or the allosteric regulator is present, equilib- 
rium favors a greater population of square tetramers among the 
morpheein ensemble at the expense of round trimers. 
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ATP or CTP 
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Negative cooperativity in NAD* binding to glyceraldehyde-3-P 
dehydrogenase: 


Where affinity for NADH is 
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19. Hyperventilation results in decreased blood pCO», which in turn 


leads to an increase in pH. Thus, the affinity of Hb for Og in- 
creases (less Os will be released to tissues). Hypoventilation has 
exactly the opposite effects on pCO, and pH; thus, hypoventila- 
tion leads to a diminished affinity of Hb for Os (more Oz will be 
released to the tissues). 


20. When hormone disappears, the hormone: receptor complex dis- 


sociates, the GTPase activity of the G, subunit cleaves bound 
GTP to GDP + P;, and the affinity of G,-GDP for adenylyl 
cyclase is low, so it dissociates and adenylyl cyclase is no longer 
activated. Residual cAMP will be converted to 5'-AMP by phos- 
phodiesterase, and the catalytic subunits of protein kinase A will 
be bound again by the regulatory subunits, whose cAMP ligands 
have dissociated. When protein kinase A becomes inactive, phos- 
phorylase kinase will revert to the unphosphorylated, inactive 
form through loss of phosphoryl groups. Phophoprotein phos- 
phatase 1 will act on glycogen phosphorylase a, removing the 
phosphoryl group from Ser! and thereby converting glycogen 
phosphorylase to the less active, allosterically regulated b form. 


Chapter 16 


1. The pronghorn antelope is truly a remarkable animal, with nu- 


merous specially evolved anatomical and molecular features. 
These include a large windpipe (to draw in more oxygen and 
exhale more carbon dioxide), lungs that are three times the size 
of those of comparable animals (such as goats), and lung alveoli 
with five times the surface area so that oxygen can diffuse more 
rapidly into the capillaries. The blood contains larger numbers 
of red blood cells and thus more hemoglobin. The skeletal and 
heart muscles are likewise adapted for speed and endurance. 
The heart is three times the size of that of comparable animals 
and pumps a proportionally larger volume of blood per contrac- 
tion. Significantly, the muscles contain much larger numbers of 
energy-producing mitochondria, and the muscle fibers them- 
selves are shorter and thus designed for faster contractions. All 
these characteristics enable the pronghorn antelope to run at a 
speed nearly twice the top speed of a thoroughbred racehorse 
and to sustain such speed for up to 1 hour. 


. Refer to Figure 16.9. The step in which the myosin head confor- 
mation change occurs, is the step that should be blocked by 
8,y-methylene-ATP, because hydrolysis of ATP should occur in 
this step and B,y-methylene-ATP is nonhydrolyzable. 


. Phosphocreatine is synthesized from creatine (via creatine 
kinase) primarily in muscle mitochondria (where ATP is readily 
generated) and then transported to the sarcoplasm, where it 
can act as an ATP buffer. The creatine kinase reaction in the sar- 
coplasm yields the product creatine, which is transported back 
into the mitochondria to complete the cycle. Like many mito- 
chondrial proteins, the expression of mitochondrial creatine 
kinase is directed by mitochondrial DNA, whereas sarcoplasmic 
creatine kinase is derived from information encoded in nuclear 
DNA. 


. Note in step 3 of Figure 16.9 that it is ATP that stimulates disso- 
ciation of myosin heads from the actin filaments—the dissocia- 
tion of the cross-bridge complex. When ATP levels decline (as 
happens rapidly after death), large numbers of myosin heads 
are unable to dissociate from actin and the muscle becomes stiff 
and unable to relax. 


. The skeletal muscles of the average adult male have a cross- 
sectional area of approximately 35,000 cm?. The gluteus 
maximus muscles represent approximately 300 cm? of this total. 
Assuming 4 kg of maximal tension per square centimeter of 


11. 


12. 


13. 


14. 
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cross-sectional area, one calculates a total tension for the gluteus 
maximus of 1200 kg (as stated in the problem). The same calcu- 
lation shows that the total tension that could be developed by all 
the muscles in the body is 140,000 kg (or 154 tons)! 


. Taking 55,000 g/mol divided by 6.02 X 10°*/mol and by 


1.3 g/mL, one obtains a volume of 7.03 X 107% mL. Assume a 
sphere and use the volume of a sphere (V = (4/3) r°) to 
obtain a radius of 25.6 A. The diameter of the tubulin dimer 
(see Figure 16.12) is 8 nm, or 80 A, and two times the radius we 
calculated here is approximately 51 A, a reasonable value by 
comparison. 


. A liver cell is 20,000 nm long, which would correspond to 5000 


tubulin monomers if using the value of Figure 16.12, and about 
7800 tubulin monomers using the value of 25.6 A calculated in 
problem 6. 


. 4inches (length of the giant axon) = 10.16 cm. Movement at 


2 to 5 w/sec would correspond to a time of 5.6 to 14 hours to 
traverse this distance. 


. 14 nm is 140 A, which would be approximately 93 residues of a 


coiled coil. 


. Using Equation 9.1, one can calculate a AG of 17,100 J/mol for 


this calcium gradient. 


Using Equation 3.13, one can calculate a cellular AG of 
—48,600 J/mol for ATP hydrolysis. 


17,100 J are required to transport 1 mole of calcium ions. Two 
moles of calcium would require 34,200 J, and three would cost 
51,300 J. Thus, the gradient of problem 10 would provide 
enough energy to drive the transport of two calcium ions per 
ATP hydrolyzed. 


Energy (or work) = force X distance. If 1 ATP is hydrolyzed 
per step and the cellular value of A G°’ for ATP hydrolysis is 
—50 kJ/mol, then the calculation of force exerted by a motor is 
force = 50 kJ/mol/ (step size). The SI unit of force is the 
newton, and 1 newton : meter = 1 J. 


For the kinesin-1 motor, 50 kJ/mol/(8 X 107° m) = 6.25 X 
10'* newtons. 


For the myosin-V motor, 50 kJ/mol/(36 X 107° m) = 1.39 x 
10!? newtons. 


For the dynein motor, 50 kJ/mol/(28 X 107? m) = 1.78 X 
10!? newtons. 


Perhaps the simplest way to view this problem is to consider 
the potential energy change for lifting a 10-kg weight 0.4 m. 
E = mgh = (10 kg) - (9.8 m/sec?) - (0.4 m) = 39.2 J. Now, if 
1 ATP is expended per myosin step along an actin filament, 
39.2 J/ (50 kJ/mol) = 7.8 X 10-4 mol. 


Then, (7.8 X 1074 mol) - (6.02 X 107°) = 4.72 X 10% molecules 
of ATP expended, one per step. So there must be 4.72 X 10% 
myosin steps along actin. However, the stepping process is 
almost certainly not 100% efficient. As shown in the box on 
page 489, a step size of 11 nm against a force of 4 pN would cor- 
respond to an energy expended per myosin per step of 4.4 X 
107% J. With this assumed energy expended per step, we can 
calculate 39 J/ (4.4 X 107% J) = 8.86 X 10% steps total. 


If we take the step size value for skeletal myosin from the box 
on page 489 as 11 nm, one myosin head would have to take 

0.4 m/11 X 10°°m or 3.64 X 10” steps to raise the weight a dis- 
tance of 0.4 m. Taking (8.86 xX 10% steps total) / (3.64 x 107 
steps per myosin) = 2.43 X 10!° myosin heads involved per 
11-nm step. 
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15. 


16. 
17. 


18. 


19. 


20. 


All these are smooth muscle except for the diaphragm. (See 
hitp://chanteur.net/contribu/index. him#http://chanteur.net/contribu/ 
c[Mdiaph.htm for an explanation of the common misconception 
that the diaphragm is smooth muscle.) 


1, a; 2, e; 3, d; 4, c; 5, b. 


This exercise is left to the student and will presumably be differ- 
ent for every student. 


The 80 A step of the kinesin motor is a 0.008 um step. To cover 
10.16 cm would require 12.7 million steps. 


The correct answer is d, although each answer is reasonable. 
ATP is needed for several processes involved in muscle relax- 
ation. Salt imbalances can also prevent normal muscle function, 
and interrupted blood flow could prevent efficient delivery of 
oxygen needed for ATP production during cellular respiration. 


The correct answer is a. Understanding the inheritance pattern 
of sex-linked traits is essential. Males always inherit sex-linked 
traits (on the X chromosome) from their mothers. In addition, 
sex-linked traits are much more commonly expressed in males 
because they have only one X chromosome. 


Chapter 17 


1. 
. Consult Table 17.2. 
x Oo, HO, and COs. 
. See Section 17.2. 


a OF BRB OO PDO 


10. 


11. 


12. 


13. 


6.5 X 10” (6.5 trillion) people. 


. Consult Figure 17.8. 
. Consult Corresponding Pathways of Catabolism and Anabolism Differ 


in Important Ways, p. 520. See also Figure 17.8. 


. See The ATP Cycle, p. 521; NAD* Collects Electrons Released in Ca- 


tabolism, p. 522; and NADPH Provides the Reducing Power for Ana- 
bolic Processes, p. 523. 


. In terms of quickness of response, the order is allosteric regula- 


tion > covalent modification > enzyme synthesis and degrada- 
tion. See The Student Solutions Manual, Study Guide and Problems 
Book for further discussions. 


See Metabolic Pathways Are Compartmentalized within Cells, p. 527, 
and discussions in The Student Solutions Manual, Study Guide and 
Problems Book. 


Many answers are possible here. Some examples: Large numbers 
of metabolites in tissues and fluids; great diversity of structure of 
biomolecules; need for analytical methods that can detect and 
distinguish many different metabolites, often at very low concen- 
trations; need to understand relationships between certain 
metabolites; time and cost required to analyze and quantitate 
many metabolites in a tissue or fluid. 


Mass spectrometry provides great sensitivity for detection of 
metabolites. NMR offers a variety of methods for resolving and 
discriminating metabolites in complex mixtures. Other compar- 
isons and contrasts are discussed in the references at the end of 
the chapter. 


See Figure 13.23; The Coenzymes of the Pyruvate Dehydroge- 
nase Complex, page 568, and the Activation of Vitamin By», 
page 712. 


The genome is the entire hereditary information in an organism, 
as encoded in its DNA or (for some viruses) RNA. The transcrip- 
tome is the set of all messenger RNA molecules (transcripts) pro- 
duced in one cell or a population of cells under a defined set of 


14. 


15. 


16. 


17. 


18. 


19. 


conditions. The proteome is the entire complement of proteins 
produced by a genome, cell, tissue, or organism under a defined 
set of conditions. The metabolome is the complete set of low 
molecular weight metabolites present in an organism or excreted 
by it under a given set of physiological conditions. 


A mechanism for liver alcohol dehydrogenase: 
H 
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TCA cycle: in mitochondria; converts acetate units to CO, plus 
NADH and FADHs». 


Glycolysis: in cytosol; converts glucose to pyruvate. 


Oxidative phosphorylation: in mitochondria; uses electrons 
(from NADH and FADHs) to produce ATP. 


Fatty acid synthesis: in cytosol; uses acetate units to synthesize 
fatty acids. 

Most discussions of ocean sequestration of CO», are devoted to 
capture of CO, by phytoplankton or injection of CO, deep in 
the ocean. However, the box on page 512 offers another 
possibility—carbon sequestration in the shells of corals, mol- 
lusks, and crustaceans. This approach may be feasible. However, 
there are indications that these classes of sea organisms are 
already suffering from global warming and from pollution of 
ocean water. 


3P and *°S both decay via beta particle emission. A beta particle 
is merely an electron emitted by a neutron in the nucleus. Thus, 
beta decay does not affect the atomic mass, but it does convert a 
neutron to a proton. Thus, beta emission changes **P to *°S and 
converts *°S to *°Cl: 

32P—> B- + 2%, ty = 14.3 days. 

35S —> B~ + Cl, tis = 87.1 days. 
The decay equation is a first-order decay equation: 

A/Ag = €79:599/ty, 

Thus, after 100 days of decay, the fraction of **P remaining 
would be 0.786% and the fraction of *°S remaining would 
be 45%. 


The correct answer is b. Obligate anaerobes can survive only in 
the absence of oxygen. 


The correct answer is a. Fiber provides little or no nutrition. 
The foods containing fiber, however, have other nutritional sub- 
stances that we are able to digest and absorb. 


Chapter 18 
I; 


a. Phosphoglucoisomerase, fructose bisphosphate aldolase, 
triose phosphate isomerase, glyceraldehyde-3-P dehydroge- 
nase, phosphoglycerate mutase, and enolase. 

b. Hexokinase/glucokinase, phosphofructokinase, phospho- 
glycerate kinase, pyruvate kinase, and lactate dehydrogenase. 

c. Hexokinase and phosphofructokinase. 

d. Phosphoglycerate kinase and pyruvate kinase. 


10. 


TI. 


12. 


e. According to Equation 3.13, reactions in which the number 
of reactant molecules differs from the number of product 
molecules exhibit a strong dependence on concentration. 
That is, such reactions are extremely sensitive to changes 
in concentration. Using this criterion, we can predict from 
Table 19.1 that the free energy changes of the fructose bis- 
phosphate aldolase and glyceraldehyde-3-P dehydrogenase 
reactions will be strongly influenced by changes in 
concentration. 

f. Reactions that occur with AG near zero operate at or near 
equilibrium. See Table 18.1 and Figure 18.22. 


. The carboxyl carbon of pyruvate derives from carbons 3 and 


4 of glucose. The keto carbon of pyruvate derives from carbons 
2 and 5 of glucose. The methyl carbon of pyruvate is obtained 
from carbons 1 and 6 of glucose. 


. Increased [ATP], [citrate], or [glucose-6-phosphate] inhibits 


glycolysis. Increased [AMP], [fructose-1,6-bisphosphate], or 
[fructose-2,6-bisphosphate] stimulates glycolysis. 


. See The Student Solutions Manual, Study Guide and Problems Book 


for discussion. 


. The mechanisms for fructose bisphosphate aldolase and 


glyceraldehyde-3-P dehydrogenase are shown in Figures 18.12 
and 18.13, respectively. 


. The relevant reactions of galactose metabolism are shown in 


Figure 18.24. See The Student Solutions Manual, Study Guide and 
Problems Book for mechanisms. 


. Iodoacetic acid would be expected to alkylate the reactive active- 


site cysteine that is vital to the glyceraldehyde-3-P dehydrogenase 
reaction. This alkylation would irreversibly inactivate the 


enzyme. 


. Ignoring the possibility that *°P might be incorporated into ATP, 


the only reaction of glycolysis that utilizes P; is glyceraldehyde- 
3-P dehydrogenase, which converts glyceraldehyde-3-P to 
1,3-bisphosphoglycerate. **P; would label the phosphate at 
carbon 1 of 1,3-bisphosphoglycerate. The label will be lost in 
the next reaction, and no other glycolytic intermediates will be 
directly labeled. (Once the label is incorporated into ATP, it 
will also show up in glucose-6-P, fructose-6-P, and fructose-1, 
6-bisphosphate.) 


. The sucrose phosphorylase reaction leads to glucose-6-P without 


the need for the hexokinase reaction. Direct production offers 
the obvious advantage of saving a molecule of ATP. 


All of the kinases involved in glycolysis, as well as enolase, are 
activated by Mg?* ion. A Mg?* deficiency could lead to reduced 
activity for some or all of these enzymes. However, other sys- 
temic effects of a Mg?* deficiency might cause even more 
serious problems. 


a. 7. 
b. 0.0266 mM. 


+1.08 kJ/mol. 


13. 


14. 


15. 


16. 
17. 


18. 
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. —13.8 kJ/mol. 
Keq = 194,850. 
[Pyr]/[PEP] = 24,356. 


—13.8 kJ/mol. 

Keg = 211. 

[FBP] /[F-6-P] = 528. 

—19.1 kJ/mol. 

Keq = 1651. 

[ATP]/[ADP] = 13.8. 

AG” = —33.9 kJ/mol. 

An 8% increase in [ATP] changes the [AMP] concentration to 
20 uM (from 5 uM). 
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UDPGlc Glc-1-P Gal-1-P UDPGal 
ky ky ko ks ky k4 ke ke 
ko ks, 

E E e UDPGIc ——— E-UMP e Glc-1-P E-UMP E-UMP e° Gal-l-P <~ E-UDPGal E 
ko k_s 


Pyruvate kinase deficiency primarily affects red blood cells, 
which lack mitochondria and can produce ATP only via glycoly- 
sis. Absence of pyruvate kinase reduces the production of ATP 
by glycolysis in red cells, which in turn reduces the activity of the 
Na,K-ATPase (see Chapter 9). This results in alterations of elec- 
trolyte (Na* and K*) concentrations, which leads to cellular dis- 
tortion, rigidity, and dehydration. Compromised red blood cells 
are destroyed by the spleen and liver. 


The best answer is d. PFK is more active at low ATP than at high 
ATP, and F-2,6-bisP activates the enzyme. 


Hexokinase is inhibited by high concentrations of glucose-6-P, 
so glycolysis would probably stop at such a high level of glucose- 
6-P. Also, the increased concentration of glucose-6-P would 
change the cellular AG of the hexokinase reaction to approxi- 
mately —22,000 J/mol. 


Chapter 19 


1. 


Glutamate enters the TCA cycle via a transamination to form 
a-ketoglutarate. The y-carbon of glutamate entering the cycle is 
equivalent to the methyl carbon of an entering acetate. Thus, 
no radioactivity from such a label would be lost in the first or 
second cycle, but in each subsequent cycle, 50% of the total 
label would be lost (see Figure 19.15). 


. NAD* activates pyruvate dehydrogenase and isocitrate dehydro- 


genase and thus would increase TCA cycle activity. If NAD* 
increases at the expense of NADH, the resulting decrease in 
NADH would likewise activate the cycle by stimulating citrate 
synthase and a-ketoglutarate dehydrogenase. ATP inhibits pyru- 
vate dehydrogenase, citrate synthase, and isocitrate dehydroge- 
nase; reducing the ATP concentration would thus activate the 
cycle. Isocitrate is not a regulator of the cycle, but increasing its 
concentration would mimic an increase in acetate flux through 
the cycle and increase overall cycle activity. 


. For most enzymes that are regulated by phosphorylation, the co- 


valent binding of a phosphate group at a distant site induces a 
conformation change at the active site that either activates or in- 
hibits the enzyme activity. On the other hand, X-ray crystallo- 
graphic studies reveal that the phosphorylated and 
unphosphorylated forms of isocitrate dehydrogenase share iden- 
tical structures with only small (and probably insignificant) con- 
formation changes at Ser!!*, the locus of phosphorylation. What 
phosphorylation does do is block isocitrate binding (with no 
effect on the binding affinity of NADP*). As shown in Figure 2 
of the paper cited in the problem (Barford, D., 1991. Molecular 
mechanisms for the control of enzymic activity by protein phos- 
phorylation. Biochimica et Biophysica Acta 1133:55-62), the 
y-carboxyl group of bound isocitrate forms a hydrogen bond 
with the hydroxyl group of Ser'!®. Phosphorylation apparently 
prevents isocitrate binding by a combination of a loss of the 
crucial H bond between substrate and enzyme and by repulsive 
electrostatic and steric effects. 


4. A mechanism for the first step of the a-ketoglutarate dehydroge- 


nase reaction: 


CH,—CH,—COO ~ CH,—CH,—COO ~ 
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Covalent TPP intermediate 


. Aconitase is inhibited by fluorocitrate, the product of citrate syn- 


thase action on fluoroacetate. In a tissue where inhibition has 
occurred, all TCA cycle metabolites should be reduced in con- 
centration. Fluorocitrate would replace citrate, and the concen- 
trations of isocitrate and all subsequent metabolites would be 
reduced because of aconitase inhibition. 


. FADH; is colorless, but FAD is yellow, with a maximal ab- 


sorbance at 450 nm. Succinate dehydrogenase could be conve- 
niently assayed by measuring the decrease in absorbance in a 
solution of the flavoenzyme and succinate. 


. The central (C-3) carbon of citrate is reduced, and an adjacent 


carbon is oxidized in the aconitase reaction. The carbon 
bearing the hydroxyl group is obviously oxidized by isocitrate 
dehydrogenase. In the a-ketoglutarate dehydrogenase reaction, 
the departing carbon atom and the carbon adjacent to it are 
both oxidized. Both of the —CH»— carbons of succinate are 
oxidized in the succinate dehydrogenase reaction. Of these four 
molecules, all but citrate undergo a net oxidation in the TCA 
cycle. 


. Several TCA metabolite analogs are known, including malonate, 


an analog of succinate, and 3-nitro-2-S-hydroxypropionate, the 
anion of which is a transition-state analog for the fumarase 
reaction. 
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9. A mechanism for pyruvate decarboxylase: 
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10. [isocitrate]/ [citrate] = 0.1. When [isocitrate] = 0.03 mM, 
[citrate] = 0.3 mM. 

11. 14CO, incorporation into TCA via the pyruvate carboxylase reac- 
tion would label the —CH2—COOH carboxyl carbon in oxalo- 
acetate. When this entered the TCA cycle, the labeled carbon 
would survive only to the a-ketoglutarate dehydrogenase reac- 
tion, where it would be eliminated as !*COs. 


14. O 


Glyoxalate 


L 





Acetyl-CoA 


CH, 
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12. 


13. 





o 
Il 


R" 
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14C incorporated in a reversed TCA cycle would label the two 
carboxyl carbons of oxaloacetate in the first pass through the 
cycle. One of these would be eliminated in its second pass 
through the cycle. The other would persist for more than two 
cycles and would be eliminated slowly as methyl carbons in 
acetyl-CoA in the reversed citrate synthase reaction. 

The labeling pattern would be the same as if methyl-labeled 
acetyl-CoA was fed to the conventional TCA cycle (see 

Figure 19.15). 
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16. For the malate dehydrogenase reaction, 
Malate + NAD+ == oxaloacetate + NADH + H* 
with the value of AG” being +30 kJ/mol. Then 


AG” = -RTIn Kaq 
= — (8.314 J/mol-K) 0m ae | 
~30,000 J/mol [20] [2.2 X 1074] 
~ 9478 J/mol = In (x/4.4 X 1074) 
—12.1 = In (x/4.4 X 1073) 
x= —RTIn Ka 
x = [oxaloacetate] = 0.024 uM 


Using the dimensions given in the problem, one can calculate 

a mitochondrial volume of 1.57 X 107! L. (0.024 x 107° M)- 
(1.57 X 107! L): (6.02 X 10° molecules/mole) = 14.4, or about 
14 molecules of OAA in a mitochondrion (“pOAA” = 7.62). 


17. This exercise is left to the student. Review of the calculation of 
oxidation numbers should be a prerequisite for answering this 
problem. 


18. This exercise is left to the student. 


19. 2R, 3R-fluorocitrate is converted by aconitase to 2-fluoro-cis- 
aconitate. This intermediate then rotates 180 degrees in the 
active site. Addition of hydroxide at the C4 position is followed 
by double bond migration from C3-C4 to C2-C3, to form 
4-hydroxy-trans-aconitate. This product remains tightly bound 
at the aconitase active site, inactivating the enzyme. Studies by 
Trauble et al. have shown that the inhibitory product can be dis- 
placed by a 10°-fold excess of isocitrate. 





Mitochondrial 
membrane 


20. Only eight ATPs would be generated in the succinyl-CoA synthe- 
tase (and nucleoside diphosphate kinase) reaction of the TCA 
cycle itself. 


21. d is false. Succinyl-CoA is an inhibitor of citrate synthase. 


Chapter 20 


1. The cytochrome couple is the acceptor, and the donor is the 
(bound) FAD/FADH, couple, because the cytochrome couple 
has a higher (more positive) reduction potential. 


AG,’ =0.254 V—0.02 V* 
*This is a typical value for enzyme-bound [FAD]. 
AG= —n¥A€,’ = —43.4 kJ/mol. 
2. AS,’ = —0.03 V; AG = +5790 J/mol. 


. This situation is analogous to that described in Section 3.3. The 
net result of the reduction of NAD* is that a proton is con- 
sumed. The effect on the calculation of free energy change is 
similar to that described in Equation 3.26. Adding an appropri- 
ate term to Equation 20.12 yields: 


é = C — RT\In [H*] + (RT/nF In ([ox]/[red]) 


4. Cyanide acts primarily via binding to cytochrome a3, and the 
amount of cytochrome as in the body is much lower than the 
amount of hemoglobin. Nitrite anion is an effective antidote for 
cyanide poisoning because of its unique ability to oxidize ferro- 
hemoglobin to ferrihemoglobin, a form of hemoglobin that 
competes very effectively with cytochrome as for cyanide. The 
amount of ferrohemoglobin needed to neutralize an otherwise 


10. 


11. 


12. 


13. 


14. 


lethal dose of cyanide is small compared with the total amount 
of hemoglobin in the body. Even though a small amount of 
hemoglobin is sacrificed by sequestering the cyanide in this 
manner, a “lethal dose” of cyanide can be neutralized in this 
manner without adversely affecting oxygen transport. 


. You should advise the wealthy investor that she should decline 


this request for financial backing. Uncouplers can indeed 
produce dramatic weight loss, but they can also cause death. 
Dinitrophenol was actually marketed for weight loss at one time, 
but sales were discontinued when the potentially fatal nature of 
such “therapy” was fully appreciated. Weight loss is best achieved 
by simply making sure that the number of calories of food eaten 
is less than the number of calories metabolized. A person who 
metabolizes about 2000 kJ (about 500 kcal) more than he or she 
consumes every day will lose about a pound in about 8 days.) 


. The calculation in Section 20.4 assumes the same conditions as 


in this problem (1 pH unit gradient across the membrane and 
an electrical potential difference of 0.18 V). The transport of 

1 mole of H* across such a membrane generates 23.3 kJ of 
energy. For three moles of H*, the energy yield is 69.9 kJ. Then, 
from Equation 3.13, we have: 


69,900 J/mol = 30,500 J/mol + RTIn([ATP]/[ADP] [P;]) 
39,400 J/mol = RT In ([ATP]/[ADP] [P;]) 
[ATP]/[ADP] [P;] = 4.36 X 10° M~! at 37°C. 


In the absence of the proton gradient, this same ratio is 7.25 X 
1078 M~! at equilibrium! 


. The succinate/fumarate redox couple has the highest (that is, 


most positive) reduction potential of any of the couples in gly- 
colysis and the TCA cycle. Thus, oxidation of succinate by NAD* 
would be very unfavorable in the thermodynamic sense. 


Oxidation of succinate by NAD*: 
A,’ ~ —0.35 V; AG” ~ 67,500 J/mol. 


On the other hand, oxidation of succinate is quite feasible using 
enzyme-bound FAD. 


Oxidation of succinate by [FAD]: 


AZ,’ ~ 0, AG” ~ 0, depending on the exact reduction 
potential for bound FAD. 


a. —73.3 kJ/mol. 

b. Keq = 7-1 X 102, 
[ATP] /[ADP] = 19.7. 
—151.5 kJ/mol. 

Keq = 3.54 X 10°, 
[ATP]/[ADP] = 8.8. 
—219 kJ/mol. 

Keg = 2.63 X 10”. 
[ATP]/[ADP] = 54.4. 


—217 kJ/mol. 
Kaq = 1.08 X 10°, 
[ATP]/ [ADP] = —3479. 


—26 kJ/mol. 

Keg = 3.68 X 104. 
[NAD*]/[NADH] = 1.48. 
Keg = 4.8 x10°. 

+48 kJ/mol. 
[ATP]/[ADP] = 2.3. 


This answer should be based on the information on pages 
603-606 and should be in the student’s own words. 
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17. 


18. 


19. 


20. 


A-23 
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. This answer should be based on the information on pages 


606-609 and should be in the student’s own words. 


. Several mechanisms are possible. See, for example, the mecha- 


nisms in the following references: Pryor, W. A., and Porter, 

N. A., 1990. Suggested mechanisms for the production of 
4-hydroxy-2-nonenal from the autoxidation of polyunsaturated 
fatty acids. Free Radical Biology and Medicine 8:541-543; and Niki, 
E., Yoshida, Y., Saito, Y., and Noguchi, N., 2005. Lipid peroxida- 
tion: Mechanisms, inhibition, and biological effects. Biochemical 
and Biophysical Research Communications 338:668-676. 


A calculation similar to that in problem 16 in Chapter 19 yields 
about 15 H* in a typical mitochondrion. 


In the succinate dehydrogenase reaction, a two-electron transfer 
reaction (the conversion of succinate to fumarate) must be 
coupled to iron-sulfur centers, which can only transfer one 
electron at a time. FAD can carry on both one-electron and 
two-electron transfers and is thus ideally suited for this 

reaction. 


The answer is a. The mitochondrial ATPase reaction proceeds 
via nucleophilic Sy2-type substitution. 


At 298 K, doubling the proton gradient would change AG by 
1717 J/mole. The mitochondrial membrane potential would 
have to change by approximately 18 mV. The student should 
decide whether this change should be positive or negative. 


Chapter 21 


l. 


Efficiency = A G” / (light energy input) = n¥A€,’/(Nhc/dA) = 
0.564. 


. AG" = —n#AE,’, so AG" /—nF¥ = A. n= 4 for 2 HO —> 


4e +4H* + Ov. AG,’ = (€,’ (primary oxidant) — 
8E (E O2/H.O)) = +0.0648 V. Thus, €,’ (primary oxidant) = 
0.881 V. 


. AG for ATP synthesis = +50,336 J/mol. AG,’ = —0.34 V. 
. Reduced plastoquinone = PQHs; oxidized plastoquinone = PQ; 


Reduced plastocyanin = PC(Cu*); plastocyanin = PC(Cu**): 
PQH» +2 H" stroma ae PC(Cu?*) — 
PQ +2 PC(Cu*) +4 H” thylakoia lumen 


. Noncyclic photophosphorylation: 2.57 ATP would be synthe- 


sized from 8 quanta, which equates to 3.11 hv/ATP. Cyclic 
photophosphorylation theoretically yields 2 H*t/hv, so if 14 H* 
yield 3 ATP, 7 hv yield 3 ATP, which equates to 2.33 hv/ATP. 


. In mitochondria, H* translocation leads to a decline in inter- 


membrane space pH and hence cytosolic pH, because the 
outer mitochondrial membrane is permeable to protons. Thus, 
the eukaryotic cytosol pH is at risk from mitochondrial proton 
translocation. So, mitochondria rely on a greater membrane 
potential (Aw) and a smaller ApH to achieve the same proton- 
motive force. Because proton translocation in eukaryotic pho- 
tosynthesis deposits H* into the thylakoid lumen, the cytosol 
does not experience any pH change and ApH is nota 
problem. Moreover, the light-induced efflux of Mg?* from the 
lumen diminishes Ay across the thylakoid membrane, so a 
greater contribution of ApH to the proton-motive force is 
warranted. 


. Replacement of Tyr by Phe would greatly diminish the possibil- 


ity of e` transfer between water and P6807, limiting the ability of 
P680* to regain an e~ and return to the P680 ground state. 


A-24 Abbreviated Answers to Problems 


8. Assuming 12 c-subunits means 12 H* are needed to drive one 
turn of the c-subunit rotor and the synthesis of 3 ATP by the 
CF; part of the ATP synthase. If the R. viridis cytochrome bc, 
complex drives the translocation of 2 H*/e™, then 2 hv gives 
4 H*, and 6 hv gives 12 H* and thus 3 ATP (thus, 2 hv yield 
1 ATP). 


9. AG? = —nFAE,! = —2(96,485 J/volt - mol) 
(8€, (1,3-BPG/Gal3P) — €,/(NADP*/NADPH)) = 
— (192,970 J/volt - mol) (—0.29 V — (—0.32 V)) = —192,970 
(0.03) J/mol = —5,789 J/mol. 


10. Use the first eight reactions in Table 21.1 to show that: 


CO, + Ru-1,5-BP + 3 HO + 2 ATP + 2 NADPH + 2 H* —> 
glucose + 2 ADP + 4 P; + 2 NADP+ 

11. Radioactivity will be found in C-1 of 3-phosphoglycerate; C-3 
and C-4 of glucose; C-1 and C-2 of erythrose-4-P; C-3, C-4, 
and C-5 of sedoheptulose-1,7-bisP; and C-1, C-2, and C-3 of 
ribose-5-P. 

12. Light induces three effects in chloroplasts: (1) pH increase in 
the stroma, (2) generation of reducing power (as ferredoxin), 
and (3) Mg** efflux from the thylakoid lumen. Key enzymes in 
the Calvin—Benson CO, fixation pathway are activated by one or 
more of these effects. In addition, rubisco activase is activated 
indirectly by light, and, in turn, activates rubisco. 





13. The following series of reactions accomplishes the conversion of 
2-phosphoglycolate to 3-phosphoglycerate: 
1.2 phosphoglycolate + 2 H:O —> 2 glycolate + 2 P; 
2.2 glycolate + 2 Os —> 2 glyoxylate + 2 HO» 
3.2 glyoxylate + 2 serine —> 2 hydroxypyruvate + 2 glycine 
4.2 glycine —> serine + CO, + NHs 


5. hydroxypyruvate + glutamate —> serine + a-ketoglutarate 
(an aminotransferase reaction) 


6.a-ketoglutarate + NH; + NADH* + H* —> glutamate + 
HO + NAD* (the glutamate dehydrogenase reaction) 


7. hydroxypyruvate + NADH* + H+ —> glycerate + NAD+ 
8. glycerate + ATP —> 3-phosphoglycerate + ADP 








Net: 2 phosphoglycolate + 2 NADH* + 2 H* + ATP + 2 Os 4 
HO —> 3-phosphoglycerate + CO, + 2 P; + ADP + 2 H»O» 
+ 2 NAD* 





14. Considering the reactions involving water separately: 
1.ATP synthesis: 18 ADP + 18 P; —> 18 ATP + 18 H,O 
2.NADP* reduction and the photolysis of water: 

12 H:O +12 NADP* —> 12 NADPH + 12 H* + 6 Og 


3. Overall reaction for hexose synthesis: 


6 CO, + 12 NADPH + 12 H+ + 18 ATP + 12 HO —> 
glucose + 12 NADP* + 6 O + 18 ADP + 18 P; 





Net: 6 COs + 6 HO —> glucose + 6 Og 


Of the 12 waters consumed in Oy production in reaction 2 and 
the 12 waters consumed in the reactions of the Calvin—Benson 
cycle (reaction 3), 18 are restored by H,O release in phosphoric 
anhydride bond formation in reaction 1. 


15. a. If the number of c-subunits increases, the Ht /ATP ratio will 
increase. 
b. If the number of c-subunits increases, Ap can be smaller. 
(More protons moving down a shallower Ap will yield the 
same overall AG for ATP synthesis.) 


16. a. 870-nm light: AG = —192 kJ/mol 
b. 700-nm light: AG = —259 kJ/mol 
c. 680-nm light: AG = —269 kJ/mol 


17. AG = +35.1 KJ 
18. AG = —950 kJ 


19. Ideally, accessory light-harvesting pigments would absorb visible 
light of wavelengths that the chlorophylls do not absorb, that is, 
light in the 470-620 nm wavelength range. 


Thylakoid 
space 


—— 
Dark Light Dark 


20. 


[Mg?*] 





Chapter 22 


1. The reactions that contribute to the equation on page 668 are: 
2 Pyruvate + 2 H* + 2 HO —> glucose + Op 
2 NADH + 2 H* + O. —> 2 NAD* + 2 HO 
4 ATP + 4 HO —> 4 ADP + 4P; 
2 GTP + 2 H:O —> 2 GDP+ 2P; 


Summing these four reactions produces the equation on 
page 668. 





2. This problem essentially involves consideration of the three 
unique steps of gluconeogenesis. The conversion of PEP to 
pyruvate was shown in Chapter 18 to have a AG” of —31.7 
kJ/mol. For the conversion of pyruvate to PEP, we need only 
add the conversion of a GIP to GDP + P; (equivalent to ATP to 
ADP + P;) to the reverse reaction. Thus: 


Pyruvate —> PEP: AG” = +31.7 kJ/mol — 30.5 kJ/mol = 
1.2 kJ/mol 


Then, using Equation 3.13: 


AG= 1.2 kJ/mol + RT In ([PEP][ADP]2[Pi] /[Py] [ATP]2) 
AG (in erythrocytes) = —30.1 kJ/mol 


In the case of the fructose-1,6-bisphosphatase reaction, A G” = 
—16.3 kJ/mol (see Equation 18.6). 


AG= —16.7 kJ/mol + RT In ([F-6-P] [P;]/[F-1,6-BP]) 
AG (in erythrocytes) = —36.5 kJ/mol 


For the glucose-6-phosphatase reaction, AG” = —13.9 kJ/mol 
(see Table 3.3). AG = —13.9 kJ/mol + RT In ([Glu] [P;]/ 
[G-6-P]) = —21.1 kJ/mol. From these AG values and those in 
Table 18.1, AG for gluconeogenesis = —87.7 kJ/mol. 


3. Inhibition by 25 uM fructose-2,6-bisphosphate is approximately 
94% at 25 uM fructose-1,6-bisphosphate and approximately 44% 
at 100 uM fructose-1,6-bisphosphate. 


4. The hydrolysis of UDP-glucose to UDP and glucose is character- 
ized by a AG” of —31.9 kJ/mol. This is more than sufficient to 
overcome the energetic cost of synthesizing a new glycosidic 
bond in a glycogen molecule. The net AG°’ for the glycogen 
synthase reaction is —13.3 kJ/mol. 


5. According to Table 24.1, a 70-kg person possesses 1920 kJ of 
muscle glycogen. Without knowing how much of this is in fast- 


twitch muscle, we can simply use the fast-twitch data from A 
Deeper Look (page 680) to calculate a rate of energy consump- 
tion. The plot on page 680 shows that glycogen supplies are ex- 
hausted after 60 minutes of heavy exercise. Ignoring the 
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8. a. Increased fructose-1,6-bisphosphate would activate pyruvate 
kinase, stimulating glycolysis. 
b. Increased blood glucose would decrease gluconeogenesis 
and increase glycogen synthesis. 


curvature of the plot, 1920 kJ of energy consumed in 60 minutes c. Increased blood insulin inhibits gluconeogenesis and stimu- 


corresponds to an energy consumption rate of 533 J/sec. 


6. Although other inhibitory processes might also occur, enzymes 


with mechanisms involving formation of Schiff base intermedi- 
ates with active-site lysine residues are likely to be inhibited by 
sodium borohydride (see Chapter 18, problem 18). The trans- 


aldolase reaction of the pentose phosphate pathway involves this 


type of active-site intermediate and would be expected to be in- 
hibited by sodium borohydride. 


7. Glycogen molecules do not have any free reducing ends, 


regardless of the size of the molecule. If branching occurs 
every 8 residues and each arm of the branch has 8 residues (or 
16 per branch point), a glycogen molecule with 8000 residues 
would have about 500 ends. If branching occurs every 12 
residues, a glycogen molecule with 8000 residues would have 
about 334 ends. 


lates glycogen synthesis. 

d. Increased blood glucagon inhibits glycogen synthesis and 
stimulates glycogen breakdown. 

e. Because ATP inhibits both phosphofructokinase and pyru- 
vate kinase, and because its level reflects the energy status of 
the cell, a decrease in tissue ATP would have the effect of 
stimulating glycolysis. 

f. Increasing AMP would have the same effect as decreasing 
ATP—stimulation of glycolysis and inhibition of 
gluconeogenesis. 

g. Fructose-6-phosphate is not a regulatory molecule and de- 
creases in its concentration would not markedly affect either 
glycolysis or gluconeogenesis (ignoring any effects due to de- 
creased [G-6-P] as a consequence of decreased [F-6-P]). 


9. At 298 K, assuming roughly equal concentrations of glycogen 
molecules of different lengths, the glucose-1-P concentration 
would be about 0.286 mM. 


10. All four of these reactions begin with the formation of N-carboxybiotin: 
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The carboxylations of geranyl-CoA and urea are shown below: 


XS Xe) 


Bio H 





The mechanism of the carboxyltransferase reaction is a 
combination of the pyruvate carboxylase mechanism (shown in 
Figure 22.3) and a reverse propionyl-CoA carboxylase reaction. 
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ll. a. The mechanism of the acetolactate synthase reaction. 
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b. The phosphoketolase reaction. 
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12. Metformin both stimulates glucose uptake by peripheral tissues 
and enhances the binding of insulin to its receptors. Glipizide 
complements the actions of metformin by stimulating increased 
insulin secretion by the pancreas. 


13. Epalrestat and tolrestat do not resemble the transition state for 
aldose reductase, and evidence from studies by Franklin Pren- 


CH; — C-O <——— 


Acetyl-TPP 


ie 
=r 
= POLJ 
bin 


dergast and others (Ehrig, T., Bohren, K., Prendergast, F., and 
Gabbay, K., 1994. Mechanism of aldose reductase inhibition: 
Binding of NADP*/NADPH and alrestatin-like inhibitors. 
Biochemistry 33:7157-7165) show that these inhibitors probably 
bind to the enzyme: NADP* complex at a site other than the 
active site. 


14. Glucose-6-P 
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Fructose-6-P 
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15. Carbons 1, 2, and 5 of ribose-5-P will be labeled. 


16. 


6 NADP+ 6 NADPH 6 moe ae 6 NADPH 
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17. 
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3 NADP+ 3NADPH 3 NADP+ car as NADPH 


Glucose-6-P —______ 6-Phosphogluconate == Ribulose-5-P 
— > 


a CO» 
Xylulose-5-P Ribose-5-P Xylulese5P 
TK 
Glyceraldehyde-3-P Sedoheptulose-7-P 
TA 
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2 ADP A 


Fructose-1,6-bisP 


Jd 


DHAP —__¥ Glyceraldehyde-3-P 


18. Carbons 2 and 4 end up in carbons 1 and 3 of pyruvate. 


19. The reference by Hurley et al. in the Further Reading for this 
chapter suggests a mechanism based on structural data on glyco- 
genin. Nucleophilic attack by Asp!” on the C-1 of glucose (in 

UDP-glucose) forms a covalent enzyme-substrate intermediate. 

Subsequent nucleophilic attack by the hydroxyl oxygen of Tyr! 

of glycogenin produces the tyrosyl glucose that forms the foun- 


dation for synthesis of glycogen particles. 


The creatine kinase reaction 


O 
| -H cn, 
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During periods of energy abundance, muscles store ATP equiva- 
lents in the form of creatine phosphate. When the muscles 
demand energy, creatine kinase runs in the reverse direction, 
converting creatine phosphate to ATP. 
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20. During the first few seconds of exercise, existing stores of ATP 
are consumed and creatine phosphate then provides additional 
ATP via the creatine kinase reaction. For the next 90 seconds or 
so, anaerobic metabolism (conversion of glucose to lactate via 
glycolysis) provides energy. At this point, aerobic metabolism 
begins in earnest, delivering significant energy resources to 
sustain long-term exercise. 


21. Creatine phosphate provides ATP to muscle via the creatine 
kinase reaction: 


—COooO”~ + ADP 


Chapter 23 


1. Assuming that all fatty acid chains in the triacylglycerol are 
palmitic acid, the fatty acid content of the triacylglycerol is 95% 
of the total weight. On the basis of this assumption, one can cal- 
culate that 30 lb of triacylglycerol will yield 118.7 L of water. 


2. 11-cis-Heptadecenoic acid is metabolized by means of seven 
cycles of B-oxidation, leaving a propionyl-CoA as the final 
product. However, the fifth cycle bypasses the acyl-CoA dehydro- 
genase reaction, because a cis-double bond is already present at 
the proper position. Thus, -oxidation produces 
7 NADH (= 17.5 ATP), 6 FADH; (= 9 ATP), and 7 acetyl-CoA 
(= 70 ATP), for a total of 96.5 ATP. Propionyl-CoA is converted 
to succinyl-CoA (with expenditure of 1 ATP), which can be con- 
verted to oxaloacetate in the TCA cycle (with production of 
1 GTP, 1 FADH», and 1 NADH). Oxaloacetate can be converted 
to pyruvate (with no net ATP formed or consumed), and pyru- 
vate can be metabolized in the TCA cycle (producing 1 GTP, 

1 FADHs, and 4 NADH). The net for these conversions of propi- 
onate is 16.5 ATP. Together with the results of 6-oxidation, the 
total ATP yield for the oxidation of one molecule of 
11-cis-heptadecenoic acid is 113 ATP. 


3. Instead of invoking hydroxylation and f-oxidation, the best 
strategy for oxidation of phytanic acid is a-hydroxylation, 
which places a hydroxyl group at C-2. This facilitates oxidative 
a-decarboxylation, and the resulting acid can react with CoA to 
form a CoA ester. This product then undergoes six cycles of 
B-oxidation. In addition to COs, the products of this pathway 
are three molecules of acetyl-CoA, three molecules of propionyl- 
CoA, and one molecule of 2-methylpropionyl-CoA. 


4. Although acetate units cannot be used for net carbohydrate syn- 
thesis, oxaloacetate can enter the gluconeogenesis pathway in 
the PEP carboxykinase reaction. (For this purpose, it must be 
converted to malate for transport to the cytosol.) Acetate 
labeled at the carboxyl carbon will first label (equally) the C-3 
and C-4 positions of newly formed glucose. Acetate labeled at 
the methyl carbon will label (equally) the C-1, C-2, C-5, and C-6 
positions of newly formed glucose. 


5. This exercise is left to the student, in consultation with the refer- 
ences suggested in the problem. 


6. Fat is capable of storing more energy (37 kJ/g) than carbohy- 
drate (16 kJ/g). Ten pounds of fat contains 10 X 454 X 37 = 
167,980 kJ of energy. This same amount of energy would 
require 167,980/16 = 10,499 g, or 23 lb, of stored carbohydrate. 


CoA—SH + 
O OH O 





I | 
~O—C—CH,—C—CH,—C 
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7. The enzyme methylmalonyl-CoA mutase, which catalyzes the 
third step in the conversion of propionyl-CoA to succinyl-CoA, is 
Byo-dependent. If a deficiency in this vitamin occurs, and if large 
amounts of odd-carbon fatty acids were ingested in the diet, 
L-methymalonyl-CoA could accumulate. 


8. a. Myristic acid: 
CH; (CHo)12COOH + 92 P; + 92 ADP + 20 O —> 
92 ATP + 14 CO, + 106 HO + CoA 


b. Stearic acid: 
CH3(CHe) 16 COOH T 120 P; F 120 ADP + 26 O —> 
120 ATP + 18 CO, + 138 HO + CoA 











c. a-Linolenic acid: 
Cı7HCOOH + 113.5 P; + 113.5 ADP + 24.5 Oo 
113.5 ATP + 18 CO, + 128.5 HO + CoA 





d. Arachidonic acid: 
Ci9HsiCOOH + 125 P; + 125 ADP + 27 Os —> 
125 ATP + 20 CO, + 141 HO + CoA 


9. During the hydration step, the elements of water are added 
across the double bond. Also, the proton transferred to the 
acetyl-CoA carbanion in the thiolase reaction is derived from 
the solvent, so each acetyl-CoA released by the enzyme would 
probably contain two tritiums. Seven tritiated acetyl-CoAs would 
thus derive from each molecule of palmitoyl-CoA metabolized, 
each with two tritiums at C-2. 








10. A carnitine deficiency would presumably result in defective or 
limited transport of fatty acids into the mitochondrial matrix 
and reduced rates of fatty acid oxidation. 


11. 1.3 grams X 37 kJ/gram = 48,000 J. This at first seems like a re- 
markably small amount of energy to sustain the hummingbird 
during a 500-mile flight at 50 mph. However, keep in mind that 
the hummingbird weighs only 3 to 4 grams, and see problem 12 
following. 


12. 48,000 J in 10 hours is 4800 J/hr. If the hummingbird consumes 
250 mL per hour during migration, this means that it is consum- 
ing 4800/250 or 19.2 J/mL of oxygen consumed. If a human 
consumes 12.7 kcal/min while running, this is equivalent to 
12,700 X 4.184 J/cal = 53.1 kJ/min; 48,000 J/53,100 J/min = 
0.9 minute. So a human could run for only less than a minute 
on the energy consumed by the hummingbird (only 3 to 4 g) 
on a 500-mile flight. A typical person would have to run about 
40 miles to lose 1 lb of fat. 


13. A mechanism for the HMG-CoA synthase reaction: 


H:B 


V e 


H,C—C—CH,—C—SCoA 


OH O 





| 
CH,—C—CH,—C—SCoA 
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15. The changes in oxidation state of cobalt in the course of any 
Bjo-dependent reaction arise from homolytic cleavages of the 
Co**—C bonds involved. Moreover, in the Co*t—C bonds 
shown in a typical B;»-dependent reaction, the two electrons of 
the bond are not ascribed to the Co** atom in the calculation 
of the Co oxidation state. Thus, when the Co—C bond is 
cleaved homolytically, one electron reverts to the Co, changing 
its oxidation state from 3+ to 2+. 
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16. See the following mechanisms. 


a-Methyleneglutarate mutase 
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17. 








NH,* 


Hypoglycin A 


CH, i 
H,C==C — CH— CH,—C 








Methylenecyclopropylacetyl-CoA 


(MCPA-CoA) 


bow 





O 
CH HC 
/y a || —_ E 
e E E C SCoA 
4, 
EGE 


Reactive intermediate 


18. This exercise is left to the student and should be based on the 
reference provided in the problem. 


19. It may be presumed that the oxidation of the acyl chain is ac- 
complished via a two-electron transfer, whereas the steps in- 
volved in reoxidation of FADH; by ETF are one-electron 
transfers. FAD/FADHs can participate both in one-electron and 
two-electron transfers, whereas NAD*/NADH can participate 
only in two-electron transfers. 


20. The sequence of reactions involving creation of a double bond, 
then hydration across it, followed by oxidation is what happens 
to succinate in the TCA cycle. (This same sequence of reactions 
is also employed in fatty acid synthesis and in both the catabo- 
lism and anabolism of amino acids.) Clearly, this sequence of 
three reactions must represent an optimal mechanistic strategy 
for the chemistry achieved. 


Chapter 24 


1. The equations needed for this problem are found on page 730. 
See The Student Solutions Manual, Study Guide and Problems Book 
for details. 


2. Carbons C-1 and C-6 of glucose become the methyl carbons of 
acetyl-CoA that is the substrate for fatty acid synthesis. Carbons 
C-2 and C-5 of glucose become the carboxyl carbon of acetyl- 
CoA for fatty acid synthesis. Only citrate that is immediately 
exported to the cytosol provides glucose carbons for fatty acid 
synthesis. Citrate that enters the TCA cycle does not immedi- 
ately provide carbon for fatty acid synthesis. 


3. A suitable model, based on the evidence presented in this 
chapter, would be that the fundamental regulatory mechanism 
in ACC is a polymerization-dependent conformation change in 
the protein. All other effectors—palmitoyl-CoA, citrate, and 
phosphorylation-dephosphorylation—may function primarily 
by shifting the inactive protomer-active polymer equilibrium. 


o 
HC, I 
>C—CH=CH—C—SCoA 
HC 
| 
N N O 
Z w 
N 
O 
O 


n CH=CH—C—SCoA + FAD 


Polymerization may bring domains of the protomer (that is, 
bicarbonate-, acetyl-CoA-, and biotin-binding domains) closer 
together or may bring these domains on separate protomers 
close to each other. See The Student Solutions Manual, Study Guide 
and Problems Book for further details. 


4. The pantothenic acid group may function, at least to some 
extent, as a flexible “arm” to carry acyl groups between the 
malonyl transferase and ketoacyl-ACP synthase active sites. The 
pantothenic acid moiety is approximately 1.9 nm in length, 
setting an absolute upper-limit distance between these active 
sites of 3.8 nm. However, on the basis of modeling considera- 
tions, it seems likely that the distance between these sites is 
smaller than this upper-limit value. 


5. Two electrons pass through the chain from NADH to FAD to 
the two cytochromes of the cytochrome b; reductase and then 
to the desaturase. Together with two electrons from the fatty 
acyl substrate, these electrons reduce an Oz to two molecules of 
water. The hydrogen for the waters that are formed in this way 
comes from the substrate (2H) and from two protons from 
solution. 


6. Ethanolamine + glycerol + 2 fatty acyl-CoA + 2 ATP + CTP + 
HO —> phosphatidylethanolamine + 2 ADP + 2 CoA + CMP 
+ PP; + P; 

7. The conversion of acetyl-CoA to lanosterol can be written as: 


18 Acetyl-CoA + 13 NADPH + 13 H* + 18 ATP + 0.5 O —> 
lanosterol + 18 CoA + 13 NADP* + 18 ADP + 6 P; + 6 PP; + CO» 


The conversion of lanosterol to cholesterol is complicated; 
however, in terms of carbon counting, three carbons are lost in 
the conversion to cholesterol. This process might be viewed as 
1.5 acetate groups for the purpose of completing the balanced 
equation. 





8. The numbers 1-4 in the cholesterol structure indicate the 
carbon positions of mevalonate as shown (note that the num- 
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bering shown here is not based on the systematic numbering of 





mevalonate): 
4 
HC. , „OH 
Saar Be R 
OOC., „C, -CH,OH 
CH, CH, 
HO 
Mevalonate Cholesterol 
9. The O-linked saccharide domain of the LDL receptor probably 11. It “costs” 1 ATP to form a malonyl-CoA. Each cycle of the fatty 
functions to extend the receptor domain away from the cell acyl synthase consumes 2 NADPH molecules, each worth 
surface and above the glycocalyx coat so that the receptor can 3.5 ATP. Thus, each of the seven cycles required to form a 
recognize circulating lipoproteins. palmitic acid consumes 8 ATPs. A total of 56 ATPs are con- 
10. As shown in Figures 24.19 and 24.22, the syntheses (in eukary- sumed to synthesize one molecule of palmitic acid. 
otes) of phosphatidylcholine, phosphatidylethanolamine, 12. The mechanism of the 3-ketosphinganine synthase reaction: is 
phosphatidylinositol, and phosphatidylglycerol are dependent shown in the following figure. 


upon CTP. A CTP deficiency would be likely to affect all these 
synthetic pathways. 
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13: 


16. 
17. 
18. 
19. 


20. 


FAD is required for the eukaryotic reaction that converts stearic 
acid to oleic acid because the oxidation of a single bond to a 
double bond in stearic acid involves NADH and thus requires 

a two-electron transfer, whereas the reoxidation of FADHs, to 
FAD is accomplished by electron transfer to cytochrome bs. 
Cytochromes are capable of one-electron transfers only, so 


6-Ketoacyl ACP synthase 


Y 
S| 
1 

o ie C—S—ACP 

I 
H,C—C 

k ss 
S— KSase 
B:H 


This exercise is left to the student. 
This exercise is left to the student. 
This exercise is left to the student. 


Palmitic acid must be elongated and then unsaturated to form 
linoleic acid. The elongation process involves a thiolase reaction 
to add two carbons to palmitoyl-CoA and then reduction of a 
carbonyl to a hydroxyl, dehydration to form a double bond, and 
then reduction of the double bond to a single bond. These last 
three reactions are the reverse of what happens in TCA, in the 
conversion of succinate to fumarate, then malate, then oxalo- 
acetate. These same three reactions occur in B-oxidation, in fatty 
acid synthesis, and in amino acid synthesis and degradation. 


11 Acetyl-CoA + 10 ATP* + 20 NADPH + 10 Ht —> 
behenoyl-CoA + 20 NADP+ + 10 CoASH + 10 ADP?” + 10 P~ 





Chapter 25 


Į 


The oxidation number of N in nitrate is +5; in nitrite, +3; in 
NO, +2; in NoO, +1; and in No, 0. 


. a. Assume that nitrate assimilation requires 4 NADPH equiva- 


lents per NO; reduced to NH,*. Four NADPH have a meta- 
bolic value of 12 ATP. 

b. Nitrogen fixation requires 8 e~ (see Equation 25.3) and 
16 ATPs (see Figure 25.6) per No reduced. If 4 NADH 
provide the requisite 8 e~, each NADH having a metabolic 
value of 3 ATPs, then 28 ATP equivalents are consumed per 
Np reduced in biological nitrogen fixation (or 14 ATP equiva- 
lents per NH,* formed). 


. a. [ATP] increase will favor adenylylation; the value of n will be 


greater than 6 (n>6). 

b. An increase in Py,/Pyp will favor adenylylation; the value of 
n will be greater than 6 (n >6). 

c. An increase in the [wKG]/[Gln] ratio will favor deadenylyla- 
tion; n <6. 

d. [P;] decrease will favor adenylylation; n >6. 


. Two ATPs are consumed in the carbamoyl-P synthetase-I reac- 


tion, and 2 phosphoric anhydride bonds (equal to 2 ATP equiv- 
alents) are expended in the argininosuccinate synthetase 
reaction. Thus, 4 ATP equivalents are consumed in the urea 


—> H,C—C—CH,—C 


FADH»2/FAD is required because it can participate both in one- 
electron and two-electron transfers. 


14. See Chapter 23, problem 13, for a mechanism for the HMG- 


CoA synthase reaction. 


15. A mechanism for the B-ketoacyl ACP synthase reaction: 


O O 
I | 





S—ACP 


Acetoacetyl — ACP 


cycle, as 1 urea and 1 fumarate are formed from 1 COs, 1 NHs, 
and 1 aspartate. 


. Protein catabolism to generate carbon skeletons for energy pro- 


duction releases the amino groups of amino acids as excess ni- 
trogen, which is excreted in the urine, principally as urea. 


. One ATP in reaction 1, one NADPH in reaction 2, one NADPH 


in reaction 11, and one succinyl-CoA in reaction 12 add up to 
10 ATP equivalents, assuming each NADPH is worth 4 ATPs. 
(The succinyl-CoA synthetase reaction of the citric acid cycle 
[see p. 575] fixes the metabolic value of succinyl-CoA versus 
succinate at 1 GTP [= 1 ATP].) 


. From Figure 25.36: C-labeled carbon atoms derived from C-2 


of PEP are shaded yellow. 
l. 2 aspartate —> transamination —+> 2 oxaloacetate. 


2. 2 oxaloacetate + 2 GTP —> PEP carboxykinase —> 
2 PEP + 2 CO; + 2 GDP. 


3. 2 PEP +2 HO —> enolase——> 2 2-PG. 
4. 2 2-PG —> phosphoglyceromutase —> 2 3-PG. 


5. 2 3-PG + 2 ATP —> 3-P glycerate kinase — > 2 1,3-bisPG + 
2 ADP. 


6. 2 1,3-bisPG + 2 NADH + 2 Ht —> G-3-P dehydrogenase —> 
2 G-3-P + 2 NAD* + 2P.. 
7. 1 G-3-P —> triose-P isomerase——> 1 DHAP. 
8. G-3-P + DHAP —> aldolase—— fructose-1,6-bisP. 
9. F-1,6-bisP + HO —~> FBPase—> F-6-P + P.. 
10. F-6-P — > phosphoglucoisomerase —> G-6-P. 





11. G-6-P + HO —> glucose phosphatase —— glucose + P;. 
Net: 2 aspartate + 2 GTP + 2 ATP + 2 NADH + 6 H* + 
4 H:O —> glucose + 2 CO, + 2 GDP + 2 ADP + 4 P; + 2 NAD* 
(Note that 4 of the 6 H* are necessary to balance the charge on 
the 4 carboxylate groups of the 2 OAA and the loss of amino 
groups from Asp is ignored.) 











(As a consequence of reaction [1], 2 a-keto acids [for example, 
a-ketoglutarate] will receive amino groups to become 2 a-amino 
acids [for example, glutamate ].) 


9. 


10. 


ll. 


12; 


13. 


14. 


Alanine: glutamate : pyruvate aminotransferase. 


Arginine: from glutamate via ornithine, so it’s glutamate 
dehydrogenase. 


Aspartate: glutamate:oxaloacetate aminotransferase. 


Asparagine: from aspartate, so it’s the glutamate:oxaloacetate 
aminotransferase. 


Cysteine: cysteine is formed from serine, so it’s glutamate via 
3-phosphoserine aminotransferase. 


Glutamate: glutamate dehydrogenase. 
Glutamine: glutamine synthetase. 


Glycine: glycine is formed from serine, so it’s glutamate via 
3-phosphoserine aminotransferase. 


Histidine: from glutamate via L-histidinol phosphate 
aminotransferase. 


Isoleucine: glutamate:a-keto-6-methylvalerate aminotransferase. 
Leucine: glutamate:a-ketoisocaproate aminotransferase. 


Lysine: from glutamate via saccharopine formation by a 
glutamate-dependent NADPH dehydrogenase; in bacteria, lysine 
is synthesized from aspartate, so it’s glutamate: oxaloacetate 
aminotransferase. 


Methionine: from aspartate, so it’s glutamate:oxaloacetate 
aminotransferase. 


Phenylalanine: glutamate : phenylpyruvate aminotransferase 
(= phenylalanine aminotransferase). 


Proline: from glutamine, so it’s glutamate dehydrogenase. 
Serine: glutamate via 3-phosphoserine aminotransferase. 


Threonine: from aspartate, so it’s glutamate: oxaloacetate 
aminotransferase. 


Tryptophan: from serine via tryptophan synthase, so its a-amino 
group comes from serine, which gets its amino group from glu- 
tamate via 3-phosphoserine aminotransferase. 


Tyrosine: glutamate : 4-hydroxyphenylpyruvate aminotransferase 
(= tyrosine aminotransferase). 


Valine: glutamate : a-ketoisovalerate aminotransferase. 


Pyridoxal (vitamin Bg), because it is the precursor to pyridoxal- 
P, the key coenzyme in aminotransferase reactions, as well as 
other aspects of amino acid metabolism. 


The conversion of homocysteine to methionine is folate-depen- 
dent; dietary folate absorption is dependent on vitamin By», for 
removal of methyl groups added to folate during digestion; 
finally, the a-amino group of homocysteine formed in the me- 
thione biosynthetic pathway comes from aspartate via the pyri- 
doxal-P-—dependent glutamate : oxaloacetate aminotransferase. 


The BCKAD complex is structurally and functionally analogous 
to the pyruvate dehydrogenase complex and the a-ketoglutarate 
dehydrogenase complexes. See Figure 19.4; the reaction mecha- 
nism for the pyruvate dehydrogenase complex is essentially the 
same as that of the BCKAD complex. 

Aspartame is a N-a-L-aspartyl1-phenylalanine-l-methyl ester that 
is broken down in the digestive tract and phenylalanine is re- 
leased. Phenylalanine is the substance that phenylketonurics 
must avoid. 


Glyphosate inhibits 3-enolpyruvylshikimate-5-P synthase, an es- 
sential enzyme in the biosynthesis of chorismate. Not only is 
chorismate the precursor for synthesis of the aromatic amino 
acids Phe, Tyr, and Trp, it is also the precursor for formation of 
lignin, a major structural component in plant cell walls, as well 


15s 


16. 


17: 


18. 


19. 


20. 
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as other essential substances such as folate, coenzyme Q, plasto- 
quinone, and vitamins E and K. 


Glycine is formed from serine, which is formed from 
3-phosphoglycerate. Carbons 3 and 4, 2 and 5, and 1 and 6 of 
glucose contribute carbons 1, 2, and 3 of 3-phosphoglycerate, 
respectively. The B-carbon of serine derives from the 3-C in 
3-phosphoglycerate, the C,-carbon atom in serine comes from 
3-phosphoglycerate C-2, and the carboxyl-C in serine comes 
from C-1 of 3-PG. It is these latter two carbons of 3-PG that are 
found in glycine; that is, C-1 and C-2 of 3-PG; C-1 of 3-PG came 
from C-3 and C-4 in glucose, and C-2 of 3-PG came from C-2 
and C-5 in glucose. 


Serine is an important precursor for glycine synthesis. 
Consult www.pdb.org to view pdb id = 1LM1. 


The liver converts amino acids to glucose to provide a source of 
energy for other cells, such as nerve and red blood cells. Con- 
version of amino acids to glucose by gluconeogenesis requires 
energy; for example, six ATP equivalents are needed to form 
glucose from aspartic acid. 


GS monomers are inactive because GS active sites require ele- 
ments of protein structure contributed by adjacent subunits in 
the GSi dodecamer. 


Four of the six carbons in the ring of Tyr come from erythrose- 
4-P and the remaining two from PEP; the carboxyl-C, C,, and 
Cg of Tyr come from PEP. Degradation of Tyr to acetoacetate + 
fumarate yields acetoacetate composed from the C, and Cg plus 
2 C atoms from the ring which are either both from E-4-P or 
one each from E-4-P and PEP, depending on the orientation of 
the ring (rotation about the Cg-ring C bond). Fumarate will be 
composed from 2 C atoms from PEP and 2 from E-4-P or 3 C 
atoms from E-4-P and 1 from PEP. 


Chapter 26 
T; 
2. 


See Figure 26.2 (purines) and Figure 26.13 (pyrimidines). 
Assume ribose-5-P is available. 


Purine synthesis: 2 ATP equivalents in the ribose-5-P pyrophos- 
phokinase reaction, 1 in the GAR synthetase reaction, 1 in the 
FGAM synthetase reaction, 1 in the AIR carboxylase reaction, 

1 in the CAIR synthetase reaction, and 1 in the SACAIR syn- 

thetase reaction yields IMP, the precursor common to ATP and 

GTP. Net: 7 ATP equivalents. 

a. ATP: 1 GTP (an ATP equivalent) is consumed in converting 
IMP to AMP; 2 more ATP equivalents are needed to convert 
AMP to ATP. Overall, ATP synthesis from ribose-5-P onward 
requires 10 ATP equivalents. 

b. GTP: 2 high-energy phosphoric anhydride bonds from ATP, 
but 1 NADH is produced in converting IMP to GMP; 2 more 
ATP equivalents are needed to convert GMP to GTP. Overall, 
GTP synthesis from ribose-5-P onward requires 8 ATP 
equivalents. 

Pyrimidine synthesis: Starting from HCO; and Gln, 2 ATP 
equivalents are consumed by CPS-II, and an NADH equiva- 
lent is produced in forming orotate. OMP synthesis from 
ribose-5-P plus orotate requires conversion of ribose-5-P to 
PRPP at a cost of 2 ATP equivalents. Thus, the net ATP 
investment in UMP synthesis is just 1 ATP. 

c. UTP: Formation of UTP from UMP requires 2 ATP equiva- 
lents. Net ATP equivalents in UTP biosynthesis = 3. 

d. CTP: CTP biosynthesis from UTP by CTP synthetase con- 
sumes 1 ATP equivalent. Overall ATP investment in CTP syn- 
thesis = 4 ATP equivalents. 
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3. a. See Figure 26.6. 
b. See Figure 26.17. 
c. See Figure 26.17. 
4. a. Azaserine inhibits glutamine-dependent enzymes, as in steps 


2 and 5 of IMP synthesis (glutamine : PRPP amidotransferase 
and FGAM synthetase), as well as GMP synthetase (step 2, 
Figure 26.5), and CTP synthetase (Figure 26.16). 

b. Methotrexate, an analog of folic acid, antagonizes THF- 
dependent processes, such as steps 4 and 10 (GAR trans- 
formylase and AICAR transformylase) in purine biosynthesis 
(Figure 26.3), and the thymidylate synthase reaction (Figure 
26.26) of pyrimidine metabolism. 

c. Sulfonamides are analogs of p-aminobenzoic acid (PABA). 
Like methotrexate, sulfonamides antagonize THF formation. 
Thus, sulfonamides affect nucleotide biosynthesis at the same 
sites as methotrexate, but only in organisms such as prokary- 
otes that synthesize their THF from simple precursors such 
as PABA. 

d. Allopurinol is an inhibitor of xanthine oxidase (Figure 
26.10). 

e. 5-Fluorouracil inhibits the thymidylate synthase reaction 
(Figure 26.26). 


5. UDP, via conversion to (UDP (Figure 26.24), is ultimately a pre- 
cursor to dTTP, which is essential to DNA synthesis. 
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6. See Figure 26.23. 


7. Ribose, as ribose-5-P, is released during nucleotide catabolism 
(as in Figure 26.8). Ribose-5-P is catabolized via the pentose 
phosphate pathway and glycolysis to form pyruvate, which enters 
the citric acid cycle. From Figure Chapter 22, problem 17, 
note that 3 ribose-5-P (rearranged to give 1 ribose-5-P and 
2 xylulose-5-P) give a net consumption of 2 ATPs and a net pro- 
duction of 8 ATPs and 5 NADH (= 15 ATPs), when converted 
to 5 pyruvate. If each pyruvate is worth 15 ATP equivalents (as in 
a prokaryotic cell), the overall yield of ATP from 3 ribose-5-P is 
75 ATP + 23 ATP — 2 ATP = 96 ATP. Net yield per ribose-5-P is 
thus 32 ATP equivalents. 


8. Comparing Figures 26.3 and 25.40, note that AICAR (5- 
aminoimidazole-4-carboxamide ribonucleotide) is a common 
intermediate in both pathways. It is a product of step 5 of histi- 
dine biosynthesis (Figure 25.40) and step 9 of purine biosynthe- 
sis (Figure 26.3). Thus, formation of AICAR as a byproduct of 
histidine biosynthesis from PRPP and ATP bypasses the first 
nine steps in purine synthesis. However, cells require greater 
quantities of purine than of histidine, and these nine reactions 
of purine synthesis are essential in satisfying cellular needs for 
purines. 


9. See the following figure. 
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17; 


Aspartate + GTP^ + HO —> 
fumarate + NH,* + GDP? + P2- 
1. Uric acid + 2 HO + Oo —> urate oxidase —> allantoin + 
HO; + COs 
2. Allantoin + HO —> allantoinase —> allantoic acid + 
glyoxylic acid 








3. Allantoic acid + HO —> allantoicase —> 2 urea 
4. 2 Urea + 2 HO —> urease — 2 CO, + 2 HN; 


Uv 








[Aspartate ] 


ATCase (subunit organization a8.) consists of 6 functional 
units, each composed of 1 a-subunit and 1 B-subunit, best de- 
scribed as (@B)6¢. 


From these starting materials to UTP in a eukaryotic cell, where 
mitochondrial oxidation of dihydroorotate via a coenzyme 
Q-linked flavoprotein would yield 1.5 ATPs, has a net cost of 
2.5 ATPs. From UTP to dTTP would require an ATP at the CTP 
synthetase step; recovery of an ATP in the nucleoside diphos- 
phate kinase reaction ADP + CTP — CDP + ATP; an NADPH 
(= 4 ATP equivalents) in converting CDP > dCDP, recovery of 
an ATP equivalent in the deoxycytidylate kinase reaction dCDP 
+ ADP > dCMP + ATP, deamination of (CMP to (UMP by 
dCMP deaminase; conversion of (UMP to dTMP by thymidylate 
synthase and then 2 ATP equivalents to form dTTP. Net: 

7.5 ATP equivalents versus 10 ATP for ATP synthesis via the 
purine biosynthetic pathway. 

. UMP + ATP —> UDP + ADP 

. UDP + NADPH —> dUDP + NADP* + OH- 

dUDP + ATP —> dUTP + ADP 

dUTP + HO —> dUMP + PP; 

. PP; + HO —> 2 P; 


. GUMP + N°,N!°-methylene-THF —> dTMP + DHF 
Net: UMP + N°,N!°-methylene-THF + 2 ATP + NADPH + 
2 HO —> dTMP + DHF + 2 ADP + 2 P; + OH- + NADP+ 


Possibilities include glutamine phosphoribosyl pyrophosphate 
amidotransferase (see Figure 26.3), adenine phosphoribosyl 
transferase and hypoxanthine-guanine phosphoribosyl trans- 
ferase (see Figure 26.7), ribose-5-phosphate pyrophosphokinase 
(see Figure 26.3), and enzymes of the pyrimidine salvage 
pathway. 








Qo po nm 








Explore pdb id = 7DFR. Note the proximity of the 4-position of 
the nicotinamide ring (the hydride donor) to the 7-position of 
folate (the hydride acceptor). Note that polar groups on the 
substrates are solvent-exposed. 


18. View pdb id = 1RAA to see that the CTP-binding sites on the 


R-subunits are dominated by £-strands. View pdb id = 1RAA to 


19. 
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see the ATCase substrate-binding sites (evident from the 
location of the substrate analog N-2-(phosphonoacetyl)-L- 
asparagine). The allosteric (CTP-binding) sites and the active 
sites are at opposite ends of the protein. Thus, CTP binding is 
communicated to the active site via ligand-induced conforma- 
tional changes extending across the entire protein. R-state 
ATCase is represented by pdb id = 2IPO; T-state by pdb id = 
IRAA. 


If GMP levels are high (but GDP or AMP is not), ribose-5-P 
pyrophosphokinase will not be inhibited, glutamine-PRPP 
amidotransferase will be roughly 50% inhibited due to GMP 
binding at the G nucleotide allosteric site, and IMP dehydroge- 
nase will be blocked, so IMP is directed toward AMP synthesis to 
give a balanced amount of A versus G nucleotides. 


20. Carbons 1 and 6. 


Chapter 27 
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a. Keq = 360,333. 

b. Assuming [ATP] = [ADP], [pyruvate] must be greater than 
360,333 [PEP] for the reaction to proceed in reverse. 

c Keg = 0.724. 

d. [PEP]/ [pyruvate] = 724. 

e. Yes. Both reactions will be favorable as long as [PEP]/[pyru- 
vate] falls between 0.0000028 and 724. 


2. Energy charge = 0.945; phosphorylation potential = 1111 M7'. 


3. 8mM ATP will last 1.12 sec. Because the equilibrium constant 


for creatine-P + ADP > Cr + ATP is 175, [ATP] must be less 
than 1750 [ADP] for the reaction creatine-P + ADP —> Cr + 
ATP to proceed to the right when [Cr-P] = 40 mM and [Cr] 
= 4 mM. 


. From Equation 20.12, €(NADP*/NADPH) = —0.350 V; 


€(NAD*/NADH) = —0.281V; thus, AS = 0.069 V, and AG = 
—13,316 J/mol. If an ATP “costs” 50 kJ/mol, this reaction can 
produce about 0.27 ATP equivalent at these concentrations of 
NAD*, NADH, NADP+, and NADPH. 





. Assume that the Keq for ATP + AMP —> 2 ADP = 1.2 (see 


legend to Figure 27.2). When [ATP] = 7.2 mM, [ADP] = 
0.737 mM and [AMP] = 0.063 mM. When [ATP] decreases by 
10% to 6.48 mM, [ADP] + [AMP] = 1.52 mM and thus [ADP] 
= 1.30 mM and [AMP] = 0.22 mM. A 10% decrease in [ATP] 
has resulted in a 0.22/0.063 = 3.5-fold increase in [AMP]. 


6. J (the flux of F-6-P through the substrate cycle) at low [AMP] = 


0.1; J at high [AMP] = 8.9. Therefore, the flux of F-6-P through 
the cycle has increased 89-fold. 


. Inhibition of acetyl-CoA carboxylase will lower the concentra- 


tion of malonyl-CoA. Because malonyl-CoA inhibits uptake of 
fatty acids (see Figure 24.16), decreases in [malonyl-CoA] favor 
fatty acid synthesis. 


. Ethanol oxidation to acetate yields 2 NADH in the cytosol, 


each worth 1.5 ATPs. The acetyl thiokinase reaction consumes 

2 ATP equivalents in converting acetate to acetyl-CoA (due to 
ATP — AMP + PP;). Combustion of acetyl-CoA — 2 CO» in a 
liver cell yields a net of 10 ATPs. Therefore, the net yield of ATP 
from ethanol in a liver cell is 11 ATPs. Glucose — 6 CO» ina 
liver cell yields 30 ATPs (see Table 20.4) or 5 ATP/C atom. 
Ethanol — 2 CO» gives 11 ATPs or 5.5 ATP/C atom. 


. Palmitoyl-CoA = 8 acetyl-CoA yields 7 NADH and 7 [FADH»] = 


21 + 14 = 35. Eight acetyl-CoA — palmitoyl-CoA requires 
14 NADPH + 7 ATP = —63 ATP (negative sign denotes ATP 
consumed). The palmitoyl-CoA == 8 acetyl-CoA conversion is 
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favorable in both directions provided the free energy release is 
more than 1750 kJ/mol in the catabolic (acetyl-CoA forming) 
direction (the energy necessary to produce 35 ATP equivalents 
at a cost of 50 kJ/mol each) and less than 3150 kJ/mol in the 
anabolic (palmitoyl-CoA forming) direction (the energy re- 
leased from 63 ATP equivalents). 


Cellular respiration releases 2870 kJ/mol of glucose under stan- 
dard-state conditions (and about the same amount under cellu- 
lar conditions). Assuming the cell is a bacterial cell where 
cellular respiration produces 38 ATPs, the total cost of ATP syn- 
thesis is 38(50) = 1900 kJ/mol. Thus, the overall free energy 
change = —2870 + 1900 970 kJ/mol. Because A G” = 

—RT In Keg, Keq = € = 10'. This is a very large number! 
It will be even larger for a cell where ATP yields per glucose 
are less. 








Carbon dioxide fixation leads to glucose synthesis at the cost 
of 12 NADPH and 18 ATP = 66 ATP equivalents. At 50 kJ/mol, 
the energy investment from ATP is —3300 kJ/mol and the value 
of a glucose is +2870 kJ/mol. Thus, AG°’ = —430 kJ/mol = 
—RT In Keq. Keq = e13? = 10%, which is also a very large 
number! 

Glycogen phosphorylase, phosphorylase kinase, glycogen syn- 
thase, PFK-1, PFK-2, FBPase, and glucose-6-P phosphatase are 
liver enzymes that act specifically in glycogenolysis or gluconeo- 
genesis; these enzymes are potential targets to control blood 
glucose levels. 


From Figure 27.12: The “limit production” list would include 
NPY, AgRP, and ghrelin; the “raise levels” list would include 
cholecystokinin, leptin, insulin, melanocortins, and PYY3_36. 


Leptin injection in ob/ob mice decreases food intake, raises fatty 
acid oxidation levels, and lowers body weight. Obese humans are 
not usually defective in leptin production. Indeed, because leptin 
is produced in adipocytes, obese individuals may already have 
high levels of leptin, so leptin injection has limited success. 

No, although NPY is an orexic agent, it is not necessarily an 
obesity-promoting hormone. Mice deficient in melanocortin 
production, melanocortin receptors, or leptin receptors would 
have an obese phenotype, because all these agents act in ap- 
petite-suppressing pathways. 

Alcohol consumption lowers the NAD*/NADH ratio, which 
limits glycolysis because glyceraldehyde-3-P dehydrogenase re- 
quires NAD", stimulates gluconeogenesis because NADH 

favors glyceraldehyde-3-P dehydrogenase working in the glucose 
synthesis direction, and limits fatty acid oxidation because 
B-oxidation requires NAD*. NADH is also a negative regulator 
of several citric acid cycle enzymes. 


The T172D mutant of AMPK mimics the phosphorylated form 

in having negative charge on the side chain at position 172. 

a. Fatty acyl-CoA. 

b. Transgenic mice lacking functional hypothalamic fatty acid 
synthase should have elevated [malonyl-CoA]. If hypothala- 
mic levels of malonyl-CoA are high, eating should diminish, 
as should body fat content. Assuming physical activity is pro- 
portional to signals of nutrient availability, physical activity 
should increase. 

a. Leptin-deficient mice should respond to regular leptin injec- 
tions by losing weight. 

b. The leptin receptor. Assay wild-type and db/db mutant cells 
for leptin binding ability. 

The heart is the principal organ using both glucose and fatty 

acids in the well-fed state; the brain relies mostly on glucose 


20. 


(although adipose tissue also relies on glucose for glycerol pro- 
duction); muscle relies mostly on fatty acids; the brain never 
uses fatty acids; and muscle produces lactate, which is converted 
in the liver into glucose. 


a. Hexokinase is an R enzyme because it is a glycolytic enzyme. 
Glutamine: PRPP amidotransferase is not an ATP-dependent 
enzyme, so it is neither an R or a U enzyme. It responds to 
the adenine nucleotide pool through feedback inhibition by 
AMP, ADP, and ATP. 

b. If E.C. = 0.5, hexokinase activity is high and ribose-5-P 
pyrophosphokinase activity is likely to be low because R-5-P 
pyrophosphokinase is probably a U enzyme. 

c. At E.C. = 0.95, the situation reverses and hexokinase activity 
is low and R-5-P pyrophosphokinase activity is high. 
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a) 


Because DNA replication is semiconservative, the density of the 
DNA will be (1.724 + 1.710)/2 = 1.717 g/mL. If replication is 
dispersive, the density of the DNA will also be 1.717 g/mL. To 
distinguish between these possibilities, heat the dsDNA obtained 
after one generation in N to 100°C to separate the strands; 
then examine the strands by density gradient ultracentrifugation. 
If replication is indeed semiconservative, two bands of ssDNA will 
be observed, the “heavy” one containing N atoms and the 
“light” one containing '!N atoms. If replication is dispersive, only 
a single band of intermediate density would be observed. 


. The 5'-exonuclease activity of DNA polymerase I removes mis- 


paired segments of DNA sequence that lie in the path of the 
advancing polymerase. Its biological role is to remove mispaired 
bases during DNA repair. The 3’-exonuclease activity acts as a 
proofreader to see whether the base just added by the poly- 
merase activity is properly base-paired with the template. If not 
(that is, if it is an improper base with respect to the template), 
the 3’-exonuclease removes it and the polymerase activity can 
try once more to insert the proper base. An E. coli strain lacking 
DNA polymerase I 3'-exonuclease activity would show a high 
rate of spontaneous mutation. 


. Assume that the polymerization rate achieved by each half of 


the DNA polymerase III homodimer is 750 nucleotides per sec. 
The entire E. coli genome consists of 4.64 X 10° bp. At a rate of 
750 bp/sec per DNA polymerase III homodimer (one at each 
replication fork), DNA replication would take almost 3100 sec 
(51.7 min; 0.86 hr). When E. coli is dividing at a rate of once 
every 20 min, E. coli cells must be replicating DNA at the rate of 
4.64 X 10° bp per 20 min (2.32 X 10° bp/min or 3867 bp/sec). 
To achieve this rate of replication would require initiation of 
DNA replication once every 20 min at ori, and a minimum of 
3933/1500 = 2.57 replication bubbles per E. coli chromosome, 
or 5.14 replication forks. 


(2) (3) 


The E. coli chromosome replicating 
once every 20 min (3 replication 
bubbles, 6 replication forks) 


5. If there are 40 molecules of DNA polymerase II (as DNA poly- 
merase II homodimers) per E. coli cell and E. coli growing at its 
maximum rate has about 5 replication forks per chromosome, 
DNA polymerase II availability is sufficient to sustain growth at 
this rate. 


6. Okazaki fragments are 1000 to 2000 nucleotides in length. 
Because DNA replication in E. coli generates Okazaki fragments 
whose total length must be 4.64 X 10° nucleotides, a total of 
2300 to 4700 Okazaki fragments must be synthesized. Consider 
in comparison a human cell carrying out DNA replication. The 
haploid human genome is 3 X 10° bp, but most cells are diploid 
(6 X 10° bp). Collectively, the Okazaki fragments must total 
6 X 10° nucleotides in length, distributed over 3 to 6 million 
separate fragments (assuming they are 1-2 kb in length). 


7. DNA gyrases are ATP-dependent topoisomerases that intro- 
duce negative supercoils into DNA. DNA gyrases change the 
linking number, L, of double-helical DNA by breaking the 
sugar—phosphate backbone of its DNA strands and then reli- 
gating them (see Figure 11.23). In contrast, helicases are ATP- 
dependent enzymes that disrupt the hydrogen bonds between 
base pairs that hold double-helical DNA together. Helicases 
move along dsDNA, leaving ssDNA in their wake. 


8. A diploid human cell contains 6 X 10° bp of DNA. One origin of 
replication every 300 kbp gives 20,000 origins of replication in 
6 X 10° bp. If DNA replication proceeds at a rate of 100 bp/sec 
at each of 40,000 replication forks (2 per origin), 6 X 10° bp 
could be replicated in 1500 sec (25 min). To provide 2 molecules 
of DNA polymerase per replication fork, a cell would require 
80,000 molecules of this enzyme. 


9. For purposes of illustration, consider how the heteroduplex bac- 
teriophage chromosome in Figure 28.17 might have arisen: 
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10. The mismatch repair system of E. coli relies on DNA methylation 
to distinguish which DNA strand is “correct” and which is “mis- 
matched”; the unmethylated strand (the newly synthesized one 
that has not had sufficient time to become methylated is, by def- 
inition, the mismatched one). Homologous recombination in- 
volves DNA duplexes at similar stages of methylation, neither of 
which would be interpreted as “mismatched,” and thus the mis- 
match repair system does not act. 
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11. B-DNA normally has a helical twist of about 10 bp/turn, so the 
rotation per residue (base pair), Ad, is 36°. If the DNA is 
unwound to 18.6 bp/turn, A¢ becomes 19.4°. Thus, the change 
in Ad is 16.6°. 

12. Consider the two DNA duplexes, WC and wc, respectively, dis- 
played as “ladders” in the following figure. The gap at the right 
denotes the initial cleavage, in this case of the Wand w strands. 
Cleavage by resolvase at the arrows labeled 1 and ligation of like 
strands (Wwith wand Cwith c) will yield patch recombinants; 
cleavage at the arrows labeled 2 and ligation of like strands gives 
splice recombinants. 














13. The DNA is: GCTA 
CGAT 
a. HNO», causes deamination of C and A. Deamination of C 
yields U, which pairs the way T does, giving: 
GTTA and ACTA and (rarely) ATTA 
CAAT TGAT TAAT 
Deamination of A yields I, which pairs as G does, giving: 
GCTG and GCCA and (rarely) GCCG 
CGAC CGGT CGGC 
b. Bromouracil usually replaces T, and pairs the way C does: 
GCCA and GCTG and (rarely) GCCG 
CGGT CGAC CGGC 
Less often, bromouracil replaces C, and mimics T in its base- 
pairing: 
GTTA and ACAT and (rarely) ATTA 
CAAT TGTA TAAT 
c. 2-Aminopurine replaces A and normally base-pairs with T but 
may also pair with C: 
GCTG and CGGT and (rarely) GCCG 
CGAC GCCA CGGC 

14. Transposons are mobile genetic elements that can move from 
place to place in the genome. Insertion of a transposon within 
or near a gene may disrupt the gene or inactivate its expression. 

15. The PrP5° form of the prion protein accumulates in individuals 
with mutations in the gene for PrP* and is the cause of inherita- 
ble (genetic) forms of Creutzfeldt-Jacob disease. Furthermore, 
prion-related diseases may be acquired by ingestion of PrP*°, 
indicating the infectious properties of this form of the protein. 
Variant CJD (vCJD) is an example of an infectious version of a 
prion disease. 

16. Hexameric helicases unwind DNA at a cost of 1 ATP/nucleotide. 
The genome of E. coli K12 consists of 4,686,137 nucleotides, so 
4,686,137 ATP equivalents would be needed to completely 
unwind the E. coli K12 chromosome. 

17. Antibodies against specific proteins are commonly used 
protein identification tools. Armed with an antibody specific 


for DNA polymerase HI, another antibody specific for DNA 
polymerase IV, and a procedure to separate replication forks 
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(large macromolecular complexes) from soluble proteins (gel 
filtration, centrifugation?), addition of DNA polymerase IV 
should lead to association of IV with the replication fork and 
release of III. IV should now be found in the large macro- 
molecular complex and III among the soluble proteins. 


Eukaryotic translesion DNA polymerases with small and stubby 
thumb and finger domains are error-prone because they lack 
the structural elements ensuring high selectivity for the proper 
dNTP substrate. Since 3’-exonuclease activity is a proofreading 
activity, translesion DNA polymerases would not need it. 


a. Cells exit the cell at G; and enter Go, because this stage is 
prior to DNA replication. Once DNA replication takes place, 
cells move on to mitosis. 

b. Progression through the cell cycle is regulated at checkpoints. 
Checkpoints, which control whether the cell continues into 
the next phase, ensure that steps necessary for successful com- 
pletion of each phase of the cycle are satisfactorily accom- 
plished. If conditions for advancement to the next phase are 
not met, the cycle is arrested until they are. Checkpoints 
depend on cyclins and cyclin-dependent protein kinases 
(CDKs). Cyclins are synthesized at one phase of the cell cycle 
and degraded at another. CDK interaction with a specific 
cyclin is essential for the protein kinase activity of the CDK. In 
turn, these CDKs control events at each phase of the cycle by 
targeting specific proteins for activation (or inactivation) 
through phosphorylation. Destruction of the phase-specific 
cyclin at the end of the phase inactivates the CDK. 


Recombination in codon 97 suggests that the V, gene provides 
codons 94-96, which are GTT CAT CTT and specify Val-His- 
Leu. The fourth codon is either TTG (Leu), CTG (Leu), or 
CGG (Arg). CGG (Arg) would be the change. However, the 
desirability of a change cannot be anticipated. 
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The first AUG codon is underlined: 


5'-AGAUCCGUAUGGCGAUCUCGACGAAGACUCCUAGG 
GAAUCC... 
Reading from the AUG codon, the amino acid sequence of the 
protein is: 
(N-term) Met-Ala-Ile-Ser-Thr-Lys-Thr-Pro-Arg-Glu-Ser-... 


. See Figure 29.3 for a summary of the events in transcription ini- 


tiation by E. coli RNA polymerase. A gene must have a promoter 
region for proper recognition and transcription by RNA polym- 
erase. The promoter region (where RNA polymerase binds) typ- 
ically consists of 40 bp to the 5’-side of the transcription start 
site. The promoter is characterized by two consensus sequence 
elements: a hexameric TTGACA consensus element in the 

—35 region and a Pribnow box (consensus sequence TATAAT) 
in the —10 region (see Figure 29.4). 


. The fact that the initiator site on RNA polymerase for nu- 


cleotide binding has a higher K,„ for NTP than the elongation 
site ensures that initiation of mRNA synthesis will not begin 
unless the available concentration of NTP is sufficient for com- 
plete synthesis of an mRNA. 


. First, transcription is compartmentalized within the nucleus of 


eukaryotes. Furthermore, in contrast to prokaryotes (which have 
only a single kind of RNA polymerase), eukaryotes possess three 
distinct RNA polymerases—I, II, and I1]—acting on three dis- 
tinct sets of genes (see Section 29.3). The subunit organization 
of these eukaryotic RNA polymerases is more complex than that 
of their prokaryotic counterparts. The three sets of genes are 


recognized by their respective RNA polymerases because they 
possess distinctive categories of promoters. In turn, all three 
polymerases interact with their promoters via particular tran- 
scription factors that recognize promoter elements within their 
respective genes. Protein-coding eukaryotic genes, in analogy 
with prokaryotic genes, have two consensus sequence elements 
within their promoters, a TATA box (consensus sequence 
TATAAA) located in the —25 region, and Jnr, an initiator 
element that encompasses the transcription start site. In addi- 
tion, eukaryotic promoters often contain additional short, con- 
served sequence modules such as enhancers and response 
elements for appropriate regulation of transcription. 


Transcription termination differs as well as prokaryotes and 
eukaryotes. In prokaryotes, two transcription basic termination 
mechanisms occur: rho protein-dependent termination and 
DNA-encoded termination sites. Transcription in eukaryotes 
tends to be imprecise, occurring downstream from consensus 
AAUAAA sequences known as poly(A) addition sites. An impor- 
tant aspect of eukaryotic transcription is post-transcriptional 
processing of mRNA (see Section 29.5). 


. When DNA-binding proteins “read” specific base sequences in 


DNA, they do so by recognizing a specific matrix of H-bond 
donors and acceptors displayed within the major groove of 
B-DNA by the edges of the bases. When DNA-binding proteins 
recognize a specific DNA region by “indirect readout,” the 
protein is discerning local conformational variations in the 
cylindrical surface of the DNA double helix. These conforma- 
tional variations are a consequence of unique sequence infor- 
mation contributed by the base pairs that make up this region 
of the DNA. 


. The metallothionine promoter is about 265 bp long. Because 


each bp of B-DNA contributes 0.34 nm to the length of DNA, 
this promoter is about 90 nm long, covering 26.5 turns of 
B-DNA (10 bp/turn). Each nucleosome spans about 146 bp, so 
about 2 nucleosomes would be bound to this promoter. 


. If cells contain a variety of proteins that share a leucine zipper 


dimerization motif but differ in their DNA recognition helices, 
heterodimeric bZIP proteins can form that will contain two 
different DNA contact basic regions (consider this possibility in 
light of Figures 29.34 and 29.35). These heterodimers are no 
longer restricted to binding sites on DNA that are dyad- 
symmetric; the repertoire of genes with which they can interact 
will thus be dramatically increased. 


. Exon 17 of the fast skeletal muscle troponin T gene is one of 


the mutually exclusive exons (see Figure 29.45). Deletion of this 
exon would cut by half the alternative splicing possibilities, so 
only 32 different mature mRNAs would be available from this 
gene. If exon 7, a combinatorial exon, were duplicated, it would 
likely occur as a tandem duplication. The different mature 
mRNAs could contain 0, 1, or 2 copies of exon 7. The 32 combi- 
natorial possibilities shown in the center of Figure 29.45 repre- 
sent those with 0 or 1 copy of exon 7. Those with 2 copies are 
16 in number: 456778, 56778, 46778, 6778, 45778, 4778, 5778, 
778, 4577, 577, 5677, 45677, 4677, 477, 677, and 77. Given the 
mutually exclusive exons 17 and 18, 96 different mature mRNAs 
would be possible. 


. Modifications include acetylation and methylation of Lys 


residues, thus suppressing their positive charge: methylation 

of Arg residues, again suppressing the positive charge, and phos- 
phorylation of Ser residues, thus introducing negative charges. 
The positively charged Lys and Arg residues interact electrostati- 
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cally with phosphate groups on the DNA backbone, so these 
interactions would be diminished through acetylation and 
methylation. Phosphorylation of Ser residues would lead to elec- 
trostatic repulsion between the DNA backbone and the histone 
tails. Collectively, these modifications weaken histone: DNA 
interactions. 


Because the lac repressor is tetrameric and each subunit has 

an inducer-binding site, it seems likely that inducer binding 

is cooperative. Cooperative binding of inducer would be advan- 
tageous, because the slope of the lac operon expression versus 
[inducer] would be steeper over a narrower [inducer] range. 


Capping of the 5’-end and polyadenylation of the 3’-end of 
mRNA protect the RNA from both 5'- and 3’-exonucleolytic 
degradation. Methylations at the 5’-end enhance the interaction 
between cap-binding protein eIF4E (see Figure 30.29), thus in- 
creasing the likelihood that the mRNA will be translated. Note 
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also in Figure 30.29 that polyadenylylation provides a protein- 
binding site for proper assembly of the 40S initiation complex 
essential to translation. 


In the A Deeper Look box on page 870, the argument is made 
that polymerases have a common enzymatic mechanism for 
nucleotide addition to a growing polynucleotide chain based on 
two metal ions coordinated to the incoming nucleotide. These 
metal ions interact with two aspartate residues that are highly 
conserved in both DNA and RNA polymerases. The discovery of 
a second Mg** ion in the RNA polymerase II active site confirms 
to the universality of this model. 

50 bp of DNA is (50) (0.34 nm/bp) = 17 nm. Because 147 bp of 
DNA make 1.7 turns around the nucleosome (see Chapter 11) 

= 86.4 bp/turn of DNA, 50 bp = 0.58 turns, or slightly more 
than 3 turn of DNA around the histone core octamer. Promoter 
and response modules are about 20 to 25 bp. 





ii OH 
Ha 
oO 


a-Helices composed of 7 to 8 residues would have a length of 
1.05 to 1.2 nm. The overall diameter of B-DNA is about 2.4 nm, 
but the diameter at the bottom of major groove is significantly 
less (as a reference, the C,’—C,’ distance between base-paired 
nucleotides in separate chains is 1.1 nm). A distance along the 
B-DNA helix axis of 1.1 nm would correspond to about 3 base 
pairs, so if such an a-helix were laid along the major groove, it 
could not make contacts with more than 3 base pairs (see also 
Figure 29.32). 


Phe acts as a wedge to separate the template and nontemplate 
strands from one another; Phe is located at the end of the 
Y-helix. 
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a-Amanitin binds in the large cleft between the two largest RNA 
polymerase II subunits, in particular to an a-helix, the “bridge” 
helix, running across this cleft. Its position in the cleft slows in- 
hibits translocation of DNA and RNA and the entry of the next 
template nucleotide into the active site. 


The two protein chains in pdb id = 1GU4 are identical through 
positions 268 (D) to 333 (Q). Leucines are found at positions 
297, 304, 306, 313, 320, 324, 327, and 330. Those at positions 
306, 313, 320, and 327 are seven residues apart, as in a leucine 
zipper. Basic residues (K and R) precede these leucines, occur- 
ring at positions, 268, 275, 277, 280, 286, 287, 289, 291, 293, 
295, and 302. 
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19, 


20. 


A protein such as CAP that interacts with phosphate groups in 
the DNA backbone does so via electrostatic interactions based 
on positively charged Arg and Lys side chains. 

Chromatin is decompacted by HATs and chromatin remodeling 
complexes. Histone acetylation by HATs disrupts chromatin 
structure (see problem 9) and chromatin remodeling com- 
plexes then mediate ATP-dependent conformational changes in 
the chromatin that peels about 50 bp of DNA from the core 
octamer, exposing the DNA for access by the transcriptional 
machinery. 


Chapter 30 


J. 


5. 


The cDNA sequence as presented: 
CAATACGAAGCAATCCCGCGACTAGACCTTAAG... 
represents six potential reading frames, the three inherent in 
the sequence as written and the three implicit in the comple- 

mentary DNA strand, which is written 5’3’, is: 

... GTTAAGGTCTAGTCGCGGGATTGCTTCGTATTG 
Two of the three reading frames of the cDNA sequence given 
contain stop codons. The third reading frame is a so-called 
open reading frame (a stretch of coding sequence devoid of 
stop codons): 


CAATACGAAGCAATCCCGCGACTAGACCTTAAG... 
(alternate codons underlined) 





The amino acid sequence it encodes is: 
Gln-Tyr-Glu-Ala-Ile-Pro-Arg-Leu-Asp-Leu-Asn.... 
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The wobble rules state that a first-base anticodon U can recog- 
nize either an A or a G in the codon third-base position; first- 
base anticodon G can recognize either U or C in the codon 
third-base position; and first-base anticodon I can recognize 
either U, C, or A in the codon third-base position. Thus, codons 
with third-base A or G, which are degenerate for a particular 
amino acid (Table 30.3 reveals that all codons with third-base 
purines are degenerate, except those for Met and Trp), could 
be served by single tRNA species with first-base anticodon U. 
More emphatically, codons with third-base pyrimidines (C or U) 
are always degenerate and could be served by single tRNA 
species with first-base anticodon G. Wobble involving first-base 
anticodon I further minimizes the number of tRNAs needed to 
translate the 61 sense codons. Wobble tends to accelerate the 
rate of translation because the noncanonical base pairs formed 
between bases in the third position of codons and bases occupy- 
ing the first-base wobble position of anticodons are less stable. 
As a consequence, the codon: anticodon interaction is more 
transient. 


Two of the three reading frames of the complementary DNA se- 
quence also contain stop codons. The third may be an open 
reading frame: 
... GITAAGGTCTAGTCGCGGGATTGCTICGTATTG 
(alternate codons underlined) 





An unambiguous conclusion about the partial amino acid se- 
quence of this cDNA cannot be reached. 


2. Arandom (AG) copolymer would contain varying amounts of 
the following codons: 


AAA AAG AGA GAA AGG GAG GGA GGG 
codons for Lys Lys Arg Glu Arg Glu Gly Gly, respectively. 


Therefore, the random (AG) copolymer would direct the syn- 
thesis of a polypeptide consisting of Lys, Arg, Glu, and Gly. The 
relative frequencies of the various codons are a function of the 
probability that a base will occur in a codon. For example, 
if the A/G ratio is 5/1, the ratio of AAA/AAG is (5 X 5 X 5)/ 
(5 X 5X 1) = 5/1. If the 3A codon is assigned a value of 100, 
then the 2A1G codon has a frequency of 20. From this analysis, 
the relative abundances of these amino acids in the polypeptide 
should be: 
Lys = 120; Arg = 24; Glu = 24; Gly = 4.8 
(Normalized to Lys = 100: Arg = 20; Glu = 20; Gly = 4) 
3. Review the information in Section 30.2, noting in particular the 
two levels of specificity exhibited by aminoacyl-tRNA synthetases 
(1: at the level at ATP == PP; exchange and aminoacyl adeny- 
late synthesis in the presence of amino acid and the absence of 
tRNA; and 2: at the level of loading the aminoacyl group on an 
acceptor tRNA). 


4. Base pairs are drawn such that the B-DNA major groove is at the 
top (see following figure). 





ayoa a « 
oO O a ae > 
H 
< | a s gA À A 
N A Pa \ =o" =S = 
R H i y“ 
G:U U:G 


6. The stop codons are UAA, UAG, and UGA. 


Sense codons that are a single-base change from UAA include 
CAA (Pro), AAA (Lys), GAA (Glu), UUA (Leu), UCA (Ser), 
UAU (Tyr), and UAC (Tyr). 

Sense codons that are a single-base change from UAG include 
CAG (Gln), AAG (Lys), GAG (Glu), UCG (Ser), UUG (Leu), 
UGG (Trp), UAU (Tyr), and UAC (Tyr). 


Sense codons that are a single-base change from UGA include 
CGA (Arg), AGA (Arg), GGA (Gly), UUA (Leu), UCA (Ser), 
UGU (Cys), UGC (Cys), and UGG (Trp). 

That is, 19 of the 61 sense codons are just a single base change 
from a nonsense codon. 


7. The list of amino acids in problem 6 is a good place to start in 
considering the answer to this question. Amino acid codons in 
which the codon base (the wobble position) is but a single base 
change from a nonsense codon are the more likely among this 
list, because pairing is less stringent at this position. These 


10. 


11. 


12. 


13. 


include UAU (Tyr) and UAC (Tyr) for nonsense codons UAA 
and UAG, and UGU (Cys), UGC (Cys), and UGG (Trp) for 
nonsense codon UGA. 


. The more obvious answer to this question is that eukaryotic 


ribosomes are larger, more complex, and hence slower than 
prokaryotic ribosomes. In addition, initiation of translation re- 
quires a greater number of initiation factors in eukaryotes than 
in prokaryotes. It is also worth noting that eukaryotic cells, in 
contrast to prokaryotic cells, are typically under less selective 
pressure to multiply rapidly. 


. Each amino acid of a protein in an extended £-sheet-like con- 


formation contributes about 0.35 nm to its length. 10 nm + 
0.35 nm per residue = 28.6 amino acids. 


Larger, more complex ribosomes offer greater advantages in 
terms of their potential to respond to the input of regulatory in- 
fluences, to have greater accuracy in translation, and to enter 
into interactions with subcellular structures. Their larger size 
may slow the rate of translation, which in some instances may be 
a disadvantage. 


The universal organization of ribosomes as two-subunit struc- 
tures in all cells—archaea, eubacteria, and eukaryotes—suggests 
that such an organization is fundamental to ribosome function. 
Translocation along mRNA, aminoacyl-tRNA binding, peptidyl 
transfer, and deacylated-tRNA release are processes that require 
repetitious uncoupling of physical interactions between the 
large and small ribosomal subunits. 


Prokaryotic cells rely on N-formyl-Met-tRNA;™ to initiate 
protein synthesis. The tRNA;™* molecule has a number of dis- 
tinctive features, not found in noninitiator tRNAs, that earmark 
it for its role in translation initiation (see Figure 30.16). Further- 
more, N-formyl-MettRNA;™ interacts only with initiator codons 
(AUG or, less commonly, GUG). A second tRNAM*, designated 
tRNA,“ serves to deliver methionyl residues as directed by in- 
ternal AUG codons. Both tRNA{™* and tRNA,,™“ are loaded 
with methionine by the same methionyl-tRNA synthetase, and 
AUG is the Met codon, both in initiation and elongation. AUG 
initiation codons are distinctive in that they are situated about 
10 nucleotides downstream from the Shine-Dalgarno sequence 
at the 5’-end of mRNAs; this sequence determines the transla- 
tion start site (see Figure 30.18). Eukaryotic cells also have two 
tRNA™* species, one of which is a unique tRNAM* that func- 
tions only in translation initiation. Eukaryotic mRNAs lack a 
counterpart to the prokaryotic Shine-Dalgarno sequence; 
apparently the eukaryotic small ribosomal subunit binds to the 
5’-end of a eukaryotic mRNA and scans along it until it encoun- 
ters an AUG codon. This first AUG codon defines the eukary- 
otic translation start site. 


The Shine-Dalgarno sequence is a purine-rich sequence 
element near the 5’-end of prokaryotic mRNAs (see Figure 
30.18). It base-pairs with a complementary pyrimidine-rich 
region near the 3’-end of 16S rRNA, the rRNA component of 
the prokaryotic 30S ribosomal subunit. Base pairing between 
the Shine-Dalgarno sequence and 16S rRNA brings the transla- 
tion start site of the mRNA into the P site on the prokaryotic 
ribosome. Because the nucleotide sequence of the Shine- 
Dalgarno element varies somewhat from mRNA to mRNA, 
whereas the pyrimidine-rich Shine—Dalgarno-binding sequence 
of 16S rRNA is invariant, different mRNAs vary in their affinity 
for binding to 30S ribosomal subunits. Those that bind with 
highest affinity are more likely to be translated. 


14. 


15; 


16. 


17, 


18. 


19. 


20. 
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The most apt account is b. Polypeptide chains typically contain 
hundreds of amino acid residues. Such chains, attached as a pep- 
tidyl group to a tRNA in the P site, would show significant inertia 
to movement, compared with an aminoacyl-tRNA in the A site. 


Elongation factors EF-Tu and EF-Ts interact in a manner analo- 
gous to the GTP-binding G proteins of signal transduction path- 
ways (see page 467, as well as pages 1024-1028 and Figure 
15.19). EF-Tu binds GTP and in the GTP-bound form delivers 
an aminoacyl-tRNA to the ribosome, whereupon the GTP is 
hydrolyzed to yield EF-Tu: GDP and P;. EF-Ts mediates an ex- 
change of the bound GDP on EF-Tu: GDP with free GTP, regen- 
erating EF-Tu: GTP for another cycle of aminoacyl-tRNA 
delivery. The a-subunit of the heterotrimeric GTP-binding G 
proteins also binds GTP. G, has an intrinsic GTPase activity and 
the GTP is eventually hydrolyzed to form G,:GDP and P.. 
Guanine nucleotide exchange factors facilitate the exchange of 
bound GDP on G, for GTP, in analogy with EF-Ts for EF-Tu. 
The amino acid sequences of EF-Tu and G proteins reveal that 
they share a common ancestry. 


Four: two in the aminoacyl-tRNA synthetase reaction; one by 

EF-Tu; one by EF-G. 

a. The proteins are shown as ribbons. 

b. The rRNA is shown as a strand (line). 

c. The tRNAs (shown with the sugar—phosphate backbone as a 
strand and the bases as ring structures) are oriented such 
that their acceptor stems extend away from the 30S subunit, 
in the direction from which a 50S subunit will add. 

d. The tRNA acceptor stems will lie within the peptidyl trans- 
ferase center of the 50S subunit. 


a. 64,281. 

b. Ifyou use the cursor to rotate the structure, many bases 
around the periphery are easy to see (otherwise, the com- 
plexity of the structure makes it hard to see them). 

c. Many such regions are easy to find along the structure’s 
periphery. 

d. RNA. 

The QB phage replicase mRNA shows the most consecutive 

perfect matches (6) complementary to the Shine-Dalgarno 

sequence. 


Chloramphenicol consists of an aromatic ring and several polar 
functions (a nitro group, two —OH groups, and an amine-N). 
The aromatic ring might intercalate between bases in the 23S 
rRNA peptidyl transferase site, the polar functions might form 
H bonds with nitrogenous bases or sugar —OH groups in the 
rRNA. The —NOg group is also rather bulky in addition to 
being polar, so it could disrupt peptidyl transferase in a number 
of ways. 


Chapter 31 


1. 


Human rhodanese has 296 amino acid residues. Its synthesis 
would require the involvement of 4 ATP equivalents per residue 
or 1184 ATP equivalents. Folding of rhodanese by the Hsp60 
a4 complex consumes another 130 ATP equivalents. The total 
number of ATP equivalents expended in the synthesis and 
folding of rhodanese is approximately 1314. 


. Because a single proteolytic nick in a protein can doom it to 


total degradation, nicked proteins are clearly not tolerated by 
cells and are quickly degraded. Cells are virtually devoid of par- 
tially degraded protein fragments, which would be the obvious 
intermediates in protein degradation. The absence of such in- 
termediates and the rapid disappearance of nicked proteins 
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12. 


13. 


from cells indicate that protein degradation is a rigorously selec- 
tive, efficient cellular process. 


. Hsp70: Hsp70 binds to exposed hydrophobic regions of un- 


folded proteins. The Hsp70 domain involved in this binding is 
18 kD in size and therefore would have a diameter of roughly 

3 nm (see problem 5 for the math). Assuming the hydrophobic 
binding site of Hsp70 stretches across its diameter, it would in- 
teract with about 3 nm/0.35 nm = 8 or 9 amino acid residues. 
Multiple Hsp70 monomers could interact with longer hydropho- 
bic stretches. So, for Hsp70 to interact with a polypeptide chain 
does not necessarily depend on the protein’s size, but it does 
depend on its hydrophobicity (and absence of charged groups). 


Hsp70 and Hsp60 chaperonins: Proteins that interact with both of 
these classes of chaperones not only must fit the description for 
Hsp70 targets but must also be small enough to access the 
Hsp60 chaperonin chamber, which has a diameter of about 

5 nm. This restriction means that Hsp60 cannot interact with 
proteins more than roughly 50 kD in mass (see problem 5). 


. Many eukaryotic proteins have a multidomain or modular orga- 


nization, where each module is composed of a contiguous se- 
quence of amino acid residues that folds independently into a 
discrete domain of structure (see Figure 6.26 for examples of 
this type of sequence and structure organization). Such proteins 
would be ideal for co-translational folding: As each newly synthe- 
sized contiguous sequence emerges from the ribosome tunnel, 
it begins folding into its characteristic domain structure. The 
final, fully folded state of the complete protein would be 
achieved when the various domains assumed their proper 
spatial relationships to one another through hinge motions oc- 
curring at intradomain regions of the protein. 


. The maximal diameter for a spherical protein would be 5 nm 


(radius = 2.5 nm). The volume of this protein is given by V = 
irr? = 4/3(3.14) (2.5 X 10°? m)? = 65.4 X 1072) mL. If its 
density is 1.25 g/mL, its mass would be (1.25 g) (65.4 X 

10°?! mL) = 81.8 X 1077! g/molecule, so its molecular weight = 
(6.023 X 10%) (81.8 X 107% g) = 492.5 X 10? g/mol = 

49,250 g/mol, or about 50 kD. 


. There are 649 different phosphorylated forms for a protein 


having 7 separate phosphorylation sites. 


10. 


leads to release of bound polypeptide, giving it the opportunity 
to fold, which is an important step in DnaK function. EF-Ts cat- 
alyzes nucleotide exchange on EF-Tu, converting it to the con- 
formational form competent in aminoacyl-tRNA binding. 
Binding of the aminoacyl-tRNA: EF-Tu complex to the ribosome 
A site triggers the GTPase activity of EF-Tu as codon: anticodon 
recognition takes place and the aminoacyl-tRNA: EF-Tu complex 
conformationally adjusts to the A site. DnaJ delivers an unfolded 
polypeptide chain to DnaK, and its interaction with DnaK also 
triggers the ATPase activity of DnaK, whereupon the DnaK: ADP 
complex forms a stable complex with the unfolded polypeptide. 
In both instances, hydrolysis of bound nucleoside triphosphate 
is triggered and leads to conformational changes that stabilize 
the respective complexes. 


. Protein targeting information resides in more generalized 


features of the leader sequences such as charge distribution, 
relative polarity, and secondary structure, rather than amino 
acid sequence per se. Proteins destined for secretion have 
N-terminal amino acid sequence with one or more basic amino 
acids followed by a run of 6 to 12 hydrophobic amino acids. 
The sequence MRSLLILVLCFLPAALGK... has a basic residue 
(Arg) at position 2 and a run of 12 nonpolar residues 

(... LLILVLPLAALG...); thus, it appears to be a signal se- 
quence for a secretory protein. Cleavage by the signal pepti- 
dase would occur between the G and K. 


. Translocation proceeds until a stop transfer signal associated 


with a hydrophobic transmembrane protein segment is recog- 
nized. The stop-transfer signal induces a pause in translocation, 
and the translocon changes its conformation such that the wall 
of this closed cylindrical structure either opens or exposes a 
hydrophobic path, allowing the transmembrane segment to 
diffuse laterally into the hydrophobic phase of the membrane. 


The maximal diameter for a spherical protein would be 6 nm 
(radius = 3 nm). The volume of this protein is given by V = 
Sar? = 4/3(3.14) (3 X 10-9m)3 = 113 X 10-2! mL. If its density 
is 1.25 g/mL, its mass would be (1.25 g)(113 X 1077! mL) = 
141 X 10°?! ¢/molecule, so its molecular weight = (6.023 X 
10%) (141 X 1072! g) = 851 X 10? g/mol = 85,100 g/mol, or 
about 85 kD. 





N-terminal amino acid. Proteins with Met, Ser, Ala, Thr, Val, Gly, 
or Cys at the amino terminus are resistant to its action. Proteins 
having Arg, Lys, His, Phe, Tyr, Trp, Leu, Ie, Asn, Gln, Asp, or 
Glu at their N-terminus are susceptible. N-terminal Pro residues 
lack a free a-amino group, so such proteins are not susceptible. 


15. 
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E; ubiquitin protein ligase selects proteins by the nature of the 14. The cell cycle relies on the cyclic synthesis and destruction of 


cell cycle regulatory proteins. Lactacystin inhibits cell cycle pro- 
gression by interfering with programmed destruction of pro- 
teins by proteasomes. 


The temperature-induced switch could be based on a 
temperature-dependent conformational change in the HtrA 


16. 


17. 
18. 


19. 


20. 


protein that brings the His-Ser-Asp catalytic triad into the 
proper spatial relationship for protease function. 

There are many other post-translational modifications, includ- 
ing acylation, alkylation, amidation (at a protein’s C-terminus), 
biotinylation, formylation, glycosylation, isoprenylation, and oxi- 
dation. See also Table 5.5. 

This exercise is left to the student. 


A phosphodegron is defined as one or a series of phosphory- 
lated residues on a substrate protein that interact directly with a 
protein-protein interaction domain in an E3 ubiquitin ligase, 
thereby linking the substrate to the ubiquitin conjugation ma- 
chinery. Phosphorylation can affect the affinity of target protein 
binding; alternatively, phosphorylation can stimulate E2 activity. 


It depends on which amino acid is penultimate. If Arg, Lys, His, 
Phe, Tyr, Trp, Leu, Ile, Asn, Gln, Asp, or Glu follow immediately 
after Met, the protein will have a short half-life after Met 
removal. If Ser, Ala, Thr, Val, Gly, Cys, Pro, or another Met 
follow, the protein should have a long half-life after N-terminal 
Met removal. 


To array positively charged residues on one side of an a-helix 
and hydrophobic residues on the other requires a pattern with 
positively charged residues positioned every 3.6 residues and 
similarly for hydrophobic residues. A sequence with Arg or Lys 
at positions 1, 4, 7, 11, 14, and 18 and an array of hydrophobic 
residues at 2, 3, 5, 6, 8, 9, 10, 12, 13, 15, 16, 17, 19, and 20 would 
fit this pattern. 


Chapter 32 


i 


Polypeptide hormones constitute a larger and structurally more 
diverse group of hormones than either the steroid or amino 
acid-derived hormones, and it thus might be concluded that 
the specificity of polypeptide hormone-receptor interactions, at 
least in certain cases, should be extremely high. The steroid hor- 
mones may act either by binding to receptors in the plasma 
membrane or by entering the cell and acting directly with pro- 
teins controlling gene expression, whereas polypeptides and 
amino acid-derived hormones act exclusively at the membrane 
surface. Amino acid-derived hormones can be rapidly intercon- 
verted in enzyme-catalyzed reactions that provide rapid re- 
sponses to changing environmental stresses and conditions. See 
The Student Solutions Manual, Study Guide and Problems Book for 
additional information. 


. The cyclic nucleotides are highly specific in their action, 


because cyclic nucleotides play no metabolic roles in animals. 
Ca?* ion has an advantage over many second messengers 
because it can be very rapidly “produced” by simple diffusion 
processes, with no enzymatic activity required. IP; and DAG, 
both released by the metabolism of phosphatidylinositol, form a 
novel pair of effectors that can act either separately or synergisti- 
cally to produce a variety of physiological effects. DAG and 
phosphatidic acid share the unique property that they can be 
prepared from several different lipid precursors. Nitric oxide, a 
gaseous second messenger, requires no transport or transloca- 
tion mechanisms and can diffuse rapidly to its target sites. 


. Nitric oxide functions primarily by binding to the heme pros- 


thetic group of soluble guanylyl cyclase, activating the enzyme. 
An agent that could bind in place of NO—but that does not 
activate guanylyl cyclase—could reverse the physiological effects 
of nitric oxide. Interestingly, carbon monoxide, which has long 
been known to bind effectively to heme groups, appears to func- 
tion in this way. Solomon Snyder and his colleagues at Johns 


10. 
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Hopkins University have shown that administration of CO to 
cells that have been stimulated with nitric oxide causes attenua- 
tion of the NO-induced effects. 


. Herbimycin, whose structure is shown in the figure below, re- 


verses the transformation of cells by Rous sarcoma virus, pre- 
sumably as a direct result of its inactivation of the viral tyrosine 
kinase. The manifestations of transformation (on rat kidney 
cells, for example) include rounded cell morphology, increased 
glucose uptake and glycolytic activity, and the ability to grow 
without being anchored to a physical support (termed anchorage- 
independent growth). Herbimycin reverses all these phenotypic 
changes. On the basis of these observations, one might predict 
that herbimycin might also inactivate tyrosine kinases that bear 
homology to the viral pp60""* tyrosine kinase. This inactivation 
has in fact been observed, and herbimycin is used as a diagnos- 
tic tool for implicating tyrosine kinases in cell-signaling 
pathways. 


HCO, 





HCO 


CH, CH; OCH, 


. The identification of phosphorylated tyrosine residues on 


cellular proteins is difficult. Quantities of phosphorylated pro- 
teins are generally extremely small, and to distinguish tyrosine 
phosphorylation from serine/threonine phosphorylation is 
tedious and laborious. On the other hand, monoclonal antibod- 
ies that recognize phosphotyrosine groups on protein provide a 
sensitive means of detecting and characterizing proteins with 
phosphorylated tyrosines, using, for example, Western blot 
methodology. 


. Hormones act at extremely low concentrations, but many of the 


metabolic consequences of hormonal activation (release of cyclic 
nucleotides, Ca?* ions, DAG, etc., and the subsequent alterations 
of metabolic pathways) occur at and involve higher concentra- 
tions of the affected molecular species. As we have seen in this 
chapter, most of the known hormone receptors mediate hor- 
monal signals by activating enzymes (adenylyl cyclase, phospholi- 
pases, protein kinases, and phosphatases). One activated enzyme 
can produce many thousands of product molecules before it is 
inactivated by cellular regulation pathways. 


. Vesicles with an outside diameter of 40 nm have an inside diam- 


eter of approximately 36 nm and an inside radius of 18 nm. The 
data correspond to a volume of 2.44 X 107% L. Then 10,000 
molecules/6.02 X 10° molecules/mole = 1.66 X 107% mole. 
The concentration of acetylcholine in the vesicle is thus 
1.66/2.44 M or 0.68 M. 


. The evidence outlined in this problem points to a role for cAMP 


in fusion of synaptic vesicles with the presynaptic membrane 
and the release of neurotransmitters. GTPyS may activate an 
inhibitory G protein, releasing G,j(GTPyS), which inhibits 
adenylyl cyclase and prevents the formation of requisite cAMP. 


. In both cases, the fraction of receptor that is bound with 


hormone is approximately 50%. 


Statin drugs inhibit the synthesis of new cholesterol in human 
subjects, but they have no effect on dietary intake of cholesterol. 
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Dietary cholesterol accounts for approximately 2/3 of all body membrane. The Nernst equation is obtained by setting AG to 
cholesterol. Moreover, synthesis of steroid hormones is not gen- zero in Equation 9.2. Rearrangement yields 
erally dependent on ambient levels of cholesterol in the body. RT In (C/C) = -ZFA 


Thus the taking of statin drugs would not be expected to influ- 


ence the levels and functions of steroid hormones in the body. Using the data in Figure 32.49, one can use this equation to 


yield an equilibrium potential for Kt of —77 mV and an equilib- 


11. -Strands indeed provide more genetically economical means of rium potential for Na* of +53.4 mV. 


traversing a membrane. However, the only significant classes of 
proteins that employ -strands for membrane traversal are the 
porins and toxins such as hemolysin. These f-strand structures 


14. Using the Goldman equation, one can calculate Ay = —60 mV, 
in agreement with values measured experimentally in neurons. 


provide no easy way to communicate conformational changes 15. The resting potential in neurons is —60 mV. Local depolariza- 

across the membrane. On the other hand, bundles of a-helices tion of the membrane causes the potential to rise approximately 

are more able to convey and transmit conformational changes 20 mV to about —40 mV. This change causes the Na* channels 

across a bilayer membrane. to open and Na* ions begin to flow into the cell. This causes the 
12. The formation and breakdown of cAMP: potential to continue increasing to a value of about +30 mV. 


Because this value is close to the equilibrium potential for Na+, 
the Nat channels begin to close and K* channels open. Kt 
~O—P—O—P—O—P—O—CH, Adenine rushes out of the neuron, returning the membrane potential to 

oO a very negative value. There is a slight overshoot of the poten- 
tial, and the K* channels close and the resting membrane po- 
tential is restored. 





16. Pathway steps (in Figure 32.4) that involve amplification include 
the tyrosine kinase of the receptor protein, Raf, MAPKK, and 
MAPK. 


17. If Ras were mutated so as to have no GTPase activity, it would 
activate Raf indefinitely, and the pathway would run as if hor- 
mones were continually activating the receptor tyrosine kinase. 


18. There are several possible answers to this problem. One is that 
the original assays of kinase activities that reflected signaling 
effects were carried out on whole-cell preparations, in which the 
G-protein-dependent and G-protein-independent activities would 
appear as components of the overall activation. The availability 
of G-protein-uncoupled receptor mutants and ligands that could 





H stimulate arrestin recruitment without G-protein activation made 
it possible to distinguish the G-protein-dependent and G-protein- 
T independent effects. (See the Further Reading references in this 
HO—P—O—CH, Adenine chapter by Luttrell for additional discussion of these issues.) 
| O 


19. Malathion has been used for years in the eradication of boll 
weevils, but it has not presented a serious toxicity problem in 
this program. Apparently, malathion is very toxic to insects, and 
relatively less toxic to humans, particularly in the low volume 
applications used for this purpose. 





13. The point at which no ion flow occurs is the point of equilib- 
rium that balances the chemical and electrical forces across a 20. This exercise is left to the student, for obvious reasons. 





B = box; D = definition; F = figure; 
G = molecular graphic; S = structure; 
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A band, 483 
A site, 965 
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Acetohydroxy acid synthase, 793 
Acetyl phosphate, 61, 63 
free energies of hydrolysis of, 57T 
hydrolysis reaction of, 62F 
Acetylcholine 
acetylcholinesterase degrading, 1048, 
1050 
degradation of, 1050F 
Acetylcholine receptor, 1047, 1048F 
Acetylcholine transport protein, 1050 
Acetylcholinesterase, 1048, 1050 
Reat /Kn ratio of, 394T 
Acetyl-CoA 
control of gluconeogenesis by, 670 
in fatty acid synthesis, 729 
leucine catabolism and, 807 
mevalonate and, 751-752 
in TCA cycle 
carbon atoms in, 579, 581 
carbon dioxide produced from, 
571-575, 571K, 573F 
decarboxylation of pyruvate and, 
566-567 


Acetyl-CoA carboxylase, in fatty acid synthe- 
sis, 723-727, 726F 
phosphorylation of, 723F, 726-727, 
727F 
Acetyl-coenzyme A, 57T, 520 
N-Acetylneuraminic acid, 228 
Acetylphenylalanine methyl ester, 
438S 
O-Acetylserine sulfhydrylase, 796 
Acid 
acetic, titration curve for, 39F, 40F 
N-Acetylneuraminic, 228 
adenylic, 295 
aldaric, 188 
aldonic, 188 
amino, 70-92. See also Amino acid 
arachidic, 220T 
arachidonic, 220, 220T, 735-736, 736F, 
748F 
aspartic, 72S 
behenic, 220T 
bile, 761, 761F 
cholic, 233, 234S, 761F 
chorismic, 797, 798F 
conjugate, 36 
cytidylic, 295 
deoxycholic, 233, 234S 
docosahexaenoic, 736B 
eicosapentaenoic, 736B 
fatty, 219-222, 220T, 221G, 221S 
catabolism of, 701-721. See also Fatty 
acid catabolism 
essential, 531 
fusidic, 983S 
gibberellic, 230S 
gluconic, 188 
b-glucuronic, 188S 
glutamic, 72S 
glycocholic, 761F 
guanylic, 295 
hydroxyeicosanoic, 747 
L-iduronic, 188S 
inosinic, 814, 815F, 816-818, 819F 
lignoceric, 220T 
linoleic, 220, 220T, 735-736, 736F 
muramic, 190 
nervonic, 220T 
neuraminic, 190 
octadecanoic, 219 
oleic, 219, 220T, 221G, 221S 
palmitic, 219, 220T 
palmitoleic, 220T 
phosphatidic, 223G, 224 
phosphoric, 40, 41F 
polyamino, 142 


ribothymidylic, 304 
sialic, 228 
residues of, 208S 
stearic, 219, 220T 
sugar, 187-188, 188S 
taurocholic, 761F 
teichoic, 204 
tricarboxylic, 532B 
uric, 293S, 810, 824-825 
gout caused by, 824-825 
purine catabolism and, 823 
structure of, 814S 
uridylic, 295 
uronic, 188 
Acid dissociation constant, 37, 39T 
Acid hydrolysis, 99 
Acid-base behavior of glycine, 76-77 
Acid-base catalysis, 430-431 
Acid-base properties of amino acids, 
76-79, 76F, 77T 
Acidic amino acid, 71 
Acidity, 59T 
Acidosis 
lactic, 681B 
respiratory, 44B 
Aconitase 
citrate isomerized by, 572 
iron-sulfur cluster and, 573 
Aconitase reaction in tricarboxylic acid 
cycle, 573, 573F, 573G 
Acquired immunodeficiency syndrome, 
443B, 876B 
Acridine orange, 326, 327S 
Actin, 116F, 265, 481, 481D, 484, 484F 
Actin gene, 939 
Actin-anchoring protein, 486B 
Actinomycin D, 326, 327S 
Action potential, 1044-1046, 1045F 
Activation 
free energy of, 388 
light, 655 
Activation loop, 1017 
Activation sequence, upstream, 927 
Activator 
transcription, 934 
transcriptional, 923 
Activator gene, 885 
Activator site, 923 
Active site, 389, 398 
Active transport, 277-285 
calcium, 280 
cardiac glycosides and, 279, 280G, 2808, 
282B 
as energy-coupling devices, 278 
gastric H* ,K*-ATPase, 280-282, 282F 


I-2 Index 


Active transport (continued) 
monovalent cation, 278-279, 279F, 279G 
secondary, 286 
Active-site residues, 433B 
Acute lymphoblastic leukemia, 789B 
Acute myeloblastic leukemia, 789B 
Acyl carrier protein, 727, 727F 
Acyl-CoA dehydrogenase 
in beta-oxidation, 704—705 
mechanism of, 706F 
Acyl-CoA dehydrogenase reaction, 706F 
Acyl-CoA synthetase, 702 
Acyl-CoA:cholesterol acyltransferase, 757 
Acyl-coenzyme A, 665-666 
Acyldihydroxyacetone phosphate reduc- 
tase, 738-739 
Acyl-enzyme intermediate, 436 
ADA complex, 933 
ADA-SCID, biochemical defects in, 378B 
Adenine, 292, 293S 
Adenine nucleotide pool, total, 843-844 
Adenine phosphoribosyltransferase, 821 
Adenosine, 294B, 295, 295S 
Adenosine deaminase, 378B, 822B, 823 
Adenosine-5'-diphosphate 
free energies of hydrolysis of, 57T 
phosphorylation of, 66 
Adenosine-5'-monophosphate, 295 
free energies of hydrolysis of, 57T 
glycogen phosphorylase and, 464—465 
inosinic acid as precursor to, 814, 815F, 
816-818, 820F 
structure of, 296S 
Adenosine-5'-phosphosulfate, 319B 
Adenosine-5'-triphosphate, 64S 
S-Adenosylmethionine, 57T, 790, 792F 
S-Adenosylmethionine synthase, 790 
Adenovirus-mediated gene delivery, 377, 
378F, 379 
Adenylate kinase, 843 
Adenylic acid, 295 
Adenylosuccinase, 815F, 819 
Adenylosuccinate lyase, 815F, 823 
Adenylosuccinate synthetase, 819, 823 
Adenylyl cyclase, 465F, 466, 1051 
modulation of, 1025F 
Adenylyl transferase, 777 
Adenylylation, 461T 
Adipocyte, 697 
Adipose cell, 697 
electron micrograph of, 698F 
pentose phosphate pathway and, 684 
Adipose tissue 
energy metabolism of, 848T 
energy stored in, 698T 
fats in, 697, 700-701 
metabolic role of, 848F, 851-852 
A-DNA, 325T 
ADP 
free energies of hydrolysis of, 57T 
phosphorylation of, 66 
ADP ribosylated eEF-2, 980B 
a,-Adrenergic receptors, 1043 
Adrenocorticotropic hormone, 763, 1009F 
Adriamycin, 284S 
Aequorin, 81B 
Aerobe, types of, 512 
Affinity chromatography, 132, 132F 
Affinity label, 402 
Affinity purification, 97 


A-form DNA, 323, 324F 
Agar, 200 
Agaropectin, 200 
Agarose, 200, 200F, 200S 
Aggregate, proteoglycan—hyaluronate, 213 
Aggregation symmetries of globular pro- 
tein, 173T 
Agouti-related peptide, 854 
AICAR, 804 
AICAR transformylase, 815F 
AIDS, 876B 
protease inhibitors for, 443B 
AIR carboxylase, 815F 
AIR synthetase, 815F 
Akt1, 708B 
Alanine 
catabolism of, 804 
pK, value for, 77T 
S, 84B 
structure of, 72S 
synthesis of, 793 
transamination of pyruvate to, 851F 
L-Alanine, 84B 
Alanine tRNA, yeast, 346F, 347F, 347G 
Albuterol, 482B 
Alcohol 
metabolic effects of, 855B 
structure of, 186S 
sugar, 189, 189S 
Alcohol dehydrogenase, 428, 428F, 553, 
553F 
in chemical reaction, 420T 
NAD*-dependent dehydrogenase and, 
579B 
structure of, 173G 
Alcoholic fermentation, 553 
Aldaric acid, 188 
Aldehyde, 188 
Alditol, 189 
Aldohexose, 182F, 183 
Aldol condensation, 544 
Aldolase, 431T 
Aldonic acid, 188 
Aldopentose, 182S, 183 
Aldose, 182-183, 183S 
Aldose reductase, 159F, 687B 
Aldosterone, 764 
Aldotetrose, 182S, 183 
Aliskiren, 424B 
Alkaline phosphatase, 420T 
Alkalosis, respiratory, 44B 
Alkaptonuria, 810B 
1-Alkly-2-Acetylglycerophosphocholine, 744 
Alkylating agent, 893 
Allantoic acid, 825 
Allantoin, 825 
Allantoinase, 825 
Allosteric effector 
bisphosphoglycerate as, 475 
2,3-bisphosphoglycerate as, 475F 
Allosteric enzyme, 395 
membrane-associated, 1016-1017 
Allosteric inhibitor, 457 
Allosteric modifier in fatty acid metabolism, 
736 
Allosteric regulation, 453, 456-467 
covalent modification and, 462-467, 
462F, 464F, 465F, 465S 
definition of, 453D 
general features of, 456-457 


in gluconeogenesis, 669-671, 671F 
of glutamine synthetase, 777, 778F 
of hemoglobin, 467-468, 473 
symmetry model for, 457-459 
All-trans-retinal, 230S 
p-Altrose, 182S 
Alzheimer’s disease, 172B 
protein deposits in, 988B 
a-Amanitin, 924, 924F 
Amide group, coplanar relationship of 
atoms in, 88F 
Amide plane, in proteins, 87, 136-137, 137F 
Amide-linked glycosyl phosphatidylinositol, 
257 
Amide-linked myristoyl, 257, 258F 
Amino acid, 70-92. See also Amino acid 
sequencing; Amino acid synthesis 
acid-base properties of, 76-79, 76F, 77T 
alpha, 780S 
in alpha-helix, 139 
basic, 74 
carboxyl and amino group reactions of, 
77-78 
catabolism of, 804-810 
of C-3 family, 804 
of C-4 family, 806 
of C-5 family, 806 
intermediates of, 804 
of lysine, 809 
of valine, isoleucine and methionine, 
807 
cationic form of, 76F 
chromatographic separation of, 85-86 
commonly in proteins, 71, '72S—73S 
dietary requirements for, 781B 
essential, 531, 781T 
glucogenic, 531 
gluconeogenesis and, 662 
in green fluorescent protein, 81B 
helix behavior of, 142T 
ionization of side chains of, 78—79 
ketogenic, 531 
membrane transport and, 286 
nonessential, 781T 
nonpolar, 74 
not found in proteins, 75S, 76 
nuclear magnetic resonance and, 83-85, 
84F 
optical and stereochemical properties 
of, 79-82 
phenylthiohydantoin, 86, 86F 
pK, value for, 77T 
polar, uncharged, 74 
rarely in proteins, 76 
structure of, 10S, 70-76, 70F, 71F 
synthesis of, 779-810 
in transmembrane helices, 251-252 
as weak polyprotic acid, 76-78 
Amino acid aminotransferase, branched- 
chain, 793 
Amino acid analysis of protein, 99-100 
Amino acid analyzer, 99 
Amino acid neurotransmitter, 1052-1053 
Amino acid residue, 89, 139, 433B 
Amino acid sequence 
of chymotrypsinogen, 434, 435F 
of leader peptides, 921F 
nature of, 110-117, 110F, 113F, 115F, 
116G, 117T 
number of, 101B 


pathological variants of, 117T 
reconstruction of, 105, 106F 
role of, in structure, 136 
Amino acid sequencing 
chemical methods for, 100-105 
cleavage of disulfide bridges in, 101, 
102F 
fragmentation of polypeptide chain 
and, 103-105, 104F 
polypeptide chain separation in, 101 
reconstruction of sequence in, 105, 
106F 
terminal analysis in, 102-103 
enzymatic methods for, 103 
by mass spectrometry, 105-109, 106T, 
107F-108F, 109F 
sequence databases and, 109-110 
Amino acid synthesis, 779-810 
aminotransferase reaction in, 782B 
of aromatic amino acids, 797-802, 797F, 
798F-801F 
of aspartate family, 787-793, 790, 791K, 
792K, 793 
histidine, 802, 803F 
from a-keto acid transamination, 780F, 
781-782 
of a-ketoglutarate family, 781B, 781T, 
783-787, 783F, 784F—786F, 787B 
metabolic intermediates in, 781T 
purine, 802, 804 
of pyruvate family, 793, 794F, 795F 
urea cycle in, 785, 786F, 787, 787B 
Amino sugar, 190 
Aminoacyl-tRNA, 305, 957G 
EF-Tu and, 973B, 973G 
Aminoacyl-tRNA binding, 969-972, 970F, 
973B 
Aminoacyl-tRNA synthetase 
in anticodon, 958 
in protein synthesis, 953, 955-958, 955T, 
956F, 957G 
interpretation of second genetic code 
by, 953, 955-958 
recognition of tRNA by, 956-958 
two classes of, 955 
a-Aminoadipate, 787 
a-Aminoadipic-6-semialdehyde, 787 
Aminoglycoside antibiotics, 981-982 
5-Aminoimidazole ribonucleotide, 815F 
5-Aminoimidazole-4-carboxamide ribo- 
nucleotide, 804 
B-Aminopropionitrile, 155B 
2-Aminopurine, 892-893, 893F 
Aminotransferase, 782 
branched-chain amino acid, 793 
catalytic mechanism of, 407 
leucine, 793 
Aminotransferase reaction, 782B 
Aminotriazole, 801B 
Ammonia channels, 277F 
Ammonium 
assimilation of, 775-776, 775F 
metabolic fate of, 774—776, 775F, 776F 
titration curve for, 40, 41F 
in urea cycle, 787B 
Ammonotelic animal, 810 
Amobarbital, 617S 
AMP, 295 
free energies of hydrolysis of, 57T 
glycogen phosphorylase and, 464—465 


inosinic acid as precursor to, 814, 815F, 
816-818, 820F 
structure of, 296S 
AMP deaminase, 823 
AMP-activated protein kinase, 845-847 
Amphibolic intermediate, 520 
Amphipathic helix, 997 
Amphipathic molecule, 33 
Amphiphilic helix, 153 
basic, 1033 
Amphiphilic molecule, 33, 33F 
Amplification, 354D 
a-Amylase, 673 
salivary, 196 
B-Amylase, 673 
Amylo-(1,4-1,6)-transglycosylase, 677-678 
Amyloid-8, 988B 
Amyloid plaque, 988B 
Amyloidotic polyneuropathy, familial, 172B 
Amylopectin, 195, 195S 
Amyloplast, 196 
Amylose, 195S, 196S-197S 
a-Amylose, 195 
Amytal, 617S 
Anabolic steroid, 764, 764F 
Anabolism 
amphibolic intermediates in, 520 
as biosynthesis, 518 
catabolism and, 517, 518 
metabolic integration and, 839 
NADP* and, 517 
NADPH in, 523, 523F 
pathways of, 520-521 
Anaerobe, types of, 512 
Anaerobic 
condition, lactate accumulation in, 553 
definition of, 559 
Analog 
base, mutation induced by, 892-893 
fluoro-substituted, 834B 
folate, 818B 
transition-state, 423 
Analyzer, amino acid, 99 
Anaplerotic reaction, 582, 584 
Anchoring of membrane protein, 256-257 
Androgen, 233, 763 
4-Androstenedione, 238S 
Anemia, sickle cell, 476-478 
Anfinsen, Christian B., 161-162, 987 
Anfinsen cage, 990 
Angel dust, 1053 
Angiotensin, 437 
Angiotensin-converting enzyme inhibitors, 
424B 
Anhydride 
phosphoric acid, 59-60 
phosphoric-carboxylic, 61, 63 
Animal 
acetyl-CoA carboxylase in, 725 
metabolic water and, 710 
nitrogen excretion in, 810 
pyrimidine biosynthesis in, 829-830 
transgenic, 889B 
uric acid oxidation in, 825-826 
Animal cell, 21F 
polysaccharides in, 204 
Anion exchange media, 128F 
Anionic form of amino acid, 76F 
Annexin protein, 1031 
Anomer, 184 


Index l-3 


Anomeric carbon atom, 184 
Anserine, 42, 42F 
Antagonist, 1050 
Anthranilate synthase, 799 
Antibiotic, 981-982, 983S 
Antibody, 176 
catalytic, 413 
diversity of, 902 
as enzyme, 413 
immune response and, 897 
screening of cDNA library with, 370 
Anticancer drug 
fluoro-substituted pyrimidines for, 835B 
folate analog as, 818B 
prenylation reactions in, 259B 
proteasome inhibitors in, 1003B 
Anticodon, 345 
aminoacyl-tRNA synthetase in, 958 
Anticodon loop, 953 
Antifreeze glycoprotein, 207, 207S 
Antigen, 413F 
definition of, 413D 
immune response and, 897 
O, 203-204 
proliferating cell nuclear, 874 
Antigenic determinant, 203 
Anti-inflammatory drug, nonsteroidal, 
750B-751B 
Antimicrobial agent 
fluoro-substituted pyrimidines for, 835B 
folate analog as, 818B 
Antioxidants, 713B 
Antiparallel beta-pleated sheet, 143-144 
Antiparallel chain in chitin, 198 
Antiparallel nature of double helix, 300F 
Antiport, 286 
Antiterminator, 922 
a,-Antitrypsin, 171B, 171G 
Antp, 936 
AP endonuclease, 891 
Apaf-1, 625F 
Apoenzyme, 385 
Apoprotein, 758T 
Apoptosis, 624-626 
Apoptosome, 625-626 
Apoptotic protease-activating factor 1, 625F 
APRT, 821 
Aptamers, 348 
Apurinic acid, 307 
Aquaporin-1, 254F 
araBAD operon, 918-919 
AraC, 919 
Arachidic acid, 220T 
Arachidonate, 747 
Arachidonate release, 748—749 
Arachidonic acid, 220T, 221G, 221S, 748F 
fatty acid synthesis and, 735-736, 736F 
arachidonic acid synthesized from, 736F 
Archaea, 17, 19 
Archaebacterium, rRNA of, 348F 
Arginine 
pX, value for, 77T 
structure of, 73S 
synthesis of, 783 
urea cycle and, 785 
Arginine finger, 501 
Argininosuccinase, 784 
Argininosuccinate synthetase, 784 
Aromatic amino acid, 797-802, 797F, 
798F-801F 


l-4 Index 


Arrestin, 1041 
Arrhenius equation, 388 
Arsenate, 546 
Artificial chromosome, 360 
Artificial substrate, 438F 
Asialoglycoprotein receptor, 209 
Asp®!, 616 
Asparaginase, 806 
Asparagine, 787 
biosynthesis of, 789F 
leukemia and, 789B 
pK, value for, 77T 
structure of, 72S 
Aspartate, 780S, 787, 1052-1053 
Aspartate aminotransferase, 159F 
Aspartate family of amino acids, 787-793, 
791F, 792F 
Aspartate transcarbamoylase, 826 
Aspartic acid 
pK, value for, 77T 
structure of, 72S 
Aspartic protease, 437, 439T, 440-441, 
441F, 441S 
Aspartyl-6-phosphate, 790 
B-Aspartyl-semialdehyde, 790 
B-Aspartyl-semialdehyde dehydrogenase, 790 
Aspirin, 749-750 
Assay, dye binding, 98B 
Assimilation 
ammonium, 775-776, 775F 
nitrate, 768, 769-770 
single-strand, 882-883 
Associated transducers and activators of 
transcription, 1037 
Assumption, steady-state, 390 
Asymmetric molecule, amino acids as, 79 
Asymmetry 
in amino acids, 70 
cell membranes as, 260-261 
ATCase, 826, 829 
Atherosclerotic plaque, 155B 
Atomic fluctuation in globular proteins, 
165, 165T 
ATP, 2, 3S, 296, 297S 
catabolism and, 517 
in cellular energy cycle, 521 
energy charge and, 843-844 
equilibrium ratio and, 67B 
Fo effects on, 614-616 
formation of, 66 
free energies of hydrolysis of, 57T 
in gluconeogenesis, 668—669 
glucose oxidation and, 622-623, 623T 
glucose-6-phosphate utilization and, 691, 
692 
glycogen phosphorylase and, 464 
in glycolysis, 537 
Hatch-Slack pathway and, 658-659 
hexose synthesis and, 647-648 
human requirement for, 66 
hydrolysis of, 62F, 63-65 
hydrolysis of phosphoric acid anhy- 
drides and, 59-60 
as intermediate energy-shuttle molecule, 
57-58 
light-driven synthesis of, 648-650, 649F, 
650F 
metabolic roles of, 843 
NADPH and, 840-841 


oxidation of palmitic acid and, 708-709 
phosphorylation potential and, 844 
proton gradient and, 611-620, 614F, 
616F, 617F, 618F, 619B, 619F 
pyruvate carboxylase and, 666 
in TCA cycle, 576 
in tricarboxylic acid cycle, 563 
triphosphate chain of, 60F 
ATP coupling, 841-843 
ATP coupling stoichiometry, 842-843 
ATP derivative, sulfide synthesis from, 
796-797 
ATP hydrolysis, myosin and, 488-490 
ATP sulfurylase, 319B 
ATP synthase, 612F, 614F 
ATP synthase inhibitor, 618 
ATP-ADP translocase, 618, 619F 
ATPase, 283 
ATPase activity, 485 
ATP-citrate lyase, 580F, 582, 724 
ATP-dependent kinase in purine synthesis, 
820-821 
ATP:p-glucose-6-phosphotransferase, 385 
ATP:GS:adenylyl transferase, 777 
Atrial natriuretic peptide, 1021 
Atropine, 1051 
Attenuation, transcription, 920 
Autocoid, 294B 
Autogenous regulation, 920 
Automated DNA sequencing, 318-320 
Autophosphorylation, 1018, 1020 
Autoregulation, 920 
Autotroph, 511D 
Avian sarcoma virus, 1023 
Axial, 187 
Axis, twofold rotation, 174 
Axonemal dyneins, 494 
Axons, 495 
Azaserine, 814, 816S 
Azide, 617-618 
AZT, 877, 877S 


B lymphocyte, 897 

Baa helix, 1033 

Bacillus cereus, 275-276, 276F 

Bacillus thuringiensis, 257B 

Backbone, 167D 

Bacteria 
cell walls of, 201—204, 202S, 203F 
denitrifying, 769 
nitrifying, 768 
pyrimidine biosynthesis in, 829-830 
structure of, 19F 

Bacterial cell, virus infecting, 21 

Bacterial pathway of ornithine biosynthesis, 

784F 

Bacterial transformation experiment, 359F 

Bacteriophage, 21 

Bacteriophage T7 RNA polymerase, 908F 

Bacteriorhodopsin, 250-251, 251F, 285, 616 
light-driven proton transport and, 285, 

285F 

Bactoprenol, 231S 

Balance, nitrogen, 531 

BAR domain binding, 267, 267F 

Barrier, blood-brain, 1056 

Base 
chemical mutagens reacting with, 893 
conjugate, 36 


discriminator, 957 
methylated, 304 
nitrogenous, 291-299, 292F, 2938S, 294F, 
298S 
phosphate-linked, 342S 
phosphite-linked, 342S 
RNA, 305S 
Base analog, mutation induced by, 892-893 
Base excision repair, 887, 891, 891F 
Base pair, Watson-Crick, 301S, 321 
Base pairing, intrastrand, 342, 344-345 
ribosomal RNA and, 346, 348 
Base sequence of nucleic acids, 299 
Basic amino acid, 71, 76 
Basic amphiphilic alpha-helix, 1033 
Basic region, 938 
Basic region-leucine zipper motif, 938 
BCA method, 98B 
B-DNA, 321, 322F, 324F, 325T, 936 
Bear, polar, 223B 
Behavior, saturation, 271 
Behenic acid, 220T 
B-Bend, 145 
Benois, Christophe, 543B 
Benzoylalanine methyl ester, 438S 
Beta sheet 
antiparallel pleated, 143-144 
fibroin and -keratin as, 149 
of globular protein, 152-153 
parallel pleated, 143 
pleated, 141F, 142-144, 143B, 144F 
Beta-hydroxyl group, 706-707 
Beta-oxidation 
fatty acids and, 701-702, 701F, 704F 
peroxisomal, 714 
Beta-turn, 144-145, 145F 
Bicarbonate buffer system, 43B 
Bicarbonate ion in urea cycle, 787B 
Bicinchoninic acid, 98B 
Bifunctional enzyme, 671 
Bijvoet, J. M., 82B 
Bilayer 
lipid, 244-245, 244F 
phase transitions, 263-264 
Bile acid, 761, 761F 
Bile salt, 700 
Bimolecular reaction, 387 
Bimosiamose, 207B 
Binding 
cooperative, 178 
nucleotide, 832-833 
oxygen 
by hemoglobin, 469 
in myoglobin, 469 
proteoglycans and, 209-211 
Binding change mechanism, 613 
Binding isotherm, 122 
Binding site, ribosome, 966F, 967 
Bioinformatics, 367D 
Biological structure, 2 
Biomolecular recognition, 14-15 
Biomolecule 
anabolic pathways and, 520 
carbon as, 4—5 
definition of, 1 
dimensions of, 7T 
high-energy, 57-63 
radius of, 13T 
structural organization of, 5 


Biosphere, energy in, 2, 512-513 
Biosynthesis 
anabolism as, 518 
of bile acid, 761-762, 761F 
glucose and, 683-693 


lipid, 722-767. See also Fatty acid synthe- 


sis; Lipid Biosynthesis 
sphingolipid, 744-745 


Biosynthetic process, TCA cycle fueling, 581 


Biot, Jean-Baptiste, 82B 
Biotin, 524T, 665-666 
Biotin carboxyl carrier protein, 724 
Biotin carboxylase, 724 
BiP, 996 
Bird, migratory, 709-710 
Bisabolene, 230S 
Bisphosphate aldolase, 543-544, 545F 
1,3-Bisphosphoglycerate, 61, 63 
free energies of hydrolysis of, 57T 
hydrolysis reaction of, 62F 
NAD*-dependent dehydrogenase and, 
579B 
2,3-Bisphosphoglycerate, 475, 475F, 476F 
Bisubstrate reaction, 403 
Bloch, Konrad, 753B 
Blood 
bicarbonate buffering system in, 43B 
pH of, 44B 
sickle cell anemia and, 476-478 
Blood clotting, 455 
Blood pressure, 424B 
Blood-brain barrier, 1056 
Blotting 
Northern, 364B 
Southern, 364B 
Western, 364B 
Boat conformation of sugar, 187, 187S 
Body 
cell, 1044 
composition of, 5T 
ketone, 717-718, 717B, 718F 
Bohr, Niels, 474 
Bohr effect, 474 
Boltzmann’s constant, 51 
Bond 
covalent, 4—5, 4F, 430, 430F 
glycosidic, 192, 676, 678F 
high-energy, 58 
hydrogen, 12-14 
low barrier, 431-432, 432F 
in water, 29-30 
ionic, 14F 
noncovalent, 11 
peptide, 71D, 86-87, 87F, 88F 
Bone remodeling, 282-283 
Bovine pancreatic ribonuclease A, 94F 
Bovine ribonuclease, 152, 152S 
Box 
Pribnow, 908 
TATA, 927, 927F 
Boyer, Herbert, 358 
Boyer, Paul, 613-614 
Bradford assay, 98B 
Bradykinin, 1009F 
Brain 
docosahexaenoic acid in, 736B 
energy metabolism of, 848T 
metabolic role of, 847, 848F, 849 
Brain natriuretic peptide, 1021 


Branch migration, 879, 882 

Branch site, 942 

Branched polysaccharide, 674 

Branched-chain amino acid aminotrans- 
ferase, 793 

Branched-chain fatty acid, 714, 716, 716F 


Branched-chain a-keto acid dehydrogenase 


complex, 808 
Branching, glycogen, 677-678, 678F 
BRCAI, 885B 
BRCA2, 885B 
Breakage of DNA strand, 877-878 
Breast cancer gene, 885B 
Bridge 

disulfide, 101, 102F 
phosphodiester, 297, 298S 
salt, 473F 
Bromoaspirin, 751S 
Bromocriptine, 1054B 
Bromodomain, 932 
5-Bromouracil, 892-893, 892F 
Bronchodilators, 482B 
Brown fat, 852 
Buchanan, John, 813 
Buffer, 41-44, 41F 
bicarbonate as, 43B 
Building blocks, 8F 
anabolic pathways and, 520 
of macromolecules, 5 
of proteins, 70-76, 71S, 72F, 72S-738, 
75S 
Bulge, 343, 343F 
a-Bungarotoxin, 1051, 1051G 
Burst kinetics, 436F 
Butterfly, 282B 
bZIP motif, 936 


c cytochrome, 603 
Cig fatty acid, 733-734 
C gene, 900 
C3 plant, 658 
Ca?*-ATPase, 280 
Cactus, 659 
Cage, Anfinsen, 990 
Calcisome, 1031 
Calcitonin, 1009F 
Calcium, as second messenger, 1031 
Calcium carbonate, 512B 
Calcium ion, hydrolysis of ATP and, 64 
Calcium-binding protein, 1031 
Calcium-modulated protein, 1031 
Calmodulin, 1031 

alpha-helix from, 154G 

structure of, 1032G 
Calmodulin-binding domain, 495 
Calnexins, 213T, 215T 
Caloric homeostasis, 847D 
Calorie, 49 
Calorimeter, 50F 


Calvin—Benson cycle, 652-653, 653T, 6054F, 


655 
cAMP 
protein kinase A activated by, 1026, 
1027G 


as second messenger, 466, 1025-1026, 
1026F, 1026G, 1026T 
cAMP receptor protein, 917 
cAMP response element, 936B 
cAMP-dependent protein kinase, 670 
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Camphene, 230S 
Cancer 
breast, 885B 
fluoro-substituted pyrimidines for, 835B 
folate analog for, 818B 
oncogene and, 1029B 
positron emission tomography in, 555B 
prenylation reactions in, 259B 
proteasome inhibitors for, 1003B 
as protein folding disease, 172B 
telomeres and, 340B 
Canola oil, 220 
CAP, 916, 918F 
Cap structure, 940 
Capillary movement, 45 
Capping, 940 
helix, 141F, 142 
Capsid, 22 
Carbamoyl phosphate synthetase I, 783 
Carbamoyl phosphate synthetase I, 826, 
827F 
Carbanion intermediate, resonance-stabi- 
lized, 711F 
Carbanion mechanism, 577F 
Carbohydrate, 181-218 
exercise and, 680B 
glycoproteins and, 204-209. See also 
Glycoprotein 
monosaccharide, 182-191. See also 
Monosaccharide 
naming of, 181 
nutritional, 531 
oligosaccharide, 191-193, 192F, 192S 
polysaccharide, 194-204. See also 
Polysaccharide 
proteoglycans and, 209-213, 210F, 211F, 
2118S, 212F 
structural code, 213-215 
Carbohydrate recognition domains, 215 
Carbohydrate-responsive element-binding 
protein, 693 
Carbon 
acetate as source of, 587-589 
anomeric, 184 
as biomolecule, 4—5 
carboxyl, 580F 
covalent bonds of, 6S 
methyl, 580F 
in TCA cycle, 579, 580K, 581 
Carbon cycle, 512-513 
Carbon dioxide 
biospheric energy flow and, 512-513 
calcium carbonate and, 512B 
in desert plants, 659 
hemoglobin and, 474—475 
in photosynthesis 
limited fixation of, 656-659, 657K, 
658F 
organic molecules from, 650-656, 
651F, 651G, 653T, 654F, 655F 
in tricarboxylic acid cycle, 571-575, 
571F, 573F, 576 
Carbon dioxide fixation, 630 
Calvin—Benson cycle and, 652, 654F 
Hatch-Slack pathway and, 656-659, 
658F 
light regulation of, 655-656, 655F 
photorespiration limiting, 656-659, 
657F, 658F 
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Carbon monoxide, 617-618 
Carbonic anhydrase, 16G 
in chemical reaction, 420T 
Rear/ Kn ratio of, 394T 
Carbonium ion, 577S 
Carbonium ion mechanism, 577F 
Carbonyl cyanide-p-trifluoromethoxyphenyl 
hydrazone, 618F 
B-Carbonyl group, in fatty acid synthesis, 
729-730 
Carbonylphosphate, 665 
a-Carboxyglutamic acid, 75S 
2-Carboxy-3-keto-arabinitol, 651 
Carboxykinase, PEP, 582, 584, 666-667 
Carboxyl carbon, 580F 
Carboxylase 
biotin, 724 
PEP, 582, 583F 
pyruvate, 666 
Carboxylate, 188 
Carboxypeptidase, 103 
Cardiac glycoside, 279, 280S, 282B 
Cardiolipin, 225, 2258S, 624F 
Cardiotonic steroid, 279, 280G, 280S, 282B 
Carnauba wax, 229 
Carnitine, 702-703 
Carnitine acyltransferase I, 703 
B-Carotene, 6F 
Carrier, mobile electron, 606 
Carrier protein 
acyl, 727, 727F 
biotin carboxyl, 724 
ubiquitin, 998 
Cartilage, proteoglycans in, 209-210, 212F, 
243-244 
Cascade 
clotting, 455, 455F 
enzyme, 466-467 
signaling pathways and, 1011 
phosphorylase, 681-683 
Caspase-recruitment domain, 625 
Cassette, expression, 376-377 
Cat allergen, 169F 
Catabolism 
anabolism and, 518 
definition of, 517D 
fatty acid, 701-721. See also Fatty acid 
catabolism 
hydrogen and electrons released in, 
522S 
metabolic integration and, 839 
NAD* and, 517, 522, 522F 
pathways of, 520-521 
purine, 821, 823-826, 824F, 826F 
of pyrimidine, 830, 830F 
stages of, 519F 
tricarboxylic acid cycle and. See 
Tricarboxylic acid cycle 
Catabolite activator protein, 148B, 916, 
918F 
Catalase 
Rear / Kn ratio of, 394T 
naming of, 384 
Catalysis, 422F 
acid-base, 430-431 
covalent, 430, 430F 
glycogen synthase and, 676, 678F 
in isomerization of glucose-6-phosphate, 
541 


metal ion, 432-433 
ribonucleotide reductase and, 832-833 
Catalytic antibody, 413 
Catalytic power, 383 
definition of, 383D 
of enzyme, 178 
Catalytic proteins, 121 
Catalytic site, 831 
Catalytic triad, 434 
Catecholamines, 1047, 1054B, 1056, 1056F 
Catechol-O-methyl-transferase, 1054B 
CATH, 158 
Cathepsin D and E, 437 
Cation exchange media, 128F 
Cation exchangers, 127 
Cationic form of amino acid, 76F 
Cationic free radical, 636 
Caveolae, 267 
Caveolins, 267, 268F 
CCR5 receptor, 1042 
CDKs, 872 
cDNA, reverse transcriptase-driven synthe- 
sis of, 366F 
cDNA library, 362-363 
screening of, 370 
CDP-diacylglycerol, 722F, 740F, 741-743 
CDP-ethanolamine, 739-741 
CDRs, 897 
Celebrex, 751S 
Cell, 1 
adipose, 697 
electron micrograph of, 698F 
pentose phosphate pathway and, 
684 
animal, 21F 
ATP in, 521 
buffering system in, 42 
culture of, 203 
eukaryotic, 7, 17. See also Eukaryote 
DNA replication in, 872-874, 
872F-873F 
excitable, 1043 
glial, 1043 
glucose-6-phosphate utilization in, 691, 
691F 
lipids’ role in, 237 
membranes of, 242-290. See also 
Membrane 
molecular organization in, 8F 
multisubstrate reactions in, 409 
neurotransmitter, 1046 
nucleotide synthesis in, 813 
phosphorylating glucose and, 539 
postsynaptic, 1046 
presynaptic, 1046 
prokaryotic, 17. See also Prokaryote 
major features of, 20T 
structure of, 19-20, 19F 
Cell body, 1044 
Cell growth, proteoglycans and, 211-212 
Cell lysis, 22 
Cell membrane, 9, 19, 20T 
of animal cell, 23T 
composition of, 243 
of plant cell, 24T 
Cell surface polysaccharide, 204 
Cell wall, 19, 20T 
bacterial, 201-204, 202S, 203F 
of Escherichia coli, 302F 


of gram-positive bacteria, 203F, 204 
of plant cell, 24T 
Cell-cell communication, 1048F 
Cellobiose, 192, 192S 
Cellular energy, 843-844, 844F 
Cellular lipidomics, 237-238 
Cellular metabolism, 15, 16F 
Cellulase, 198 
Celluloid, 201B 
Cellulose, 196-198 
as insoluble fiber, 532 
structure of, 197S, 198S 
Cellulose acetate, 197 
Center 
chiral 
of monosaccharide, 183 
naming of, 84B 
decoding, 964, 969 
factor-binding, 972 
peptidyl transferase, 969 
reaction, 641-647, 642F, 643F, 643G 
Ceramide, 227, 746-747, 1031 
Ceramide-activated protein kinase, 1031 
Cerebroside, 746-747 
Cerzyme, 207B 
Cesium chloride centrifugation, 332B 
Cetyl palmitate, 229B 
CF,CF)-ATP synthase, 648-649 
cfos:c-Jun, 939G 
Chain 
in chitin, 198 
heavy, 484 
hydrocarbon, 247 
light, 484, 484F 
peptide 
initiation of, 966-968, 966F, 967F, 
969T, 970F, 976-979, 977F 
termination of, 974, 975F 
polynucleotide, cleavage in, 309F 
polypeptide, 89 
in E. coli polymerase, 866, 866F 
fragmentation of, 103-105, 104F, 
105T 
myoglobin, 468 
separation of, 101 
Chain elongation, 911-912 
in fatty acid synthesis, 733, 733K, 734 
in fatty acid unsaturation, 734 
Chain termination, 316, 912 
Sanger’s, 317-320 
Chain termination protocol, 317-318 
Chair conformation of sugar, 187, 187S 
Chang, Annie, 358 
Changeux, Jean-Pierre, 457 
Channel 
ligand-gated ion, 1047 
oligomeric ion, 1047 
voltage-gated ion, 1044 
Channel proteins, 273F 
Channeling, metabolic, 828-829 
Channon, H. J., 753B 
Chaperone, molecular, 168 
definition of, 988 
Hsp70, 988D, 989F, 991F 
Chaperonin, 168, 988, 990 
Chargaff, Erwin, 300 
Chargaff’s analysis of double helix, 300 
Chargaff’s rules, 300T 
Charge in passive diffusion, 271 


Checkpoint in DNA replication, 872 
Chemical forces, weak, 11-12, 11T 
Chemical kinetics, 386 
Chemical methods 
of amino acid sequencing, 100-105. See 
also Amino acid sequencing 
of polypeptide cleavage, 105T 
of polypeptide fragmentation, 105 
Chemical mutagen, 893, 894F 
Chemical reaction mechanisms, 427B 
Chemical shift, 83-84, 85F 
Chemical synapse, 1046 
Chemical synthesis 
of gene, 340 
of nucleic acid, 339-340 
Chemiosmotic coupling, 611 
Chemiosmotic hypothesis, 609 
Chemiosmotic photophosphorylation, 648 
Chemoautotroph, 512T 
Chemoheterotroph, 512T 
Chemotherapy 
fluoro-substituted pyrimidines for, 835B 
folate analog for, 818B 
prenylation reactions in, 259B 
proteasome inhibitors in, 1003B 
Chemotroph, 511D 
Chemotrophic organism, 56 
Chimeric construct, 355 
Chimeric DNA molecule, 371-372 
Chimeric plasmid, 355 
Chip 
DNA, 361B 
gene, 367-368, 368F 
Chiral center 
of monosaccharide, 183 
naming of, 84B 
Chiral molecule 
amino acids as, 70, 79-82 
naming of, 80 
Chitin, 190, 198, 198S, 200 
Chloramphenicol, 983S 
Chloride channels, 277F 
Chlorophyll, 633D 
solar energy captured by, 633-634, 
634F-636F, 6348-6358, 636-637 
structure of, 634S 
Chlorophyll a, 6F 
Chloroplast, 20, 22, 630-631, 631F 
CF\CF)-ATP synthase in, 648-649 
function of, 9 
light-induced changes in, 655, 655F 
Cholecystokinin, 854 
Cholesterol, 233, 233G, 233S, 238S 
elevated, 760-761 
lovastatin and, 755B 
statin effects on, 424B, 755B 
structure of, 6F, 751S 
synthesis of, 750-757, 755F, 756F 
Cholesterol ester, 757—758 
Cholesterol ester transfer protein, 758 
Cholic acid, 233, 234S, 761F 
Choline plasmalogen, 227G, 227S 
Choline sphingomyelin, 228F 
Cholinergic synapse, 1047 
Chondroitin, 201 
Chondroitin-4-sulfate, 200S 
Chondroitin-6-sulfate, 200S 
Chorionic gonadotropin, 1009F 
Chorismate, 797—798 


Chorismate mutase, 442—448, 444F, 799 
Chorismic acid, 797, 798F 
Chromatin, 336-337 
Chromatin-remodeling complex, 932 
Chromatography 
affinity, 132, 132F 
amino acid, 85-86 
in amino acid separation, 99 
high-performance liquid, 132 
hydrophobic interaction, 132 
size exclusion, 130-131 
Chromodomain, 932 
Chromophore, 81B, 81G 
Chromosomal protein, nonhistone, 
302-303, 336 
Chromosome, 20 
artificial, 360 
DNA of, 302 
eukaryotic, 336-339, 336T, 338F 
model of, 338F 
organization of, 338 
replication of, 863F, 874-876 
telomeres and, 874, 875B, 875F 
Chromosome pairing, 878 
Chylomicron, 700, 757 
Chymotrypsin, 435G, 436-437, 436F, 438S 
in chemical reaction, 420T 
covalent intermediates of, 431T 
in polypeptide fragmentation, 103 
size of molecule, 90T 
structure of, 96F, 96G 
Chymotrypsinogen, 454—455, 454F 
CIC channels, 276 
cis double bond in fatty acid synthesis, 
733-734 
Citrate, 726-727 
Citrate synthase 
alpha-helix of, 153, 154G 
as dimer, 572 
in tricarboxylic acid cycle, 571-574, 
571K, 572G 
Citric acid cycle. See Tricarboxylic acid cycle 
Citronellal, 230S 
Claisen rearrangement, 443 
Clamp loader, 867 
Class, 158 
Clathrate structure, 31, 32F 
Clathrin-mediated endocytosis, 1041 
Cleavage 
by bisphosphate aldolase in glycolysis, 
543-544, 545F 
of disulfide bridge, 101, 102F 
homolytic, 711 
oligosaccharide, 208S, 209 
in polynucleotide chains, 309F 
proteolytic, in post-translational process- 
ing, 993 
in tricarboxylic acid cycle, 564, 566 
Cleft, synaptic, 1044, 1048, 1050 
Cloned fusion protein, 371T 
Cloned gene, 369-372, 369F, 370F 
Cloning, 354-360, 354D 
directional, 357, 358F 
of genes, 363F 
plasmids in, 354-360, 354D, 355F, 356F, 
359F 
Cloning site, 355 
Cloning vector, plasmid as, 355, 355F 
Closed promoter complex, 908 
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Closed quaternary structure, 177 
Closed system, 48, 49F 
Clostripain, 103 
Clot, blood, 455 
Clotting, 455, 455F 
Clotting factor, 232B 
Cluster, iron-sulfur, 573, 573F 
Coagulation pathway, 455, 455F 
Coated pit, 758 
Coated vesicle, 758 
Coaxial stacking, 343F, 344 
Cobalamin, 524T 
Cobratoxin, 1050, 1051G 
Cocaine, 1054B-—1055B 
Co-chaperonin, 990 
Code 
genetic, 952-953, 953F, 954B, 954T 
second, 953, 955 
nonoverlapping vs. overlapping, 953F 
operational, 958 
triplet, 952-953 
Codon, 345, 907B 
amino acids specified by, 953 
definition of, 952D 
DNA translation and, 292F 
multiple, 921 
nonsense, 960-961 
Codon-—anticodon pairing in protein syn- 
thesis, 958-961, 959F, 959T, 961T 
Coefficient, sedimentation, 303 
Coenzyme (s), 385, 523, 568B-569B 
Coenzyme A, 702, 727F 
Coenzyme Q 
cytochrome cand, 603-606 
electron transport and, 598 
functions of, 570B 
oxidation states of, 600F 
reduction of, 600F, 600S, 601-603, 602F 
Coenzyme Q-cytochrome c reductase, 598 
Cofactor, enzyme, 385 
Cognate tRNA molecule, 955, 956F 
Cognition, 936B 
Cohen, Stanley, 358 
Cohesin, 339 
Coil, 153 
Coiled coil, 147 
myosin, 485, 485F 
Coiled-coil motif, 148B 
Co-inducer, 914D 
Colchicine, 284S, 494B 
Collagen 
amino acid composition of, 99 
fibers of, 151F 
structure of, 150-151, 151F, 151S 
Collagen-related disease, 155B 
Collective motion in globular proteins, 165, 
165T 
Colligative properties of water, 33-34 
Colony hybridization, 362, 362F 
Combinatorial DNA library, 361B 
Combinatorial exon, 945 
Combined immunodeficiency syndrome, 
severe, 378B, 822B 
Committed step in fatty acid synthesis, 724 
Communication, cell-cell, 1048F 
Compartment 
in cell, 9 
chloroplast, 655, 655F 
mitochondrial functions in, 592-593 
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Compartmentalized pyruvate carboxylase, 
666 
Competing resonance, 60 
Competitive inhibition, 398-399, 399B, 
399F 
Complement regulatory factor, 212 
Complementarity, structural, 14, 14G 
Complementarity-determining region, 897 
Complex lipid synthesis, 737-747 
Complexin, 269 
Concentration 
free energy changes and, 54 
hydrolysis of ATP and, 65 
reduction potential and, 596 
Concentration gradient, 270 
Condensation 
of acetyl-CoA, 729 
aldol, 544 
Perkin, 571 
Condensin, 339 
Configuration 
of monosaccharide, 183 
protein, 96-97, 97F 
Conformation, 2 
of hemoglobin, 471, 473 
of myoglobin, 469 
protein, 96-97, 97F 
Conformational change, in globular pro- 
tein, 165 
Conjugate acid, 36 
Conjugate base, 36 
Conjugated protein, 120 
Consensus sequence, 908, 908D, 928T 
Constant 
coupling, 84 
dielectric, 30-31, 31T 
equilibrium, 59, 67B 
Faraday’s, 59, 271 
ionization, 37 
Michaelis, 391 
rate, 386 
Constitutive exon, 945 
Constitutive expression, 915 
Contact distance, van der Waals, 12 
Contact inhibition, 203F 
Contraction, muscle, 481—490. See also 
Muscle contraction 
Contrahelicase, 870 
Control system, positive vs. negative, 
917-918 
Conventions in expressing nucleic acid and 
protein sequences, 907B 
Conversion, metabolite, in compartmental- 
ized pyruvate carboxylase, 666 
Converter enzyme, 453 
Cooperative binding, by hemoglobin, 469 
Cooperative substrate binding, 456 
Cooperativity, 178 
Copper, 598 
CorA, 276 
Core, hydrophobic, 167D 
Core octamer, histone, 302F 
Core oligosaccharide, 205S 
Core polymerase, 907, 908 
Co-repressor, 914D 
Corey, Robert, 139 
Cori, Carl, 670B 
Cori, Gerty, 670B 
Cori cycle, 669F, 670B 


Corticosteroid, 763, 764 
Cortisol, 233, 764 
glycogen metabolism and, 683 
Cotton, 196-197 
Coumadin, 232B 
Couple, redox, 594 
Coupled process, 66 
ATP and, 67B 
Coupling 
chemiosmotic, 611 
in gluconeogenesis, 668-669 
Coupling constant, 84 
Coupling mechanism, 488-490 
Coupling stoichiometry, 841-843 
Covalent bond, 4—5, 4F, 6S, 430, 430F 
Covalent catalysis, 430, 430F 
Covalent electrophilic catalysis, 430 
Covalent intermediate, 430 
Covalent modification 
allosteric regulation and, 462-467, 462S, 
463F, 464F, 465F, 465S 
enzyme regulation by, 453 
glutamine synthetase and, 777, 778F, 
779 
glycogen synthesis and, 678-679 
COX, 750B-751B, 750G 
CP43 subunit, 644 
CP47 subunit, 644 
Cpn60 chaperonin, 168 
CPS-I, 783 
CPS-II, 826, 827F, 829B 
CRE, 936B 
C-reactive protein, 215, 215F 
Creatine kinase, 420T 
Creatine phosphate, 57T 
Creatine supplement, 850B 
CREB-type transcription factor, 936B 
Creutzfeldt-Jakob disease, 172B 
Crick, Francis H. C., 862, 959 
Cristae, 593 
Crixivan, 443S 
Cronin, John, 83B 
Cross-bridge, 484 
Crosslink in Gram-positive cell wall, 202S 
Cross-over, 166 
Crotonase, 706 
Reat/ Kn ratio of, 394T 
CRP, 917 
Cruciform structure, 327, 328F 
CTD, 926 
C-terminal analysis, 103 
C-terminal domain, 926 
C-terminal end, 100 
CTP, 828F, 829 
CTP synthetase, 829 
C-type lectins, 213T 
Culture, cells in, 203 
Curare, 1050 
Curve 
oxygen saturation, 474F 
oxygen-binding 
of hemoglobin, 468F, 470-471 
of myoglobin, 468F, 470 
sigmoid, 456 
substrate saturation, 389 
titration, 39 
Cyanide, 617-618 
Cyanobacteria, 19 
Cyanogen bromide, 103, 104F, 105 


Cycle 

Calvin—Benson, 652, 653, 653T, 654F, 
655 

carbon, 512-513 

elongation, 968, 969T 

energy, ATP in, 521 

glyoxylate, 587-589, 587F 

Krebs. See Tricarboxylic acid cycle 

nitrogen, 769F 

oxygen, 512-513 

phosphorylation—dephosphorylation, 
736 


purine nucleoside, 823 
ribosome life, 976F 
substrate, in gluconeogenesis, 672-673 
tricarboxylic acid, 532B, 563-591. See 
also Tricarboxylic acid cycle 
urea, 783, 785, 786F, 787, 787B 
Cycler, thermal, 373 
Cyclic adenosine 5'-monophosphate, 296S 
Cyclic AMP. See cAMP 
Cyclic AMP-dependent protein kinase, 
461F 
Cyclic electron transfer pathway, 641 
3',5'-Cyclic GMP, 296, 296S 
Cyclic hemiketal, 184 
Cyclic phosphodiester, 296, 296S 
Cyclic photophosphorylation, 649-650, 
650F 


Cyclic symmetry, 174 
Cyclin, 872 
Cyclin-dependent protein kinase, 872 
Cyclization, 747 
Cycloheximide, 983S 
Cyclooxygenase, 747, 750B-751B 
Cyclophilin A, 429, 429F 
Cystathionine, 790 
Cysteine, 795 
catabolism of, 804 
pK, value for, 77T 
structure of, 73S, 75 
Cystic fibrosis, 172B, 378B 
Cytidine, 294, 295S 
Cytidine 5'-diphosphate, 296 
Cytidine 5'-monophosphate, 295, 296S 
Cytidylic acid, 295 
Cytochrome, 598 
Cytochrome a, 603 
Cytochrome b, 734 
Cytochrome b; reductase, 734 
Cytochrome bs57, 769 
Cytochrome bc, complex, 604, 604G 
Cytochrome b6/f complex, 640-641, 641F 
Cytochrome c¢ 
amino acid sequences of, 112-113, 113F 
apoptosome assembly triggered by, 
625-626 
backbone structures of, 167F 
binding of, 607F 
cardiolipin and, 624F 
coenzyme Q and, 603-606 
mitochondrial, 606F 
oxygen and, 606-607 
phylogenetic tree for, 113, 114F 
size of molecule, 90T 
Cytochrome c oxidase, 598, 608, 608F, 
608G 
Cytochrome oxidase, 607F 
Cytochrome P-450, 717 


Cytochrome subunit, 642F 
Cytoplasmic calcium ion, 1031F 
Cytoplasmic dyneins, 494 
Cytoplasmic membrane, 242 
Cytosine, 292, 292F, 301S, 307S 
Cytosine methylation, 326 
Cytoskeleton, 20 

of animal cell, 23T 

microtubules of, 490 

in plant cell, 24T 
Cytosol, 601 

in prokaryotic cell, 20T 
Cytosolic acetyl-CoA, 723, 723F 
Cytosolic NADH, 620-623, 621F, 622F, 

623T 

Cytosolic ribosome, 964-965, 965T 


D gene, 900 

D1 subunit, 644 

D2 subunit, 644 

DAG, 1029 

DAHP, 798 

Dark reaction, photosynthetic, 632 
Database, amino acid sequence, 109-110 
dCMP deaminase, 835 

Deadenylylation, 777 

Debranching enzyme, 674, 674F 


Decarboxylase, pyrophosphomevalonate, 755 


Decarboxylation 
in fatty acid synthesis, 729 
isocitrate dehydrogenase and, 574 
of pyruvate, 566-567 
Decay system, nonsense mediated, 946 
Decay-accelerating factor, 206 
Decoding center, 964, 969 
Decorin, 209, 210F 
Degeneracy, third-base, 953 
Degenerate code, 953 
Degenerate oligonucleotide, 362 
DegP, 1003 
Degradation. See also Catabolism 
acetylcholine, 1050F 
Edman, 102 
of fructose-2,6-bisphosphatase, 672F 
glycogen, 680-683 
of lipoproteins, 758, 760 
of polyunsaturated fatty acids, 714 
protein, 208-209 
of protein, 998-1005 
purine, 821, 823-826 
of pyrimidine, 830 
Dehydratase, 799 
Dehydration of enolase, 549-550, 550F 
Dehydration synthesis, 191 
Dehydrogenase, 522 
B-aspartyl-semialdehyde, 790 
dihydroorotate, 826 
glutamate, 774, 774F 
glycerol-3-phosphate, 738 
glycerophosphate, 621 
homoserine, 790 
IMP, 819 
isocitrate 
electron transport and, 574 
regulation of, 586 
isopropylmalate, 793 
NAD*-dependent, 406, 579B 
NADPH-specific malate, 657 
Deisenhofer, Johann, 642 


Deletion mutation, 893 
Delivery, gene, adenovirus-mediated, 377, 
378F, 379 
Denaturation, 15, 15F, 159-161, 160F 
of DNA, 330-333, 331K, 333F 
protein, 51T 
Denitrifying bacteria, 769 
Density 
DNA, 332B 
water, 45 
Dental plaque, 196 
Deoxy sugar, 189 
Deoxycholic acid, 233, 234S 
3-Deoxy-D-arabino-heptulosonate-7- 
phosphate, 798 
2-Deoxy-a-b-ribose, 189S 
Deoxyguanosine, 325S 
Deoxyhemoglobin, 473 
B-2-Deoxyribofuranose, 295 
Deoxyribonucleic acid. See DNA 
Deoxyribonucleoside, 294 
Deoxyribonucleotide 
formation of, 830-833, 831F, 832F 
regulation of, 833F 
Deoxyribose—PO,, 320F 
2-Deoxy-streptamine, 981 
Depolarized membrane, 1044 
Deramaxx, 751S 
Dermatan sulfate, 200S 
Desaturase, 744 
Desert plant, 659 
Design, drug, 404B 
Designer enzyme, 414 
Desipramine, 1055B 
Desmolase, 238S, 762 
Destabilization 
electrostatic repulsion and, 60 
in enzyme-substrate complex, 421-423 
Dextran, 196 
Dextrin, limit, 674 
Dextrorotatory behavior, 80 
DHA, 736B 
DHFR gene, 940, 940F 
Diabetes mellitus, 188 
cataract in, 687B 
glucokinase and, 539 
gluconeogenesis inhibitors and, 668B 
glycation end products in, 677B 
ketone bodies in, 717B 
mouse model of, 851B 


Diacylglycerol, 235, 739, 740K, 741-743, 1029 


Diacylglycerol acyltransferase, 740 
Diagram, energy, 388F 
Dialysis bath, 127, 129F 
Diastase, 192 
Diastereomer, 80, 184 
Dicarboxylic acid, 717 
Dicer, 375 
Dideoxy method, 316, 318F 
Dideoxy sequencing gel, 318-320 
Dielectric constant of water, 30-31, 31T 
Diet. See also Nutrition 
fad, 532B 
high-fat, 697 
Dietary fiber, 532 
Diffusion 
membrane channel protein facilitation 
of, 272 
passive, 271, 272F 
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Digestive serine protease, 434, 435F 
Digitalis, 282B 
Dihedral symmetry, 174 
Dihydrofolate reductase, 429 
Dihydrofolate reductase gene, 940, 940F 
Dihydroorotase, 826 
Dihydroorotate dehydrogenase, 826 
Dihydrouridine, 305S 
Dihydroxy acid hydratase, 793 
Dihydroxyacetone, 183S 
Dihydroxyacetone phosphate, 554, 738, 
743-744 
3,4-Dihydroxyphenylalanine, 1056 
Diisopropylfluorophosphate, 438F, 458S 
Dimer 
citrate synthase as, 572 
hemoglobin, 469F 
of identical subunits, 441 
pyrimidine, 890 
Dimethoxytrityl, 341S 
Dimethylallyl pyrophosphate, 753 
Dinucleotide, 342S 
Dipeptide, 89D 
DIPF, 1051, 1051S 
Diphosphate kinase, nucleoside, 666 
Diphosphatidylglycerol, 225, 225G 
Diphtheria toxin, 980B 
Dipole, 13 
Directional character, 488 
Directional cloning, 357, 358F 
Disaccharide, 191-193, 192F, 192S 
Discriminator base, 957 
Dissociation 
by electrolyte, 36-37 
Henderson-Hasselbalch equation and, 
38 
of histidine-imidazole group, 42 
oxygen, 473-475 
carbon dioxide and, 474-475 
of water, 34 
Dissociation constant, 122 
Dissolved solute, 34 
Distance, van der Waals contact, 12 
Distribution coefficient, 129 
Disulfide bridge, 101, 102F 
Diterpene, 231 
Diversity 
antibody, 902 
metabolic, 511 
D,L naming system, 80-81 
DNA 
A-form, 323, 324F 
biological significance of, 307 
buoyant density of, 332B 
chimeric, 371-372 
in chromosomes, 302 
cloning and, 354-360. See also Cloning 
definition of, 297D 
density of, 332B 
digital encoding of, 301 
directed changes in heredity and, 
375-377, 377F, 378B, 378F 
DNA libraries and, 360-368 
double helix of, 4F, 299-303, 300K, 
301S, 321-323, 322F, 323F, 
326-327, 327F 
forms of, 324F 
genetic recombination and, 877-886. See 
also Genetic recombination 
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DNA (continued) 


higher complexity structures of, 
333-336, 336F 
left-handed, 323-324, 325F, 326 
nanotechnology, 302B 
nucleotide sequence of, 316-317 
polymerase chain reaction and, 
373-374, 374K, 375¥F, 377F, 378F 
quadruplexes, 329, 330F 
recombinant, 354-381 
artificial chromosomes from, 360 
repair of, 887-891, 890F, 892F 
replication of, 300F, 862-876 
chromosome ends in, 874-875 
in eukaryotic cells, 872-874, 
872F-873F 
Okazaki fragments and, 864-865, 
864F 
origin of, 862-863 
polymerase in, 865-870, 865T, 866F, 
867T, 868F, 868G, 869T, 870B, 
870F 
recombination-dependent, 884B, 885 
RNA genomes and, 876-877, 876B 
as semidiscontinuous, 863-864, 864F 
template and, 862D 
unwinding of helix in, 863 
right-handed, 323, 324F 
secondary structure of, 320-333 
denaturation and renaturation and, 
330-332, 331F, 333F 
double helix and, 321-323, 322F, 
323F, 326-327, 327F 
left-handed DNA and, 323-324, 325F, 
326 
right-handed DNA and, 323, 324F 
Watson—Crick base pairs and, 321 
single-stranded, 331 
structure of, 4F, 10F 
synthesis of, deoxyribonucleotide forma- 
tion in, 830-833, 831F, 834F 
template, 908, 910-911 
threadlike molecules of, 302 
toroidal supercoiled, 335-336 


DNA rearrangement of immunoglobulin 
genes, 899 
DNA sequence 
cloning of, 355-356, 356F 
gene regulatory proteins and, 935-939, 
937F, 938F, 939F 
NA sequencing, automated, 318-320 
NA strand exchange reaction, 881-883 
NA tweezers, 302B 
NA-binding protein, 923 
naG, 869 
nak, 989, 989F 
NA:protein interactions, 922 
Nase, 309 
Docosahexaenoic acid, 736B 
Dodecyl octaoxyethylene ether, 245S 
Dolichol, 231 
Dolichol phosphate, 231S 
Domain 
C-terminal, 926 
homeobox, 936 
PDZ, 1004 
protein, 155-156 
Dopa decarboxylase, 1056 
Dopamine, 1056 
Double helix, 299-303, 300F, 301S 
as dynamic structure, 326-327, 327F 
in solution, 326 
as stable structure, 321-323, 322F, 323F 
unwinding of, 863 
Double-bond character of peptide bond, 
87, 88F, 89 
Double-displacement reaction, 404, 
406-407, 407F 
Double-stranded RNA, 375 
Downstream, 908 
Dragline silk, 144 
Drug 
ATPase transport and, 283 
AZT, 877, 877S 
carbohydrate-based, 207B 
design of, 404B 
enzymes and, 386 
fluoro-substituted analog as, 834B 
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Editing, RNA, 945-946 
Edman 
glycogen, 680F 
of polyunsaturated fatty acids, 715F 
of protein, 999F, 1004B, 1004F 
purine, 823, 824F, 826F 
of pyrimidine, 830F 
Edman degradation, 102 
Edman’s reagent, 79, 80F, 102 
eEF2, 980B 
EF hand protein, 1031, 1033 
Effector, 1034-1037 
EF-G, 972, 973B, 973G 
EF-Tu, 968, 969T, 973B, 973G 
Ehlers—Danlos syndrome, 155B 
Eicosanoid, 220, 737 
synthesis of, 747-750 
Eicosanoids, 235 
Eicosapentaenoic acid, 736B 
elF-4 group, 978 
elFs, 976-977 
Elastase, 171B 
of chymotrypsinogen, 434, 435F 
Elastin, 171B 
Electrical potential, 129 
in neuron, 1044 
Electrical synapse, 1046 
Electroblotting, 364B 
Electrochemical potential, 271 
Electrogenic transport, 278 
Electrolyte 
acid dissociation constants for, 39T 
strong, 36 
weak, 37 
Electron, in catabolism, 522S 
Electron carrier, 606 
Electron transfer flavoprotein, 705 
Electron transfer pathway, cyclic, 641 
Electron transfer potential, 59T 
Electron transfer reaction, 641 
Electron transport, 592-629 
cytosolic NADH and, 620-623, 621F, 
622K, 623T 
isocitrate dehydrogenase and, 574 


DNA chip, 361B 

DNA footprinting, 908, 910B 
DNA glycosylase, 891 

DNA gyrase, 334-335, 335F, 863 
DNA helicase, 481 


folate analog, 818B 

microtubule polymerization and, 494B 

nonsteroidal anti-inflammatory, 
750B-751B 

protease inhibitor, 443B 


as membrane-associated process, 
592-593 

mitochondrial, 620 

mitochondrial electron-transport chain, 
599T 


DNA library, 360-368 


cDNA, 362-363 

combinatorial, 361B 

expression vectors in, 369-372, 369K, 
370F 

microarrays and, 367-368 

screening of, 362 

Southern hybridization and, 362 

from total DNA of organism, 360, 
361-362 


DNA ligase, 869 

DNA looping, 923 

DNA multiplexes, 328-329 

DNA polymerase, 317, 317F, 863-864, 874 


I, 865 

II, 865 

III holoenzyme, 867-868, 867T 

properties of, 865-870, 865T, 866F, 
867T, 868F, 868G, 869T, 870B, 
870F, 874T 


Drug resistance, 283, 285 

dTMP synthesis, 833, 834F 
Duchenne muscular dystrophy, 486B 
Duplex, hybrid, 332 

Dye binding assays, 98B 

Dynamite, 201B 

Dynein, 481, 493F, 494-498 
Dystrophy, muscular, 486B 


E. coli. See Escherichia coli 

El helicase, 502F 

El protein, 501 

E site, 965 

Earth’s crust, composition of, 5T 

Eating behavior, hormones controlling, 
853-854 

E3-binding protein, 566-567 

4E-BP, 978 

Economy, genetic, 177 

EcoRI fragment, 356F, 357 


organization of, 597-611, 597F, 598F, 
599T, 600B, 604G, 606F, 610F 
proton gradient and, 611-620, 614F, 
616F, 617F, 618F, 619B, 619F 
reduction potentials and, 593-596, 594F, 
595T 
thermodynamics and, 611 
Electron-transferring flavoprotein, 599 
Electron-transport chain, 598F, 599T 
Electron-transport pathway, 563 
Electrophilic catalysis, covalent, 430 
Electrophoresis, 129-131 
SDS-polyacrylamide gel, 130-131 
in Southern blotting, 364B 
Electrophoretic migration of polynu- 
cleotide fragment, 316-317 
Electrospray ionization mass spectrometry, 
106T, 107F-108F 
Electrostatic destabilization, 423 
Electrostatic effect, 422 


Electrostatic interaction, 135, 135F, 321 
Electrostatic repulsion, 60, 61F 
Element 
cAMP response, 936B 
consensus sequence, 908D 
mobile, 886 
response, 927, 928-929 
SECIS, 954B 
transposable, 886 
B-Elimination, 807B 
Elongation, 965 
of Cis fatty acids, 733-734 
chain, 911-912 
in fatty acid synthesis, 733, 733K, 734 
peptide chain, 981 
Elongation cycle, 968, 969T 
Elongation factor 
Escherichia coli, 969T 
G, 972 
Tu, 968, 969T 
Embden, G., 535 
Embden-Meyerhof pathway, 535 
Emphysema, 171B 
hereditary, 172B 
Enalapril, 424B 
Enantiomer, 184 
amino acids as, 80 
Endergonic, 53 
Endoamylase, 196 
Endocytosis, 209, 759T 
Endogenous uncoupler, 619B 
Endoglucanase A, 169F 
Endonuclease, restriction, 310-313, 311T, 
312F, 313 
Endopeptidase, 103 
Endopeptidase Lys-C, 103 
Endoplasmic reticulum, 20, 21F 
of animal cell, 23T 
of plant cell, 24T 
signal peptidase and, 996-997 
Endorphin, 1010, 1056 
Endothelin, 1056 
Endurance training, 708B 
Energy 
binding, in catalysis, 422F 
in biosphere, 512-513 
conservation of, 48—49 
free, 53 
of activation, 388 
concentration and, 54 
hydrolysis of ATP and, 64-65, 64F 
for hexose synthesis, 647-648 
internal, 48—49 
light, chlorophyll capturing, 633-634, 
634F, 6348, 635F, 6358, 636-637, 
636F 
in membrane transport, 277-285, 279F, 
279G, 282B, 282F 
from light, 285, 285F 
in metabolism, 512-513 
quantification of, 844F 
van der Waals interaction, 12F 
Energy cycle, ATP in, 521 
Energy diagram for chemical reaction, 
388F 
Energy dispersion, 51 
Energy landscape, 163 
Energy transduction, 522, 636 
Energy transformation, 30 


Energy-shuttle molecule, ATP as, 57-58 
Engineering, genetic, 354D 
Enhancer, 927 
Enkephalin, 1056 
Enol phosphates, 63 
Enolase, 549-550, 550F 
Enoyl-CoA hydratase, 706 
Enthalpy, 49-51 
Entropy, 51, 52B 
absolute zero and, 52 
in enzyme-substrate complex, 422F 
hydrolysis and ionization and, 60 
negative, 56 
Envelope, nuclear, 527 
Environment, water’s role in, 44—45 
Enzyme, 382-480 
allosteric, 395 
antibodies as, 413, 413S 
bifunctional, 671 
buffers vs., 42F 
of Calvin cycle, 652-653 
catalytic power of, 178, 421-423 
classification of, 384T 
cofactors of, 385T 
converter, 453 
debranching, 674 
defining characteristics of, 383-385, 
383F 
definition of, 382D 
designer, 414 
energy charge and, 844, 844F 
glycogen debranching, 674F 
inhibition of activity of, 398-403, 398T, 
399B, 399F, 400F, 401F, 402F 
Kn values for, 392T 
mechanism of action of, 426—448 
aspartic protease and, 437, 440-441, 
441F, 441S 
chymotrypsin and, 435G, 436-437, 
436F, 438S 
covalent catalysis, 430, 430F 
general acid-base catalysis, 430-431 
low barrier hydrogen bonds in, 
431-432, 432F 
metal ion catalysis, 432-433 
serine protease in, 434, 436F, 437, 
438F, 439B 
membrane-associated allosteric, 
1016-1017 
metabolic regulation and, 15-16 
mitochondrial matrix and, 593 
multienzyme system and, 517F 
in B-oxidation of fatty acids, 704F 
photoreactivating, 890 
rate acceleration and, 419-420, 420T 
in recombination, 880 
regulatory, 383D. See also Enzyme regula- 
tion 
restriction, 310-313 
reverse transcriptase as, 876-877 
ribozymes as, 410-412, 411F 
tandem, 671 
transition-state analogs and, 423 
transition-state stabilization and, 
420-421, 420F 
in tricarboxylic acid cycle, 572T 
ubiquitin-activating, 998 
Enzyme cascade, 465F, 466-467 
signaling pathways and, 1011 
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Enzyme catalysis 
active-site residues, 433B 
enzyme-substrate complex destabiliza- 
tion effects on, 421-423 
mechanisms of, 426—430 
protein motions, 428 
Enzyme kinetics, 386D 
Enzyme precursor, zymogens as, 454—455, 
454F—-455F, 454T 
Enzyme regulation, 452-480 
allosteric, 456-467. See also Allosteric 
regulation 
factors influencing activity, 452-456 
allosteric, 453 
covalent modification and, 453, 
462-467, 462F, 464F, 465F 
genetic, 452-453 
isozymes, 455-456, 456F 
product accumulation, 452 
substrate and cofactor availability, 
452 
zymogens and, 454-455, 454F, 
454T 
Enzyme-catalyzed reaction 
equations defining, 389-397, 390F, 
392T, 394T, 396B, 396F, 397F 
kinetics of, 403F, 403-409, 405F, 406F, 
406S 
mathematically defined rate of, 
386-389, 387F, 388F 
Enzymes, 15 
Enzyme-substrate complex 
binding energy in, 421 
destabilization of, enzyme catalysis 
affected by, 421-423 
entropy loss and destabilization in, 
421-423, 422F 
formation of, 422F 
intrinsic binding energy of, 421, 421F 
Epidermal growth factor, 212, 829B 
Epidermal growth factor receptor, 1016, 
1017-1019 
Epimer, 184 
Epinephrine, 1056 
glucagon vs., 683 
glycogen breakdown and, 681 
structure and function of, 75S 
Equation 
Arrhenius, 388 
of competitive inhibition, 399B 
Henderson—Hasselbalch, 38 
for kinetics of enzyme-catalyzed reac- 
tion, 389-397 
Michaelis-Menten, 390-395, 390F, 392T, 
394T 
for oxidation of palmitoyl-CoA, 709T 
Equatorial, definition of, 187D 
Equilibrium, 51 
free energy and, 53 
in hydrolysis of ATP, 63-64 
Equilibrium constant, 59, 67B 
Equilibrium ratio, ATP changing, 67B 
ERGIC-53, 213T 
Eruic acid, 220 
Erythrocyte, 539T 
Erythromycin, 194F, 982, 983S 
D-Erythrose, 182S 
Erythrose-4-P, 781T, 798 
b-Erythrulose, 183S 
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Escherichia colt 
cell walls of, 302F 
elongation factor of, 969T 
flagellar motor of, 504F 
Mformylmethionyl-tRNAi™" in, 966S 
glutamine synthetase in, 776-779, 777F, 
777G, 778F 
glutaminyl-tRNA synthetase of, 958, 958G 
maltoporin peptide chain, 255F 
Escherichia coli polymerase, 865-870, 865T, 
867T, 869T 
Escherichia coli ribonucleotide reductase, 
831, 831F 
Escherichia coli rRNA, 963, 963F 
Escherichia coli tRNA, 348F 
Essential amino acid, 531, '781T 
Essential fatty acid, 531 
Ester 
cholesterol, 757-758 
sugar, 189, 189S 
Estradiol, 233, 234S 
Estrogen, 233, 763 
ETF, 705 
Ethanol 
metabolic effects of, 855B 
NAD*-dependent dehydrogenase and, 
579B 
pyruvate metabolism and, 552-553 
Ethanolamine, 741 
Ether glycerophospholipid, 226-227, 227B 
Ethidium bromide, 326, 327S 
Eubacteria, 17 
Eubacterium, rRNA of, 348F 
Eudesmol, 230S 
Eukaryote, 17, 19-20 
cytosolic ribosomes of, 964-965, 965T 
DNA replication in, 872-874, 872F-873F 
fatty acid synthesis in, 730-732 
gene transcription in, 924-925 
glycerolipid synthesis in, 741-743 
metabolic diversity of, 511 
metabolic pathways in, 527 
mRNA in, 304F 
peptide chain elongation and termina- 
tion in, 981 
phospholipids and, 741-743 
protein sorting and translocation of, 
995-998 
protein synthesis in, 976-984, 977F, 
978T, 980B, 980F, 982T 
ribosomes of, 305F 
RNA polymerases of, 924-925, 925G 
Eukaryotic cell, 7, 17 
Eukaryotic chromosome, 336-339, 336T, 
338F 
Eukaryotic gene transcript, 305 
Eukaryotic initiation factor, 976-977, 978T 
Eukaryotic peptide chain initiation, 979 
Evaporation, 45 
Evolution 
of aminoacyl-tRNA synthetases, 955 
of cells, 17-18, 19 
histidine and, 806B 
oxygen, 640, 640F 
related proteins and, 113 
Evolved coupling stoichiometry, 842 
Exchange, strand, 879 
Exchange reaction, 408-409 
DNA strand, 881-883 


Excision DNA repair, 889B 
Excision repair, 891, 891F 
Excitable cell, 1043 
Excitatory amino acid neurotransmitter, 
1052-1053 
Excitatory neurotransmitter, 1052-1053 
Excretory product, uric acid and, 825-826 
Exercise, 708B 
carbohydrate utilization in, 680B 
Exergonic, 53 
Exit site, 965 
Exon, 303, 939, 945 
Exosome, 947, 948F 
Experiment 
bacterial transformation, 359F 
colony hybridization, 362 
Meselson—Weigle, 878F 
metabolic, 523-528 
isotopic tracers in, 525-526, 525T, 
526F 
Expressed sequence tag, 366 
Expression 
constitutive, 915 
gene 
glutamine synthetase and, 779 
regulation of. See Transcription 
unified theory of, 946-947, 947F 
protein, 370-371 
RNA, 369 
Expression cassette, 376-377 
Expression vector, 369-372, 369F, 370F 
Expression-cloning vector, 370F 
Exteins, 896B 
Exterminator, pest, 257B 
Extra loop, 345 
Extracellular fluid, energy stored in, 698T 
Extracellular matrix, 23T, 204 
Extrinsic coagulation pathway, 455 
Extrinsic protein, 246 
Eye, diabetic cataract in, 687B 


Fo unit, 612 
F; unit, 612 
F-actin, 484 
Factor, clotting, 232B, 455 
Factor-binding center, 972 
Factory, replication, 870 
Facultative anaerobe, 512 
Fad diet, 532B 
FAD-dependent acyl-CoA oxidase, 714 
Familial amyloidotic polyneuropathy, 172B 
Familial hypercholesterolemia, 760—761 
Faraday’s constant, 59 
in passive diffusion, 271 
Farnesyl, 257 
Farnesyl pyrophosphate, 753 
Farnesyl transferase, 259B, 259G 
Farnesylation, 258F 
FAS, 731 
Fast skeletal muscle troponin T isoform, 945 
Faster-acting insulin, 178B 
Fasting state, 673 
Fast-twitch muscles, 708B 
Fat 
antioxidants effect on metabolism of, 
713B 
in diet, 697 
energy in, 698T 
Fat-free mice, 851B 


Fatigue, muscle, 850-851 
F,-ATP synthase, 613G 
Fatty acid. See also Fatty acid catabolism; 
Fatty acid synthesis 
beta-oxidation of, peroxisomal, 714 
branched-chain, 714, 716, 716F 
Cis, 733-734 
catabolism of, 701—721 
essential, 531 
on megasynthases, 730-732 
mobilization of, 697, 698F, 699F, 
700-701, 700F 
odd-carbon, 710-712 
polyunsaturated 
degradation of, 714, 715F 
docosahexaenoic acid as, 736B 
regulation of, 736 
structure and function of, 219-222, 
220T, 221G, 221S 
unsaturation reactions in, 733-734 
Fatty acid catabolism 
beta-oxidation in, 701-702, 701F, 
704-709, 704F, 706F 
carnitine in, 702-703 
coenzyme A in, 702 
ketone bodies and, 717-718, 717B 
Fatty acid synthase, 730-731, 732F-733F 
Fatty acid synthesis 
acetate units in, 723-724, 725F 
acetyl-CoA carboxylase in, 725-727, 
726F 
acyl carrier proteins in, 727, 727F 
arachidonic acid and, 735-736, 736F 
cis double bond in, 733-734 
condensation of acetyl-CoA in, 729 
cytosolic acetyl-CoA in, 723, 723F 
elongation in, 733, 733F 
elucidation of, 727, 728K, 729 
in eukaryotes, 730-732 
hormones regulating, 737, 738F 
malonyl-CoA in, 722 
NADPH in, 722-723 
reduction of B-carbonyl group in, 
729-730 
Fatty acid thiokinase, 702 
Fatty acyl 
description of, 702-703 
thioester-linked, 257 
Fatty acyl synthase II, 728 
F-2,6-BPase, 671 
FCCP, 618F 
Fermentation, 553 
Ferredoxin:NADP* reductase, 638, 641 
Ferric enterobactin receptor, 254F 
Fetal hemoglobin, 475-476 
F\Fo-ATP synthase, mitochondrial, 648-649 
FGAM synthetase, 816 
FGAR anidotransferase, 816 
Fiber 
dietary, 532 
muscle, 489B 
Fibril, 151 
Fibrinogen, 455 
Fibroblast growth factor, 211 
Fibroin, 149, 149F 
Fibronectin, 210 
Fibrous protein, 93, 94F, 146-151 
alpha-keratin as, 147, 147F, 149 
collagen as, 150-151, 151F, 151S 


fibroin as, 149, 149F 
-keratin as, 149 
Filament 
nucleoprotein, 880 
sliding, 486, 487F, 488-490 
thick, 484—485 
thin, 483F, 484 
Filling up reaction, 582, 584 
Fingerprinting, peptide mass, 108-109 
First-order reaction, 386, 387F 
Fischer, Emil, 82B 
Fischer convention, 82B 
Fischer projection, 183-185, 185S 
Fixation 
carbon dioxide, 630 
Hatch-Slack pathway and, 656-659, 
658F 
light regulation of, 655-656, 655F 
photorespiration limiting, 656-659, 
657F, 658F 
nitrogen, 768, 771 
regulation of, 773-774 
Flagella, 19, 503-504 
Flagellar rotor, 504-505 
Flavin dinucleotide, 569B 
Flavin mononucleotide, 569B 
Flavodoxin, 153, 154G 
Flavoprotein, 120T, 598, 599 
FD, 505 
Flippases, 262, 262F 
Floppases, 262, 262F 
Fluid, extracellular, energy stored in, 698T 
Fluid mosaic model, 245-247, 246F 
Fluorescent protein, green, 81B, 81G, 372 
Fluoroacetate in tricarboxylic acid cycle, 
573-574 
5-Fluorocytosine, 835B 
5-Fluoroorotate, 835B 
Fluoro-substituted analog, 834B 
Fluoro-substituted pyrimidines, 835B 
5-Fluorouracil, 835B 
fMet, 966S 
fMet-tRNA;™*, 966 
Focusing, isoelectric, 131 
Folate analog, 818B 
Fold, 158, 897 
Folding, protein 
diseases of, 172B 
of globular protein, 166-167 
hydrophobic interactions in, 135 
molecular chaperones for, 168 
Hsp70, 988D, 989K, 991F 
N-linked oligosaccharides in, 209B 
polypeptide, 96F, 96G, 146 
Folic acid, 524T 
Follicle-stimulating hormone, 1009F 
Food pyramid, 2F 
Fool’s gold, 583B 
Footprinting, DNA, 908, 910B 
Force 
in folding of globular protein, 166-167 
mechanical work and, 49 
of muscle fiber, 489B 
van der Waals, 12 
weak, 11-12 
quaternary protein structure and, 
174, 176 
Fork, replication, 863, 868-869 
Formylglycinamidine ribonucleotide, 816 


NFormylmethionyl-tRNA;™, 966S 
Formylphenylananine methyl ester, 438S 
Forster resonance energy transfer, 636 
454 Technology, 319B 
Fragment 

Okazaki, 864F, 864-865 

polynucleotide, 316-317 

restriction, 312—313 
Fragmentation of polypeptide chain, 

103-105, 104F, 105T 

Frameshift mutation, 893 
Free energy 

of activation, 388 

equilibrium and, 53 

hydrolysis of ATP and, 64—65, 64F 
Free radical 

cationic, 636, 638 

in ribonucleotide reduction, 832F 
Freezing, 45 
a-D-Fructofuranose, 186S 
-b-Fructofuranose, 186G, 186S 
Fructokinase, 555 
Fructose, 555 

in glycolysis, 555 

in honey, 190B 
D-Fructose, 183S, 186S 
Fructose-1,6-bisphosphatase, 656, 664 
Fructose-2,6-bisphosphatase, 671, 672F 
Fructose-1,6-bisphosphatase reaction, 667, 

667F 

Fructose-2,6-bisphosphate, 670-671 
a-b-Fructose-1,6-bisphosphate, 189S 
Fructose-1-phosphate aldolase, 555 
a-L-Fucose, 189S 
Fuel, stored metabolic, 698T 
Fumarase 

Rear / Kn ratio of, 394T 

in TCA cycle, 577F 
Fumarate, 577S, 806, 809-810 
4-Fumaryl-acetoacetate, 810 
Functional genomics, 172B, 367D 
Fungal infection, 835B 
Furan, 184, 186S 
Furanose, 184 
Furanose form, 186S 
Furanose ring, 187 
Fusidic acid, 983S 
Fusion protein, 371T 
Fusion protein expression, 370-371 
Fusion system, gene, 371T 


G protein 
description of, 965 
GPCR signals transduced by, 1024-1025 
muscarinic receptors and, 1050-1052 
in peptide chain termination, 974 
phospholipase C activation by, 1030 
as signal transducer, 1027-1028 
GABA, 75S, 76, 1053 
Galactokinase, 556 
B-b-Galactosamine, 190S 
Galactose, 556-557 
D-Galactose, 182S 
Galactosemia, 556 
Galactose-l-phosphate uridylyltransferase, 
556, 557F 
B-Galactosidase, 558B, 914 
B-Galactosyl-1,3-a-N-acetylgalactosylserine, 
205S 
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Galactosylceramide, 746F 
Galectin-1, 215F 
Galectins, 213T, 215 
Gamma-aminobutyric acid, 75S, 76, 1053 
Ganglioside, 228, 228F, 747 
Ganglioside Gy, 228F 
Ganglioside Gyo, 746F 
GAR synthetase, 814 
GAR transformylase, 814 
Gas, nerve, 1052 
Gas chromatography, 86 
Gaseous neurotransmitter, 1047 
Gastrin, 1009F 
Gate, 274 
GCS-100, 207B 
GD0039, 207B 
GDP, 59-60 
GEF, 969 
Gel dideoxy sequencing, 318-320 
Gel electrophoresis 
SDS-polyacrylamide, 130-131 
two-dimensional, 131-132 
Gel filtration chromatography, 130F 
Geminin, 873 
Gene 
actin, 939 
adenovirus-mediated delivery of, 377, 
378F, 379 
as basis of heredity, 862D 
cloned, 363F, 369-372, 369F, 370F 
definition of, 18D 
essentiality of, 884B 
germline, 899 
immunoglobulin, 897, 899, 902 
metallothionein, 929, 929F 
ovalbumin, 939 
pro a-2 collagen, 939 
recombination-activating, 901 
regulatory, 914-915 
reporter, 371 
split, 939-940 
structural, 914-915 
synthesis of, 340 
transcription and, 906-951. See also 
Transcription 
in transgenic animals, 889B 
tumor-suppressor, 1029B 
vitellogenin, 940 
whimsical names for, 1040B 
Gene chip, 367-368, 368F 
Gene expression 
glutamine synthetase and, 779 
unified theory of, 946-947, 947F 
Gene fusion system, 371T 
Gene knockdown, 375, 376F 
Gene knockout, 375 
Gene regulatory protein, 935-939, 939F 
Gene reorganization, 899 
Gene silencing, 306-307 
Gene therapy, 376-377, 378B, 379, 822B 
Gene transcription. See Transcription 
General acid-base catalysis, 430-431 
General recombination, 877 
General transcription factor, 927, 929 
Genetic code, 952-953, 953F, 954B, 954T 
definition of, 952D 
natural variations in, 954B 
second, 953, 955 
Genetic deficiency, 376-377, 378B, 379 
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Genetic economy, 177 
Genetic engineering, 178B, 354D 
Genetic recombination, 877-886 
breakage and reunion of DNA strands 
in, 877-878, 878F 
enzymes in, 880 
homologous, 878-880, 879F 
immunology and, 895, 897-902, 898F, 
900F, 901F 
prions and, 893, 895 
RecA protein in, 881-883, 882F, 882G 
replication forks and, 885 
RuvA, RuvB, and RuvC proteins in, 
883-885 
transposons in, 886, 886F 
Geneticin, 983F 
Genome 
human, 121F, 367B, 425B 
RNA, 876-877, 876B 
Genome project, human, 367B 
Genome sequencing, 20 
Genomic library. See DNA library 
Genomic manipulation, 884B 
Genomics, 172B, 513 
Geraniol, 230S 
Geranylgeranyl, 257 
Geranylgeranylation, 258F 
Germline gene, 899 
Ghrelin, 854-855 
Gibberellic acid, 230S 
Gibbs free energy, 53 
Globular protein, 93, 94F, 152-170 
folding of 
forces driving, 166-167, 166F, 167F 
packing considerations in, 153, 155 
helices and sheets in, 152-153, 152F, 
152G, 154G 
motion in, 165-166, 165T 
solubility of, 98F 
Globule, molten, 162 
Glucagon, 1009F 
epinephrine vs., 683 
glycogen breakdown and, 681 
Glucocorticoid, 233, 764 
glycogen metabolism and, 683 
Glucocorticoid response element, 928 
Glucogenic amino acid, 531 
Glucogenic intermediate, 804 
Glucokinase, 539 
Gluconeogenesis, 554, 662-673, 662D 
glycolysis vs., 663 
in liver and kidneys, 662-663 
pathways of, 663-665, 664F 
regulation of, 669-673, 671F, 672F 
substrates of, 662 
unique reactions in, 665-669, 665F, 
666F, 667F, 668F, 669F 
Gluconeogenesis inhibitor, 668B 
Gluconic acid, 188 
D-Gluconic acid, 188S 
Gluconolactonase, 684 
p-6-Gluconolactone, 188S 
B-b-Glucopyranose, 185G, 185S 
B-D-Glucosamine, 190S 
Glucosaminoglycan, 200-201 
Glucose 
absolute configuration of, 82B 
biosynthesis and, 683-693 
breakdown of, 383F 


energy in, 698T 
exchange reaction of, 408 
in honey, 190B 
insulin secretion and, 680-681 
in liver, 669 
D-Glucose, 186S 
Glucose monitoring device, 663B 
Glucose oxidase, 190B 
Glucose oxidation, 622-623, 623T 
Glucose phosphate isomerase, 541 
Glucose tolerance test, 681F 
Glucose-l-phosphate, 57T, 462 
Glucose-6-phosphate 
control of gluconeogenesis by, 670 
in gluconeogenesis, 667, 667F 
gluconeogenesis and, 665 
glycogen phosphorylase and, 464 
isomerization of, 541 
utilization of, 691-692, 691F 
a-D-Glucose-l-phosphate, 189S, 196S 
Glucose-6-phosphate dehydrogenase, 684, 
685F 
Glucosidase, 674 
a(l1—>6) Glucosidase, 674 
b-Glucuronic acid, 188S 
Glutamate, 780S 
catabolism of amino acids and, 806 
as neurotransmitter, 1052-1053 
Glutamate dehydrogenase, 90T, 774, 774F 
Glutamate:aspartate aminotransferase, 
408F, 782 
Glutamate-dependent transamination, 780F 
Glutamic acid, 72S 
pK, value for, 77T 
titration of, 79F 
Glutamine 
pX, value for, 77T 
structure of, 72S 
Glutamine phosphoribosyl pyrophosphate 
aminotransferase, 814 
Glutamine synthesis, 775-776, 775F 
Glutamine synthetase, 89, 774, 775F, 801B 
regulation of, 776-779, 777F, 777G, 
778F 
size of molecule, 90T 
Glutaminyl-tRNA synthetase, Escherichia coli, 
958, 958G 
Glycan, 194 
Glycation end products, 677B 
Glyceraldehyde 
configuration of, 82B, 84B, 90 
structure of, 183S 
Glyceraldehyde-3-phosphate, 546, 546F, 
547F, 579B 
Glyceraldehyde-3-phosphate dehydroge- 
nase, 431T 
Glycerokinase, 738 
Glycerol, 189 
channels for, 277F 
glycolysis and, 557-558 
Glycerol kinase, 558 
Glycerol phosphate, 204 
Glycerol phosphate dehydrogenase, 558, 
559F 
Glycerolipid, 737-741, 739F 
sn-Glycerol-3-phosphate, 57T, 224S 
Glycerol-3-phosphate acyltransferase, 738 
Glycerol-3-phosphate dehydrogenase, 739 
Glycerophosphate dehydrogenase, 621 


sn-Glycerophosphate dehydrogenase, 599 
Glycerophosphate shuttle, 620-621 
Glycerophospholipid, 223G, 2238-2255, 
225F-226F, 227B, 737 
Glycinamide ribonucleotide synthetase, 814 
Glycinate, 76 
Glycine 
in green fluorescent protein, 81B 
pK, value for, 77T 
structure of, 72S 
synthesis of, 794, 796F 
titration of, 78F 
Glycine hydrochloride, 76 
Glycine oxidase, 794 
Glycine receptor, 1056F 
Glycocholic acid, 761F 
Glycoconjugate, 181D 
Glycogen, 196, 674-683 
branches of, 678F 
energy in, 698T 
glycosidic bonds in, 678F 
metabolic energy from, 673F 
regulation of, 674-675, 674F, 678-683, 
680F 
synthesis of, 675-677, 675G, 676F 
terminal chain segments of, 677-678, 
678F 
Glycogen debranching enzyme, 674F 
Glycogen phosphorylase, 196, 420T, 463F, 
674 


Glycogen phosphorylase reaction, 462-467, 
462S, 463F-464F, 4638, 675F 
Glycogen synthase, 676, 678, 678F 
Glycogenin, 673F, 676 
Glycogen-storage disease, 681B 
Glycolysis, 511, 535-562, 536F 
cellular regulation of, 554 
control of, 671F 
coupled reactions in, 537 
essential features of, 535 
first phase of 
cleavage by bisphosphate aldolase in, 
543-544, 545F 
isomerization of glucose-6-phosphate 
in, 541 
phosphorylation of glucose in, 536F, 
538-541, 539, 539F 
phosphorylation of phosphofructoki- 
nase in, 542-543, 542F, 542G, 
543B, 543F-544F 
triose phosphate isomerase in, 544, 
545F, 545G 
gluconeogenesis vs., 663 
glycerol and, 557-558 
metabolite of, in erythrocytes, 539T 
NADH in, 552-553, 553F 
pyruvate metabolism and, 552-553 
pyruvate production, 537F 
second phase of, 546-551 
dehydration of enolase in, 549-550, 
550F 
glyceraldehyde-3-phosphate dehydro- 
genase in, 546, 546F, 547F 
phosphoglycerate kinase in, 547-548, 
547F 
phosphoglycerate mutase in, 548, 
549S 
pyruvate kinase in, 550-551, 550F, 
552F 


substrates in, 554-558, 556F, 558B 
thermodynamics of, 538T 
Glycolytic pathway, 520 
Glyconjugates, 213 
Glycopeptide transpeptidase, 402F, 403 
Glycophorin, 249, 249F, 252F 
Glycoprotein, 120, 120T, 204-209 
antifreeze, 207, 207S 
N-linked, 205S 
structure of, 205S 
Glycosaminoglycan, 209 
Glycosides 
cardiac, 279, 280S, 282B 
description of, 191 
Glycosidic bond, 192, 678F 
Glycosphingolipid, 227, 1031 
Glycosyl phosphatidylinositol, 257, 258F, 
260 


Glycosylceramide, 746F 
Glyoxylate, 656 
Glyoxylate cycle, 587, 587K, 588, 589 
Glyoxysome, 588F, 589, 714 
Glyphosate, 801B 
GMK, 207B 
GMP, 814, 815F, 816-818, 820F 
GMP synthetase, 819 
Gold, fool’s, 583B 
Golgi apparatus, 20 
of animal cell, 23T 
of plant cell, 24T 
Gout, 824-825 
GPCR signal, 1024-1025 
GPI, 260 
G-protein-coupled receptor, 251, 1013, 
1015, 1037-1043 
GPT, 668-669 
G-quadruplexes, 329, 330F 
Gradient 
concentration, 270 
ion, in neuron, 1044 
proton, 611-620, 614F, 616F, 617F, 618F, 
619B, 619F 
flagella and, 503-504 
Gradient separation of amino acids, 86F 
Gram-negative bacteria, 201, 202S, 203F 
Gram-positive bacteria, 201, 202S, 203F, 
204 
Granule, storage, 20T 
Granum, thylakoid, 631 
Grass, 656 
Gratuitous inducer, 914D 
GRE, 928 
Green fluorescent protein, 81B, 81G, 372 
GroEL chaperonin, 168 
GroES-GroEL complex, 990-992, 992F, 
992G 
Groove, major or minor, 322 
Group, prosthetic, 598 
Group transfer potential, 58-59, 59T 
Growth, 839-840 
Growth factor, 212, 829B 
Growth factor receptor, 1016 
Growth hormone, 1009F 
GTP 
hydrolysis, 972 
of phosphoric acid anhydrides and, 
59-60 
in protein synthesis, 972 
in TCA cycle, 576 


GTPase-activating proteins, 1038F 
GTP-binding protein, 466 

G:U base pair, 958 

Guanidino group, 785 

Guanine, 292, 293S 

Guanine deaminase, 823 
Guanine-nucleotide exchange factor, 969 
Guanosine, 295, 295S 

Guanosine 5’-diphosphate, 296 
Guanosine 5'-monophosphate, 295, 296S 
Guanylic acid, 295 

Guanylin, 1021 

Guanylyl cyclase, 1015, 1021, 1024 
Guanylyl transferase, 940 

Guide strand, 375 

Guncotton, 201B 

Gyrase, DNA, 334-335, 335F 


H bond. See also Hydrogen bond 
description of, 321 
with polar solutes, 31 
in water, 29-30, 30F 
of water, 45 
H zone, 483 
Hairpin, 166 
Half-cell, 594 
Halobacterium halobium, 250 
Halorhodopsin, 285 
Handedness of peptide bond, 138B 
Hanes—Woolf plot, 272F, 395, 396F 
Hatch-Slack pathway, 656-659 
HATs, 933 
Haworth, Sir Norman, 185 
Haworth projection, 185-187, 185S, 186S 
H-chain gene, 899-900 
HDACs, 934 
HDL, 757 
H-DNA, 329 
H*/2e° ratio, 611 
Heart 
cardiac glycosides and, 279, 280S, 282B 
energy metabolism of, 848T 
metabolic role of, 848F, 851 
Heart myocyte, 204 
Heat, endogenous uncouplers and, 618, 
619B 
Heat capacity, 56 
Heat denaturation of DNA, 331F 
Heat shock protein, 168, 988 
Heat shock transcription factor, 929 
Heating of water, 44—45 
Heat-stable enterotoxin, 1021 
Heavy chain, 484 
Heavy isotope, 526 
a-Helical barrel, 256 
Helical twist, 323F 
Helical wheel, 153, 1033, 1033F 
Helicase, 481, 498, 499T, 863 
Helix 
a-, 95F, 137, 139-142, 936 
amphipathic, 997 
basic amphiphilic, 1033 
from calmodulin, 154G 
from citrate synthase, 154G 
from flavodoxin, 154G 
in proteins, 137, 139-142, 140F, 141F 
transmembrane, 251 
amphiphilic, 153 
Baa, 1033 
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double, 299-303, 300F, 301S 
antiparallel nature of, 300F 
Chargaff’s analysis of, 300 
digital encoding of, 302 
as dynamic structure, 326-327, 327F 
as stable structure, 321-323, 322F, 
323F 
Watson-Crick postulate of, 301, 301F 
in globular proteins, 152-153, 152F, 
152G, 154G 
hydrogen bond and, 140F 
unwinding of, 863 
Helix 2, 936 
Helix behavior of amino acids, 142T 
Helix capping, 141F, 142 
Helix-turn-helix, 936 
Helling, Robert, 358 
Heme, 467D, 607-608 
Heme iron, 471, 473, 473B 
Heme protein, oxygen-binding, 113 
Hemiacetal, 184, 191S 
Hemicellulose, 532 
Hemiketal, 184, 186S, 191S 
Hemoglobin, 467-468 
B-, 141F 
2,3-bisphosphoglycerate and, 475, 475F, 
476F 
carbon dioxide and, 474-475 
a-chain, 113 
B-chain, 113 
dimers of, 469F 
fetal, 475-476 
heme iron and, 471, 473 
myoglobin compared with, 467-469, 
468F, 468S 
nitric oxide and, 477B 
oxy and deoxy forms of, 473 
oxygen binding of, 468F, 469-471 
pathological sequence variants of, 
117T 
proteins of, 113, 115F 
quaternary structure of, 469, 471 
salt bridges between subunits of, 473F 
structure of, 96F, 113, 468S 
subunit motion of, 472F 
tetrameric structure of, 469 
Hemoglobin:oxygen interaction, 472B 
Hemolymph, 193B 
Hemopexin C-terminal domain, 169F 
Hemoproteins, 120T 
Henderson-—Hasselbalch equation, 38, 77 
Heparin, 200-201, 200S, 211S 
HEPES, 44S 
Heptad repeat pattern, 148B 
Heptulose, 183 
Herbicide, 801B 
Hereditary emphysema, 172B 
Heredity, 862D 
directing changes in, 375, 377, 378B, 
379 
Heterochromatin protein 1, 934 
Heterogeneous nuclear RNA, 303, 940 
Heterologous probe, 363 
Heterologous protein subunit, 173, 174F 
Heteromultimer, 101, 173 
Heteromultimeric protein, 89D 
Heteropolysaccharide, 194 
Heterotroph, 511D 
Hevamine, 169F 
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Hexokinase, 116F, 539-541, 540F 

in chemical reaction, 420T 

size of molecule, 90T 
Hexose 

Calvin—Benson cycle and, 653, 655 

synthesis of, 647-648 
Hexulose, 183 
HGPRT, 821, 822B 
High-density lipoprotein, 757 
High-energy biomolecule, 57-63 
High-energy bond, 58 
High-energy bond transfer potential, 59T 
High-energy phosphate compound, 57, 57T 
High-performance liquid chromatography, 

86, 132 

High-pressure liquid chromatography, 99 
High-spin state, 473B 
High-throughput DNA sequencing, 319B 
Histamine, 75S, 76 
Histidine, 549F, 803F 

evolution and, 806B 

pX, value for, 77T 

structure of, 73S 

synthesis of, 802, 803F 
Histidine-imidazole group, 42 
Histone, 302, 336, 336T, 933-934 
Histone code, 933-934 
Histone deacetylase complex, 934 
HIV infection, 443B, 876B 
HIV mRNA, 442F 
HIV-1 protease 

AIDS drugs and, 443B 

characteristics of, 441—442 

with inhibitor, 442G 

pH-rate profile for, 440F 

structure of, 440G 
HIV reverse transcriptase, 877 
H*,K*-ATPase, gastric, 280-282, 282F 
HMG-CoA lyase, 718 
HMG-CoA reductase, 752, 752F, 754F 
HMG-CoA synthase, 751-752 
hnRNA, 303, 305 
Holliday junction, 883F, 883G 
Holliday model, 878-880, 879F 
Holoenzyme, 385 

E. coli polymerase III, 867-868, 867T 
Homeobox domain, 936 
Homeostasis, 16, 847D 
Homodimer, 89D 

glycogen phosphorylase as, 462-463 
Homodisaccharide, 192 
Homogentisate dioxygenase, 810B 
Homoglycan, 194 
Homologous protein, 111 
Homologous recombination, 877 
Homolytic cleavage, 711 
Homomultimer, 173 
Homomultimeric protein, 89D 
Homopolysaccharide, 194 
Homoserine acyltransferase, 790 
Homoserine dehydrogenase, 790 
Homoserine kinase, 790 
Honey, 190B 
Hoogsteen base pairs, 328-329 
Hop diffusion, 265 
Hormone 

adenylyl cyclase and, 465F 

adrenocorticotropic, 763 

definition of, 1008D 


eating behavior and, 853-854 
eicosanoids as, 747 
fatty acid release and, 697, 700-701 


fatty acid synthesis regulated by, 737, 738F 


glycogen synthesis regulated by, 
680-683, 680F 
a-melanocyte stimulating, 854 
polypeptide, 1010 
signal-transducing receptors responding 
to, 1012F, 1013-1024, 1013F, 
1013G, 1024B 
steroid, 233, 234S, 1008-1009, 1008D 
Hormone-activated enzyme cascade, 465F 
Hsp, 988 
Hsp60 chaperone, 168, 988-989, 989F 
HTH motif, 936 
HtrA protease, 1003 
Huber, Robert, 642 
Human bactericidal permeability-increas- 
ing protein, 169F 
Human body, composition of, 5T 
Human genome, 121F, 367B, 425B 
Human growth hormone, 169F 
Human growth hormone gene, 889B 
Human immunodeficiency virus infection, 
443B, 876B 
Huntington’s disease, 988B 
Hyaluronate, 200S, 201, 212F 
Hyaluronic acid-binding domain, 213 
Hyatt, Isaiah, 201B 
Hyatt, John, 201B 
Hybrid duplex, 332 
Hybrid protein, 371F 
Hybridization 
nucleic acid, 332-333 
screening of genomic library by, 362F 
Southern, 362-363, 364B 
Hybridization experiment, colony, 362 
Hydration, 30-31, 31F 
Hydride ion, 522 
Hydrocarbon chain in membrane bilayer, 
247 
Hydrogen 
abundance of, 4 
in catabolism, 522S 
Hydrogen bond, 11T, 12-14, 13S 
helix structure and, 140F 
low barrier, 431-432, 432F 
in parallel pleated sheets, 143 
in peptide groups, 139F 
in proteins, 134-135 
in water, 29-30 
Hydrogen ion, 34 
in dissociation of oxygen from hemoglo- 
bin, 473-475 
secondary active transport and, 286 
Hydrogen tunneling, 433B 
Hydrogenase activity, 773F 
Hydrolysis 
acid, 99 
of ATP 
to ADP, 62F 
equilibria in, 63-64 
myosin and, 488—490 
entropy factors from, 60 
GTP, 972 
of nucleic acids, 307-313 
of phosphoric acid anhydrides, 59-60 
of starch, 673F 


Hydronium ion, 34 
Hydropathy index, 251 
Hydropathy plot, 251, 252F 
Hydrophobic collapse, 162 
Hydrophobic core, 167D 
Hydrophobic effect, 146 
Hydrophobic interaction, 9, 11T, 13-14, 
31-33, 135 
Hydrophobic interaction chromatography, 
132 
Hydrophobic protein in bacterial cell wall, 
203 
L-Hydroxy-acyl-CoA dehydrogenase, 
706-707 
Hydroxyapatite, 151 
B-Hydroxybutyrate, 849S 
B-Hydroxybutyrate dehydrogenase, 718 
7a-Hydroxycholesterol, 761F 
B-Hydroxydecanoy] thioester dehydrase, 
734 
Hydroxyeicosanoic acid, 747 
Hydroxyethyl-TPP, 567, 571F 
B-Hydroxyl group, 706-707 
Hydroxyl ion, 34 
7a-Hydroxylase, 762 
Hydroxylase factor-inhibiting hypoxia 
inducible factor, 559 
Hydroxylated residue 
of collagen, 150F 
of proline, 150S 
Hydroxylysine, 75S, 76 
p-Hydroxyphenylpyruvate dioxygenase, 809 
17-a-Hydroxyprogesterone, 2388S 
Hydroxyproline, 75S, 76, 150S 
Hydroxyprolyl residue, 150F 
17-B-Hydroxysteroid dehydrogenase, 2388S 
17-B-Hydroxysteroid dehydrogenase 3 defi- 
ciency, 238S 
Hypercholesterolemia, familial, 760-761 
Hyperchromic shift, 330 
Hyperpolarization, 1053 
Hyperuricemia, 825 
Hyperventilation, 44B 
Hypothesis 
lock-and-key, 409-410 
Mitchell’s chemiosmotic, 609 
operon, 914 
wobble, of codon-anticodon pairing, 
959 
Hypoventilation, 44B 
Hypoxanthine, 293S 
Hypoxanthine-guanine phosphoribosyl- 
transferase, 821 
Hypoxia inducible factor, 559 
Hypoxia inducible factor-16, 559 
Hypoxia inducible factor-la, 559 


I band, 483 

Ibuprofen, 751S 

Ice, 29-30, 29F 

Identical subunit 
dimer of, 441 
protein, 173, 176F 

p-Idose, 182S 

1-Iduronic acid, 188S 

IFs, 967 

Imidazole 
pnitrophenylacetate hydrolysis by, 431F 
titration curve for, 40, 41F 


Immunodeficiency syndrome, severe com- 
bined, 378B, 822B 
Immunoglobulin, 176F, 177B, 413, 897, 
898F 
Immunoglobulin fold, 897 
Immunoglobulin gene, 897, 899, 902 
Immunoglobulin module, 156F 
Immunology, 895, 897-902, 898F, 900K, 
901F 
Immunoprecipitation, 373 
IMP, 814, 815F, 816-818, 819F 
IMP cyclohydrolase, 817 
IMP dehydrogenase, 819 
In vitro mutagenesis, 374-375 
Inanimate environment, nitrogen in, 
768-769 
Indinavir, 443S 
Indirect readout, 937 
Indole-3-glycerol phosphate synthase, 802 
Induced dipoles, 13 
Induced fit, 409—410 
Inducer, 914D 
Inducible operon, 918 
Induction, 452, 913 
Inflammation, 215 
Infrared region, 82 
Inhibition 
allosteric, 457 
of enzymes, 398-403, 398T, 399B, 399F, 
400F, 401F, 402F 
Inhibitor 
amino acid biosynthesis, 801B 
ATP synthase, 618 
of citrate synthase, 572 
of enzyme activity, 386, 398-403, 398T, 
399B, 399F, 400F, 4015, 402F 
gluconeogenesis, 668B 
of glycogen phosphorylase, 464 
HIV-1 protease, 442G 
mechanism-based, 834B 
monoamine oxidase, 1054B 
of oxidative phosphorylation, 616-618 
protein synthesis, 981-984, 982T 
Inhibitory neurotransmitter, 1053 
Initiation 
peptide chain, 965, 966-968 
in eukaryotes, 976-979, 977F 
of transcription, 931F, 931G 
Initiation factor, 966, 967 
elF-4G, 978 
eukaryotic, 976-977, 978T 
general transcription, 930T 
Initiator aminoacyl-tRNA, 965 
Initiator tRNA, 966 
Inner mitochondrial membrane translo- 
con, 998 
Inorganic nitrogen, 768-774, 769F, 770F, 
771¥, 772F, 773F 
Inorganic precursor, 8F 
Inosine, 295S, 305S 
Inosinic acid, 814, 815F, 816-818, 819F 
Inositol-1,4,5-trisphosphate, 235, 1029 
Insect 
butterfly, 282B 
chitin of, 190 
pest control and, 257B 
trehalose of, 193B 
Insertion mutation, 893 
Insertion sequence, 886 


Insulin, 454, 1009F 
eating behavior and, 855 
glycogen synthesis and, 680-681 
as polypeptide hormone, 1010 
size of molecule, 90T 
structure of, 100S 
Insulin receptor, 1016, 1020 
Insulin receptor substrate, 1020 
Insulin-like growth factor 1, 1042 
Integral membrane protein, 249F, 251-256, 
255F, 270 
R. viridis photosynthetic reaction center 
as, 642 
Integral protein, 246 
Inteins, 896B 
Interaction 
DNA:protein, 922 
electrostatic, 321 
hemoglobin:oxygen, 472B 
hydrophobic, 9, 13-14, 31-33, 135 
ionic, 13 
molecular, in water, 30 
noncovalent, in proteins, 134-136 
protein-protein, 922-923 
van der Waals, DNA and, 321-322 
Interaction energy, van der Waals, 12F 
Intercalating agent, 326 
Intercalation, 893 
Intermediate 
acyl carrier proteins and, 727 
acyl-enzyme, 436 
amphibolic, 520 
2-carboxy-3-keto-arabinitol as, 651 
lariat, 942 
metabolic integration and, 840 
resonance-stabilized carbanion, 711F 
transition state vs., 419 
tricarboxylic acid cycle providing, 
581-582, 581F 
Intermediate-density lipoprotein, 757 
Intermembrane space, 592 
Internal energy, 48-49 
Internal loop, 343 
International unit, 393D 
Intervening sequences, 303 
Intestine, small, fatty acids in, 699F 
Intolerance, lactose, 557, 558B 
Intracellular buffering system, 42 
Intracellular calcium-binding protein, 
1031, 1033 
Intracellular nucleotidase, purine catabo- 
lism and, 823 
Intracellular second messenger, 1026T 
Intramolecular ionic bond, 14S 
Intramolecular tunnels, 802B 
Intrasteric control, 460 
Intrastrand base pairing, 342, 344-345, 346, 
348 
Intrinsic binding energy, 421, 421F 
Intrinsic coagulation pathway, 455 
Intrinsic protein, 246 
Intrinsic termination, 912 
Intrinsically unstructured protein, 168, 
170F 
Intron, 303, 939 
Invariant residues, 113 
Invertase, 193 
Inverted repeat, 327, 328F 
Invirase, 443S 
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Ion, 13 
hydride, 522 
hydrogen, 34 
hydronium, 34 
hydroxyl, 34 
magnesium, 656 
metal, 432-433 
salt, 98 
secondary active transport and, 286 
Ion channel 
ligand-gated, 1047 
oligomeric, 1047 
voltage-gated, 1044 
Ion exchange chromatography, 85-86, 
127-128 
Ion gradient in neuron, 1044 
Ion product of water, 35 
Ionic bond, 14F 
Ionic interaction, 11T, 13 
Ionization 
of amino acid side chains, 78-79 
entropy factors from, 60 
stabilization of hydrolysis products by, 60 
of water, 34-35, 34F 
Ionization constant, 37 
IP;, 1032F 
IPTG, 914, 914F 
Iron, heme, 471, 473, 473B 
Iron-sulfur cluster, 573, 573F 
Iron-sulfur protein, 598 
Irreversible enzyme inhibition, 401—403 
Islets of Langerhans, 680 
Isoacceptor tRNA, 959 
Isocaproic aldehyde, 238S 
Isocitrate 
citrate and aconitase and, 572-573 
in tricarboxylic acid cycle, 572-574 
Isocitrate dehydrogenase 
electron transport and, 574 
regulation of, 586 
in tricarboxylic acid cycle, 574, 574F, 
574G 
Isocitrate lyase, 587-588, 588F 
Isoelectric focusing, 131 
Isoelectric point, 97 
Isoform, protein, 944 
Isolated system, 48, 49F 
Isolation 
of cloned fusion proteins, 371T 
protein, 97-98 
Isoleucine 
catabolism of, 807, 808F 
pK, value for, 77T 
stereoisomers of, 83B 
structure of, 73S 
synthesis of, 790, 793, 794F 
Isologous protein subunit, 173, 174F 
Isomaltose, 192, 192S, 196 
Isomerase 
in beta-oxidation of fatty acids, 713-714 
triose phosphate, 544, 545F, 545G 
Isomerization 
of citrate by aconitase, 572-573 
of glucose-6-phosphate, 541 
Isopentenyl pyrophosphate, 753 
Isoprene, 229, 230S 
plants emitting, 231B 
Isopropyl B-thiogalactoside, 914, 914F 
Isopropylmalate dehydratase, 793 


I-18 Index 


Isopropylmalate dehydrogenase, 793 

Isopropylmalate synthase, 793 

Isopycnic centrifugation, 332B 

Isoschizomer, 312 

Isotopic tracer, as metabolic probe, 
525-526, 525T, 526F 

Isozyme, protein kinase C as, 1035-1036 

IUP, 168 


Janus protein kinase, 1037 
Jeffery, Constance, 543B 
JH gene, 900 
Judson, Horace Freeland, 143B 
Junction, 343, 343F 
Holliday, 878-880, 879F 
synaptic, 1052 
Juvenile hormone esterase, 425B 


K,, 36-37 
Kappa gene family, 899 
heat, 393, 393T 
Reat/ Kn ratio, 393-394 
KcesA, 274 
Keg, ATP changing, 67B 
Keratan sulfate, 200S, 201 
Keratin 
a-, 147, 147F, 149 
B-, 149 
Ketal, 191, 191S 
a-Keto acid, 780S 
B-Ketoacyl-CoA intermediate, 707 
Keto-enol tautomeric shift, 293 
Ketogenesis, 717D 
Ketogenic amino acid, 531 
Ketogenic intermediate, 804 
a-Ketoglutarate, 150S, 806 
a-Ketoglutarate dehydrogenase, 575, 575T 
a-Ketoglutarate family of amino acids, 
781B, 781T, 783-787, 783F, 
784F-786E, 787B 
Ketohexose, 183, 183S 
Ketone, 186S, 532B 
Ketone body, 717-718, 717B, 718F 
Ketopentose, 183, 183S 
Ketose, 183, 183S 
3-Ketosphinganine reductase, 744 
3-Ketosphinganine synthase, 745 
Ketotetrose, 183, 183S 
B-Ketothiolase, 707, 751 
Ketotriose, 183S 
Kidney, gluconeogenesis in, 662-663 
Kilocalorie, 49 
Kinase 
homoserine, 790 
mitogen-activated protein, 1034B 
nucleoside diphosphate, 666 
pyruvate dehydrogenase, regulation of, 
584-586 
receptor tyrosine, 1016-1017, 1030, 
1030F 
ribulose-5-phosphate, 653 
triose, 555 
tyrosine, 1015-1016 
Kinesins, 481, 492, 495-496, 497F 
Kinetics 
burst, 436F 
chemical, 386 
definition of, 386 


of enzyme-catalyzed reaction, 403-409, 
403F, 405F, 406F, 406S 
ping-pong, in fatty acid synthesis, 724 
zero-order, 389 
Km 391D, 394T 
KNF model, 458 
Knob, synaptic, 1044 
Knockout mice, 884B 
Knoop, Franz, 701-702 
Koshland—Nemthy-Filmer sequential 
model, 473 
Krebs, Hans, 563 
Krebs cycle, 563-591. See also Tricarboxylic 
acid cycle 


Ky, 35 


Label, affinity, 402 
lac operon, 914F, 914-915 
lac repressor, 915, 915F, 917B, 917T 
a-Lactalbumin, 116, 116G 
B-Lactamase, keat/ Kn ratio of, 394T 
Lactase, 193 
Lactate 
gluconeogenesis and, 662, 669 
pyruvate metabolism and, 552-553, 553F 
Lactic acidosis, 681B 
Lactic dehydrogenase, 455-456, 456F 
f-b-Lactose, 193 
Lactose intolerance, 557 
Lactose synthase, 558B 
Lagging strand in DNA replication, 864 
LamB protein, 255 
Lambda gene family, 899 
Lambda receptor, 255 
Lamella, 19, 631 
Landscape, energy, 163 
Langdon, Robert, 753B 
Lanolin, 229 
Lanosterol, 230S, 231, 757 
Lariat intermediate, 942 
Lathyrism, 155B 
a-Latrotoxin, 1047 
Lauric acid, 220T 
Laws of thermodynamics, 48-52 
LCAT, 758 
L-chain gene, 899 
LDL, 206, 757, 758, 760-761 
L-dopa, 1056 
Leader peptidase, 994 
Leader peptide, 921F, 994 
Leading strand in DNA replication, 864 
Leading substrate, 405 
Learning, 936B 
Lecithin, 225 
Lecithin:cholesterol acyltransferase, 758 
Lectins, 213, 213T 
Left-handed DNA, 323-324, 325F, 326 
Leloir pathway, 556-557 
Leptin, 855 
Lesch-Nyhan syndrome, 822B 
Lethal synthesis, 834B 
Leucine 
catabolism of, 807, 808F 
pX, value for, 77T 
structure of, 72S 
synthesis of, 793, 795F 
Leucine aminotransferase, 793 
Leucine zipper-basic region, 936 


Leukemia, 789B 
Leukemia virus, Maloney murine, 377 
Leukosialin, 206 
Leukotriene, 747 
Levinthal’s paradox, 162 
Levorotatory behavior, 80 
Library, DNA. See DNA library 
Life 
cell as unit of, 9 
time scale of, 17T 
Life cycle, ribosome, 976F 
Ligand-binding site, 122-123 
Ligand-gated ion channel, 1047 
Ligand-induced conformational changes, 
123 
Ligase 
Acyl-CoA, 702 
DNA, 869 
phage T4 DNA, 357 
ubiquitin-protein, 998 
Light, ultraviolet 
amino acid absorbing, 82, 92 
denaturation of DNA and, 330-331 
pyrimidine dimer repair and, 890 
pyrimidines and purines and, 293, 293F 
Light activation, 655 
Light chain, 484, 484F 
Light energy 
chlorophyll capturing, 633-634, 634F, 
6345S, 635F, 635S, 636-637, 636F 
in membrane transport, 285, 285F 
Light reaction, photosynthetic, 632, 632F 
Light regulation of carbon dioxide fixation, 
655-656, 655F 
Light-driven ATP synthesis, 648-650, 649F, 
650F 
Light-harvesting complex, 637, 647 
Light-harvesting pigment, 634, 635S 
Light-induced change in chloroplast com- 
partment, 655, 655F 
Lignin, 532, 798 
Lignoceric acid, 220T 
Limit dextrin, 674 
Limonene, 230S 
Linear molecular motor, 481 
Lineweaver—Burk plot, 272F, 395 
of competitive inhibition, 399F 
of mixed noncompetitive inhibition, 
401F 
nonlinear, 395 
of uncompetitive inhibition, 402F 
Link protein, 213 
Linker, 357 
Linking number, 334 
Linoleic acid, 220T, 221G, 221S 
1-Linoleoyl-2-palmitoylphosphatidylcholine, 
226 
Lipase, 758 
Lipid, 219-241. See also Lipid biosynthesis 
as biological signals, 234-237 
fatty acids, 219-222, 220T, 221G, 221S. 
See also Fatty acid entries 
glycerophospholipids, 223G, 223S-225S, 
225F, 226F, 227B 
membrane proteins anchored in, 
256-257 
movements of, in membranes, 261-269 
nutritional, 531 


sphingolipids, 227-229, 228F 
spontaneously formed structures of, 
244-245, 244F, 245S 
steroids, 233, 234F, 234S, 236B, 238S 
terpenes, 229-232, 230S, 231B, 2315S, 
232B 
triacylglycerols, 222, 222G, 222S, 223 
waxes and, 229, 229B, 230S 
Lipid biosynthesis, 722-767 
bile acids and, 761-762, 761F 
of complex lipids, 737-747 
CDP-diacylglycerol and, 740F, 
741-743 
ceramide and, 746-747 
glycerolipids and, 738-741 
phosphatidylethanolamine, 741, 741F 
platelet-activating factor and, 744, 
744F 
sphingolipid and, 744-745 
fatty acid, 722-737. See also Fatty acid 
synthesis 
steroid hormones and, 762—764 
Lipid transverse asymmetry, 260 
Lipid-anchored proteins, 248 
Lipidomics, 237-238 
Lipoic acid, 524T, 570B 
Lipopolysaccharide, 203, 203F 
Lipoprotein, 120T, 206, 757T-759T, 758, 
758G, 760-761 
Lipoprotein complex, 757 
Lipoprotein lipase, 758 
Liposome, 245 
Liquid-disordered state, 263, 265 
Lithium, 1031B 
Liver 
cortisol and, 684F 
energy metabolism of, 848T 
energy stored in, 698T 
gluconeogenesis in, 662-663 
glucose in, 669 
metabolic role of, 848F, 852-853, 853F 
pentose phosphate pathway and, 684 
Liver alcohol dehydrogenase, 173G 
Living organism, 1 
Living system, 1—4 
Lock-and-key model, 409-410 
Long-chain acyl-CoA dehydrogenase, 705 
Long-term memory, 936B 
Loop 
anticodon, 953 
extra, 345 
internal, 343 
reactive-center, 171B 
variable, 345 
BaB-Loop, 167 
Looping, DNA, 923 
Loops, 343F 
Lovastatin, 755B 
Low barrier hydrogen bond, 431-432, 
432F 
Low-density lipoprotein, 206, 757, 758, 
760-761 
Low-spin state, 473B 
Lumen, thylakoid, 631 
Luteinizing hormone, 1009F 
Lyase 
ATP-citrate, 724 
HMG-CoA, 718 


Lycopene, 230S 
Lymphocyte homing receptor, 212 
Lysine 
catabolism of, 809 
pX, value for, 77T 
structure of, 73S 
synthesis of, 785, 786F, 787, 788F, 790, 
791F 
titration of, 79F 
Lysine residue, 285S 
Lysis, cell, 22 
Lysogeny, 22 
Lysophosphatidic acid, 235 
Lysosomal acid lipase, 758 
Lysosome, 20, 23T 
Lysozyme, 90T, 116, 116G 


M disc, 483 
M line, 483 
Macrolides, 982 
Macromolecular synthesis, 839-840 
Macromolecule, 1, 8F 
architecture of, 10, 11, 11G 
information in, 10-11 
metabolites and, 5—7 
Macrophage, 897 
Magnesium ion 
ATP hydrolysis and, 64-65 
thylakoid vesicles and, 656 
Major groove, 322 
Malaria, 835B 
L-Malate, 577S, 579B 
Malate dehydrogenase, 578, 657 
Malate synthase, 587, 587F, 588 
Malate-—aspartate shuttle, 621 
Malathion, 1051S, 1052 
MALDI-TOF mass spectrometry, 106T 
Malic enzyme, 713F 
Malignancy. See Cancer 
Maloney murine leukemia virus, 377 
Malonyl-CoA, 722 
Malonyl-CoA-acetyl-CoA-ACP transacylase, 
729 
Maltase, 193 
Maltoporin, 254F—255F, 255, 255F, 
258F 
Maltose, 192, 192S, 408 
Mammal 
CPS-II activation in, 829B 
polyunsaturated fatty acids and, 735 
Manipulation, genomic, 884B 
Mannan, 198S 
Mannitol, 189 
Mannose, 556 
p-Mannose, 182S 
Mannose-specific agglutinin, 169F 
a-Mannosyl-serine, 205S 
Map, metabolic, 513, 514F—-515F 
MAP kinase, 829B 
MAPKs, 1034B 
Maple syrup urine disease, 808-809 
Mapping, restriction, 312F 
Marfan’s syndrome, 155B 
Marker, selectable, 355 
Mass spectrometry, 105-109, 106T, 
107F-108F, 109F 
Mathematically defined enzyme-catalyzed 
reaction, 386-389 
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Matrix 
cytosol and, 600-601 
definition of, 593D 
mitochondrial, 593 
Matrix proteoglycan, cartilage, 212F 
Matrix-assisted laser desorption ionization- 
time of flight mass spectrometry, 
106T 
MCM protein, 872 
Mechanical work, 49 
Mechanism-based inhibitor, 834B 
Mechanosensation, 267 
Mechanotransduction, 267 
MED proteins, 930 
Mediated decay system, nonsense, 946 
Mediator, 929, 930-932, 931F 
Medium-chain acyl-CoA dehydrogenase, 
705 
Megasynthases, 730 
a-Melanocyte stimulating hormone, 854 
Melting, 45 
Melting temperature, 331 
Melville, Herman, 229B 
Membrane, 242—290 
cell, 9, 19, 20T, 23T 
curvature of, 265-266, 266F 
depolarized, 1044 
electron transport in, 592-593 
fluid mosaic model of, 245-247, 246F 
lateral diffusion, 265 
lipids in water and, 244—245, 244F, 245S 
mitochondrial, 702-703 
outer, 592 
phase transitions of, 277 
plasma, 19 
proteins of, 248-260. See also Protein, 
membrane 
thylakoid, 631 
transport across, 260, 269-287 
energy input in, 277-285, 279F, 
279G, 282B, 282F 
light energy in, 285, 285F 
passive diffusion and, 271, 272F 
Membrane channels, 273T 
Membrane protein, 93, 94F, 251, 995 
Membrane raft, 264F, 265 
Membrane thickness, 246 
Membrane-associated allosteric enzyme, 
1016-1017 
Membrane-bound systems, 516 
Membrane-spanning supramolecular com- 
plex, 638 
Memory, long-term, 936B 
Menthol, 230S 
Meperidine, 617S 
B-Mercaptoethylamine, 570B 
Meselson—Weigle experiment, 878F 
Mesosome, 19 
Messenger, second 
cyclic AMP as, 1025-1026, 1026F, 
1026G, 1026T 
intracellular, 1026T 
released by phospholipase, 1028 
Messenger RNA. See mRNA entries 
Metabolic channeling, 828-829 
Metabolic experiment, isotopic tracers in, 
525-526, 525T, 526F 
Metabolic fuel, stored, 698T 
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Metabolic integration 
anabolism and, 839 
ATP coupling and, 841-843 
catabolism and, 839 
intermediates and, 840 
macromolecular synthesis and, 839-840 
in multicellular organism, 847, 848F, 
848T, 849-853, 851B, 851F 
photochemical apparatus and, 841 
quantification of cellular energy and, 
843-844, 844F 
Metabolic map, 513, 514F-515F 
Metabolic pathway 
compartmentalization of, 527, 528F, 
529F 
for inorganic nitrogen, 768-774, 769F, 
7170F, 771F, 772F, 773F 
Metabolic probe, 525-527, 525T, 526F 
Metabolic regulation, 16. See also 
Regulation 
Metabolic syndrome, 708B 
Metabolic water, 710 
Metabolism, 511—534. See also Metabolic 
entries 
ammonium and, 774—776, 775F, 776F 
anabolic, 518-523 
catabolic, 517-520 
cellular, 15, 16F 
classification related to, 512T 
diversity of, 511 
flow of energy and, 512 
glycogen regulating, 674-675, 675F, 
678-683, 680F 
map of, 513, 514F-515F 
mutations affecting, 525, 525F 
NADH and, 552-553, 553F 
nutrition and, 531-532. See also 
Nutrition 
oxygen and, 511 
pathways of, in cells, 527, 528F, 529F 
pyruvate and, 552-553, 553F 
Metabolite, 8F 
in compartmentalized pyruvate carboxy- 
lase, 666 
in erythrocytes, 539T 
macromolecules and, 5—7 
Metabolome, 529 
Metabolomics, 529 
Metabolon, 517D 
Metal activated enzyme, 432 
Metal ion, in ATP hydrolysis, 64-65 
Metal ion catalysis, 432-433 
Metal-activated proteins, 120T 
Metalloenzyme, 432 
Metalloprotein, 120T, 771 
Metallothionein gene, 929, 929F 
Meteorite, Murchison, 83B 
Metformin, 847 
Methanogen, 19 
Methemoglobin, 115F 
Methionine, 790 
catabolism of, 807, 808F 
pX, value for, 77T 
structure of, 73S, 75 
synthesis of, 790, 791F 
Methionyl-tRNA;™ formyltransferase, 
966 
Methyl carbon, 580F 
Methylated base, 304 





Methylation, 461T, 940 
cytosine, 326 
in histone code, 934 
Methyl-p-glucoside, 1918 
Methyl-directed pathway, 889B 
Methylmalonyl-CoA mutase, 711 
Mevalonate, 751-752, 753, 754F 
Mevinolin, 755B 
Mevinolinic acid, 755B 
Meyerhof, O., 535 
Mg?* channel, 276 
Mice 
fat-free, 851B 
knockout, 884B 
transgenic, 889B 
Micelle, 33, 33F, 244, 244F 
Michaelis constant, 391 
Michaelis-Menten equation, 390-395, 
390F, 393T-394T 
myoglobin and, 467 
Michel, Hartmut, 642 
Micro RNA, 306 
Microarray, DNA, 367-368 
Microtubule, 177F, 490-498, 491F, 494B 
Microtubule-binding protein tau, 988B 
Migration 
branch, 879, 882 
electrophoretic, 316-317 
Migratory bird, 709-710 
Milk, lactose in, 557, 558B 
Mimicry, molecular, 973B 
Mineralocorticoid, 764 
Miniband unit, 337 
Minor groove, 322 
Minus end of microtubule, 490 
Mismatch DNA repair, 887, 889B 
Mitchell, Peter, 609 
Mitchell’s chemiosmotic hypothesis, 609 
Mitochondria, 21F 
of animal cell, 23T 
apoptosis and, 624-626 
ATP-ADP translocase and, 618, 619F 
electron transport and, 620 
of eukaryotic cell, 20 
fatty acyl group crossing, 702-703 
function of, 8 
glyoxysomes and, 589 
of plant cell, 24T 
in specific compartments, 592-593 
Mitochondrial cytochrome c, 606F 
Mitochondrial disease, 582B 
Mitochondrial FıFo-ATP synthase, 
648-649 
Mitochondrial inner membrane, 610F 
Mitochondrial matrix, 593 
Mitochondrial outer membrane translocon, 
997 
Mitochondrial preprotein, 997F 
Mitochondrial protein, 997-998 
Mitochondrion, 593F 
Mitogen, 829B 
Mitogen-activated protein kinase, 829B, 
1034B 
Mitosis, 20 
Mixed noncompetitive inhibition, 400-401 
Mixed-function oxidase, 762 
Mobile electron carrier, 606 
Mobile element, 886 
MoCo, 769 











Model 
chromosome, 338F 
fluid mosaic, 245-247 
Holliday, 878-880, 879F 
Koshland—Nemethy-Filmer, 473 
for light absorption by chlorophyll, 636F 
lock-and-key, 409-410 
of Rhodopseudomonas viridis, 642F 
sliding filament, 486, 488-489 
Modification 
allosteric regulation and, 462-467, 4625S, 
463F, 464F, 465F, 465S 
post-translational, 993 
Module 
immunoglobulin, 156F 
protein, 1043 
MoFe-protein, 771 
Molar ratios of nucleic acids, 300T 
Molecular activity, 393 
Molecular basis of mutation, 891-893, 
892F-893F, 894F 
Molecular chaperone, 168, 988, 988D 
Hsp70, 988-989, 989F 
Molecular disease 
Duchenne muscular dystrophy, 486B 
sickle cell anemia, 476—478 
Molecular interaction in water, 30 
Molecular mimicry, 973B 
Molecular motor, 490-510 
definition of, 481D 
flagella and, 503-504 
microtubules and, 490-498, 491F, 494B 
muscle contraction and, 481-490. See 
also Muscle contraction 
Molecular tweezers, 489B 
Molecularity of reaction, 386 
Molecule 
amphipathic, 33 
amphiphilic, 33, 33F 
chiral, amino acids as, 70 
immunoglobulin, 176F 
optically active, 82B 
organic, in photosynthesis, 650-656, 
651F, 651G, 653T, 654F, 655F 
recombinant DNA, 354. See also 
Recombinant DNA 
Molten globule, 162 
Molybdenum cofactor, 769 
Monarch butterfly, 282B 
Monitoring, glucose, 663B 
Monoamine oxidase, 1054B 
Monoamine oxidase inhibitor, 1054B 
Monocyte, 749 
Monod, Jacques, 457 
Monolayer, lipid, 244, 244F 
Monomeric protein, 89D 
Monosaccharide, 182-191, 183S 
classification of, 182S, 183 
derivative forms of, 187-191, 188F, 189F, 
191F 
Monoterpene, 229 
Monounsaturated fatty acid, 219D 
Motion in globular proteins, 165-166, 165T 
Motor, molecular, 490-510 
definition of, 481D 
flagella and, 503-504 
microtubules and, 490-498, 491F, 494B 
muscle contraction and, 481—490. See 
also Muscle contraction 








Motor neuron, 1044F 
Motor proteins, 481, 492-497 
Mouse 
fat-free, 851B 
knockout, 884B 
transgenic, 889B 
Movement, capillary, 45 
MRE, 928 
mRNA, 299, 303 
cDNA libraries prepared from, 363, 366 
HIV, 442F 
3'-polyadenylation of, 941 
post-transcriptional processing of, 
940-941 
recognition and alignment of, 967 
single polycistronic, 912-913 
translation in protein synthesis, 
965-976, 965F 
aminoacyl-tRNA binding in, 969-972, 
970F, 973B 
elongation cycle in, 968, 969T 
GTP hydrolysis in, 972 
peptide chain initiation in, 966-968, 
966F, 967F, 969T 
peptide chain termination in, 974, 
975F 
polyribosomes in, 976 
ribosomal subunits in, 974, 976F 
Mucin, 206 
Multicellular organism, 847, 848F, 848T, 
849-853, 851B, 851F 
Multidrug resistance, 283, 285 
Multienzyme system, 517F 
Multifunctional polypeptide in pyrimidine 
synthesis, 828 
Multilamellar vesicle, 244F, 245 
Multimeric proteins, 175F 
Multiple codons, 921 
Multiplexes, 328-329 
Multisubstrate reaction, 409 
MurA, 169F 
Muramic acid, 190, 191S 
Murchison meteorite, 83B 
Murein, 201 
Murine leukemia virus, 377 
Muscarinic receptor, 1047, 1050-1052 
Muscle 
energy metabolism of, 848T 
energy stored in, 698T 
lactate in, 669 
metabolic role of, 848F, 849-851, 850B 
troponin T isoform of, 943 
Muscle contraction, 481-490 
actin and myosin in, 481, 483, 484F 
calcium ion release in, 481, 483 
coiled coils in, 485, 485F 
conformational change in, 497—498 
myosin in, 485F 
sliding filaments in, 486, 487F, 488-490 
Muscle fiber, 489B 
Muscle glycogen phosphorylase, 463 
Muscular dystrophy, 486B 
Mutagen, chemical, 893, 894F 
Mutagenesis 
PCR-based, 375F 
in vitro, 374-375, 375F 
Mutant form of human sulfite oxidase, 
396B 
Mutant protein, 117, 171B 








Mutation 
metabolism affected by, 525, 525F 
molecular basis of, 891—893, 892F—893F, 
894F 
prenyl transferase and, 259B 
MWC model, 457 
Myocyte, heart, 204 
Myofibrils, 481 
Myoglobin, 115F 
hemoglobin compared with, 467-469, 
468F, 468S 
oxygen binding in, 469 
oxygen-binding curve for, 468F, 470 
as oxygen-storage protein, 468 
size of molecule, 90T 
structure of, 113, 468S 
Myohemerythrin, 141F 
Myoinositol, 189 
Myosin, 481, 483, 483F-485F 
ATP hydrolysis and, 488-490 
conformational change and, 497-498 
definition of, 481D 
Myosin head, 484 
Myosin V, 493F, 494—495 
Mpyristic acid, 220T, 222G, 2228 
Mpyristoyl, amide-linked, 257, 258F 
N-Mpyristoylation, 257, 258F 
Myristyl group, 1023 


NAC, 989 

NAD*, in catabolism, 517, 522, 522F 
NAD*-dependent dehydrogenase, 406, 
579B 





NADH 
cytosolic, 620-623, 621F, 622F, 623T 
glycolysis and, 552-553, 553F 
as inhibitor of citrate synthase, 572 
oxidation of, 599-601 
in TCA cycle, 576 
NADH-coenzyme Q reductase, 598 
NADP* 
anabolism and, 517 
photosynthetic reduction, 633 
NADPH, 2, 3S 
in anabolism, 523 
ATP and, 840-841 
in fatty acid synthesis, 722—723 
glucose-6-phosphate and, 691-692, 691F 
hexose synthesis and, 647-648 
NADPH-specific malate dehydrogenase, 
657 
Nak channel, 275-276 
Na*,Kt-ATPase, 278 
Naming 
of carbohydrates, 181 
of chiral centers, 84B 
of chiral molecules, 80 
of enzymes, 384-385 
of genes, 1040B 
of polysaccharides, 194 
of proteins, 1040B 
Nanotechnology, 302B 
Nascent chain-associated complex, 989 
Natively unfolded protein, 168 
Natriuretic peptides, 1021 
ncRNA, 306 
NDP, 296-297, 298F 
Near-attack conformations, 426-427, 445, 
447 
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Negative control system, 917-918 
Negative cooperativity, 458, 500-501 
Negative entropy, 55 
Negative regulation, 915 
Negative supercoiling, 335F 
Negentropy, 52B 
Nelfinavir mesylate, 443S 
Nerve gas, 1052 
Nervonic acid, 220T 
Neuraminic acid, 191 
Neurodegenerative disorder, 988B 
Neurofibrillary tangle, 988B 
Neuroglia, 1043 
Neurological disorders, 988B, 
1054B-1055B 
Neuron, 1043, 1044F 
Neurotransmission pathway, 1044-1056 
Neurotransmitter, 1046, 1047, 1052, 1056, 
1056F 
Neutral pH, 36 
Neutrality, 36 
Neutrophil, 749 
NFTs, 988B 
nhRNA, 940 
Niacin, 524T 
Nicolson, G. L., 245 
Nicotinamide adenine dinucleotide, 517, 
522, 522F 
Nicotinamide adenine dinucleotide phos- 
phate, 517 
Nicotinamide coenzymes, 568B 
Nicotinic acetylcholine receptor, 1047 
Nicotinic receptor, 1047 
Nitrate assimilation, 768, 769—770 
Nitrate reductase, 769 
Nitric oxide 
guanylyl cyclase and, 1024 
hemoglobin and, 477B 
Nitrifying bacteria, 768 
Nitrite reductase, 769 
Nitrocellulose, 201B 
Nitrogen, 768-812 
abundance of, 4 
amino acid synthesis and, 779-810. See 
also Amino acid synthesis 
ammonium and, 774-776, 774F, 775F, 
776F 
Escherichia coli glutamine synthetase and, 
776-779, 778F 
excretion of, 810 
inorganic, 768-774, 769F, 770F, 771F, 
772¥, 773F 
Nitrogen balance, 531 
Nitrogen cycle, 769F 
Nitrogen fixation, 768, 771 
regulation of, 773-774, 773F 
Nitrogenase, 771, 772 
Nitrogenase reaction, 772-773, 772F 
Nitrogenase reductase, 771, 773G 
Nitrogenous base, 291-294, 292F, 2938S, 
294F 
p-Nitrophenylacetate, 431F, 438S 
N-linked glycoprotein, 2058S, 207-208 
N-linked saccharide, 204 
NMDA receptor, 1052F 
NMR spectroscopy, 526-527, 527F 
NMT, 257 
Nobel, Alfred, 201B 
Nomenclature. See Naming 
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Noncoding RNA, 306 
Noncompetitive inhibition, 400 
Noncovalent bond, 11 
Noncovalent interaction, in proteins, 
134-136 
Noncyclic photophosphorylation, 649 
Nonessential amino acid, 781T 
Nonhistone chromosomal protein, 
302-303, 336 
Nonhomologous recombination, 877, 878 
Nonidentical subunit, protein, 173 
Nonlinear Lineweaver—Burk plot, 396B 
Nonoverlapping code, 953F 
Nonpolar amino acid, 74 
Nonreceptor tyrosine kinase, 1023-1024 
Nonreducing end, 192 
Nonsense codon, 953 
Nonsense mediated decay system, 946 
Nonsense suppression, 960-961 
Nonsteroidal anti-inflammatory drug, 
750B-751B 
Nontemplate strand, 907B 
Norepinephrine, 1055B, 1056 
Northern blotting, 364B 
Norvir, 443S 
NPY/AgRP-producing neuron, 854 
NR,, 779 
NRy, 779 
N-terminal analysis, 102-103 
N-terminal end, 100 
NTP, 296-297, 298F 
Nuclear area of prokaryotic cell, 20T 
Nuclear envelope, 527 
Nuclear magnetic resonance 
amino acids characterized by, 82-85, 
84F 
spectroscopy, 526-527, 527F 
Nuclear pre-mRNA splicing, 941-942 
Nuclease, 308-309 
Nucleic acid. See also DNA; RNA 
base sequence of, 298S, 299 
chemical synthesis of, 339-340 
classes of, 299 
definition of, 297D 
hybridization of, 332-333 
hydrolysis of, 307-313 
phosphodiesterases and, 308-309 
restriction endonucleases and, 
310-313, 311T, 312F 
restriction enzymes and, 310 
specificity and, 309 
primary structure of, 10S, 316-320, 
317F, 318F, 321F 
secondary structure of, 320-333 
denaturation and renaturation and, 
330-333, 331K, 333F 
double helix and, 321-323, 322F, 
323F, 326-327, 327F 
left-handed DNA and, 323-324, 325F, 
326 
right-handed DNA and, 323, 324F 
Watson-Crick base pairs and, 321 
of virus, 21 
Nucleoid, 20T 
Nucleoprotein, 120T 
Nucleoprotein filament, 880 
Nucleoside, 294D, 295S 
Nucleoside diphosphate, 296-297, 297S 
in purine synthesis, 820-821 











Nucleoside diphosphate kinase, 576, 666 
Nucleoside monophosphate, 820-821 
Nucleoside triphosphate, 296-297, 297S, 
298F 
in purine synthesis, 820-821 
Nucleosome, 302, 337F 
chromatin and, 336-337 
chromatin-remodeling complexes and, 
932 
in eukaryotic gene activation, 934-935 
Nucleotide, 291-315, 291D 
cyclic, 296, 296S 
nucleic acids and, 297-313. See also 
DNA; RNA 
nucleosides and, 295S 
phosphoramidite chemistry and, 340 
purine, 813-821. See also Purine, synthe- 
sis of 
pyrimidines and purines, 291-294, 292F, 
293S, 295F 
structure and chemistry of, 10S, 
295-297, 296S, 297S, 298F 
sugar, 675, 675G, 675S 
thymine, 833-836, 834B, 835B, 836F 
Nucleotide binding, 832-833 
Nucleotide excision, 887, 891 
Nucleotide sequence 
completed genome, 367B 
DNA, 316-317 
Nucleus 
of animal cell, 23T 
of eukaryotic cell, 20 
function of, 8 
of plant cell, 24T 
steroid hormones and, 763 
Number, linking, 334 
Nutrition, 531-532 
amino acids required in, 781T 
carbohydrates and, 531, 680B 
fad diets and, 532B 
fasting state and, 673 
fiber and, 532 
lipids in, 531 
protein requirement and, 531 
vitamins and. See also Vitamin 


O antigen, 203-204 
Obligate aerobe, 512 
Obligate anaerobe, 512 
Obligate coupling stoichiometry, 841-842 
Obligatory substrate, 405 
Octadecanoic acid, 219 
Octyl glucoside, 245S 
3-Octylthio-1,1,1-trifluopropan-2-one, 
425B 
Odd-carbon fatty acid, 710-712 
OEC, 638 
Okazaki fragment, 864-865, 864F 
Oleander, 282B 
Oleic acid, 219, 220T, 221G, 221S 
Oleoyl alcohol, 230S 
Oligo(a1,4-a1,4) glucanotransferase, 674 
Oligomer, 173 
Oligomeric ion channel, 1047 
Oligomycin, 618 
Oligonucleotide 
degenerate, 362-363 
gene chips and, 367-368 
phosphoramidite chemistry and, 340 


Oligonucleotide synthesis, solid phase, 
341F 
Oligopeptide, 89D 
Oligosaccharide, 181D 
cleavage of, 208-209, 208S 
description of, 213 
disaccharides as, 191-193, 192F 
N-linked, 207-208 
O-linked saccharide, 204, 205S, 206F 
OMP decarboxylase, 828 
Oncogene, 1029B 
Open promoter complex, 908 
Open quaternary structure, 177 
Open system, 48, 49F 
Operational code, 958 
Operator, 913 
Operon, 913 
araBAD, 918-919 
lac, 914F, 914-915 
trp, 914, 920 
Operon hypothesis, 914 
Optical activity, 80 
Optical properties of amino acid, 79-82 
Optically active molecule, 82B 
ORC, 872 
Order for reactant, 386 
Ordered reaction, 403 
Organ specialization, 847, 848F, 848T, 
849-853, 851B, 851F 
Organelle, 1, 8F 
of animal cell, 23T 
function of, 7-8 
of plant cell, 24T 
in prokaryotic cell, 20 
Organic molecule in photosynthesis, 
650-656, 651F, 651G, 653T, 654F, 
655F 
Ori sequence, 355 
Origin of replication, 355, 862-863 
Origin recognition complex, 872 
Ornithine, 783, 784F 
Ornithine transcarbamoylase, 783 
Orotate phosphoribosyltransferase, 828 
Orthogonal synthesis, 119 
Osmosis, 45 
Osmotic pressure, 34, 34F 
Osteoblast, 282 
O-succinylhomoserine, 790 
Outer membrane, 592 
porins in, 255 
Ovalbumin gene, 939 
Overall activity site, 831 
Overlapping code, 953F 
Oxaloacetate, 780S 
NAD*-dependent dehydrogenase and, 
579B 
Oxidase, mixed-function, 762 
Oxidation 
alpha 
of fatty acids, 714, 716 
in Refsum’s disease, 717B 
beta, 701F, 701-702, 704-709, 704F 
of D-glucose, 188S 
of fatty acids, 701-702, 701F, 704F, 
713-714, 714F, 717 
glucose, 622-623, 623T 
migratory birds and, 709-710 
of NADH, 599-601 
of palmitoyl-CoA, 709T 


prostaglandins and, 747 

in Refsum’s disease, 717B 

of succinate, 601-603, 602F 
Oxidation reduction, 636 
Oxidative cleavage of disulfide bridges, 


102F 
Oxidative phosphorylation, 522, 563, 
592-629 
cytosolic NADH and, 620-623, 621F, 
622F, 623T 


inhibitor of, 616-618 
isocitrate dehydrogenase and, 574 
as membrane-associated process, 
592-593 
mitochondrial, 620 
organization of, 597-611, 597F, 598F, 
600B, 604G, 606F, 610F 
proton gradient and, 611-620, 614F, 
616F-618F, 619B, 619F 
reduction potentials and, 593-596, 594F, 
595T 
thermodynamics and, 611 
2,3-Oxidosqualene lanosterol cyclase, 757 
Oxygen 
abundance of, 4 
aerobic organisms and, 512 
cytochrome cand, 606-607 
hemoglobin and, 472B 
in metabolism, 511 
in myoglobin, 469 
in water, 28-29 
Oxygen binding by hemoglobin, 469 
Oxygen cycle, 512 
Oxygen dissociation, 473-475 
Oxygen evolution, 640, 640F 
18Oxygen exchange, 613-614 
Oxygen saturation curve, 474F 
Oxygenation, hemoglobin and, 469, 471 
Oxygen-binding curve 
of hemoglobin, 468K, 470-471 
of myoglobin, 468F, 470 
Oxygen-binding heme protein, 113 
Oxygen-evolving complex, 638, 645, 645F 
Oxygen-storage protein, 468 
Oxyhemoglobin, 473 
Oxytocin, 482B 


P680, 637 
P700, 637, 645 
P870, 642 
P protein, 444 
p21 protein, 260 
P site, 965 
PA700, 1000 
PAF, 226, 227B 
Pair 
diastereomeric, 184 
Watson-Crick base, 3018, 321 
Pairing 
chromosome, 878 
codon—anticodon, 958-961, 959F, 959T, 
961T 
intrastrand base, 342, 344-345, 346, 348 
Palindrome, 327, 916 
Palmitate synthesis, 728F 
Palmitic acid, 219 
beta-oxidation of, 708—709 
structure of, 220T, 221G, 221S 
Palmitoleic acid, 220T, 222G, 222S 


Palmitoyl-CoA 
in fatty acid synthesis, 726-727 
oxidation of, 709T 
sphingolipid synthesis and, 744—745 
Pancreatic juice, 699F 
Pancreatic ribonuclease A, bovine, 94F 
Pantothenic acid, 524T, 570B 
Papillomavirus El protein, 501F 
PAPS, 796 
Parallel beta-pleated sheet, 143 
Parallel chain in chitin, 198 
Parallel sheet in proteins, 143 
Paralogous proteins, 113 
Parkinson’s disease, 600B, 988B 
Particle 
ribonucleoprotein, 941 
signal recognition, 995 
small nuclear ribonucleoprotein, 305 
Partition properties of amino acid mix- 
tures, 85 
Parvalbumin, 1031 
Passenger strand, 375 
Passive diffusion, 271, 272F 
Pasteur, Louis, 82B 
“Pasteur effect,” 560 
Patch recombinant, 880 
Pathway. See also Cycle 
for ammonium biosynthesis, 769F 
anabolic, 520-521 
cyclic electron transfer, 641 
of cyclic photophosphorylation, 650F 
of dTMP synthesis, 834F 
electron-transport, 563 
Embden-Meyerhof, 535, 536F 
gluconeogenesis, 663-665, 664F 
glycolytic, 520 
Hatch-Slack, 656-659 
inorganic nitrogen and, 768-774, 
769F-773F 
Leloir, 556-557 
for light-regulated reduction of Calvin 
cycle enzymes, 655F 
metabolic, 527, 528F, 529F 
methyl-directed, 889B 
neurotransmission, 1044—1056 
of ornithine biosynthesis, 784F 
of palmitate synthesis, 728F 
pentose phosphate, 684—693. See also 
Pentose phosphate pathway 
polyol, 687B 
for purine catabolism, 823, 824F 
saccharopine, 809 
shikimate, 798, 799F 
signal transduction, 257, 992-993, 
1010-1013, 1012F 
transduction, 1031F 
Pauling, Linus, 139, 143B, 419 
PCNA homotrimer, 874F, 874G 
PCNA protein, 874 
PCP, 1052 
PCR, 373-375, 374F, 375F 
PCR-based mutagenesis, 375F 
PDZ domain, 1004 
Pelouze, Theophile, 201B 
Penicillin, 402F, 403 
Pentose, 294 
Pentose phosphate pathway, 684-693, 
685F 
gluconolactonase and, 684 
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glucose-6-phosphate dehydrogenase 
and, 684, 685F 
in liver and adipose cells, 684 
nonoxidative steps in, 686-690 
oxidative steps in, 684—686, 685F 
6-phosphogluconate dehydrogenase 
and, 684, 686 
phosphopentose epimerase in, 687-688 
phosphopentose isomerase in, 686-687, 
687F 
transaldolase in, 689-690 
transketolase in, 688-689, 688F 
Pentraxins, 213T, 215 
Pentulose, 183 
PEP 
chorismate synthesis and, 798 
formation of, 63, 63F 
in glycolysis, 549-550, 549F, 551F 
hydrolysis of, 57 
PEP carboxykinase, 664, 666-667, 666F 
PEP carboxylase, 582, 583F 
Pepsin, 384, 437 
Peptidase, 994, 996-997 
Peptide 
agouti-related, 854 
chemistry of, 89 
classification of, 89 
leader, 994 
polypeptide chains of, 89 
signal, 996-997 
solid-phase synthesis of, 118F 
vasoactive intestinal, 1056 
Peptide acid hormone, 1008D 
Peptide bond, 71D 
handedness of, 138B 
of protein, 86-87, 87F, 88F, 136 
Peptide chain 
elongation of, in eukaryotes, 981 
initiation of, 966-968, 966F, 967F, 969T 
in eukaryotes, 976-979, 977F 
in maltoporin, 255F 
termination of, 974, 975F 
in eukaryotes, 981 
in protein synthesis, 974, 975F 
Peptide mass fingerprinting, 108-109 
Peptide neurotransmitter, 1047 
Peptidoglycan, 201, 202S 
Peptidyl site, 965 
Peptidyl transfer, 969 
Peptidyl transferase, 964, 969, 971f 
Peptidyl transferase center, 969 
Peptidyl transferase reaction, 411-412, 
412F 
Peridinin-chlorophyll protein, 169F 
Peripheral protein, 246 
Periplasmic space, 203 
Perkin condensation, 571 
Permanent dipoles, 13 
Peroxisomal beta-oxidation of fatty acids, 
714 
Peroxisome, 20, 23T, 24T 
Peroxisome proliferator-activated receptor 
ô, 708B 
Pest control, 257B 
PET, 555B 
PFK-2, 671 
pH, 35-40, 36T 
alpha-helix and, 142 
of amino acids, 71, 74 
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pH (continued) 
buffers and, 41—44, 41F 
of common fluids, 36T 
denaturation of DNA and, 331 
enzymatic activity and, 396-397, 397F 
gastric Ht ,Kt-ATPase and, 280 
of glycine, 76 
hydrolysis of ATP and, 64, 64F 
respiration and, 44B 
standard-state free energies and, 54—55 
Phage T4 DNA ligase, 357 
Pharmaceuticals, enzymes and, 386 
Phase transition, 263-264, 264T 
Phencyclidine, 1052 
Phenyl fatty acid, 701F, 704F 
Phenylalanine 
catabolism of, 809-810, 809F 
pK, value for, 77T 
structure of, 73S 
synthesis of, 798-799, 800F 
tRNA, 346F-347F, 347G 
ultraviolet light absorbed by, 82 
Phenylalanine-4-monooxygenase, 799 
Phenylketonuria, 810B 
Phenylthiohydantoin amino acids, 86, 86F 
Pheophytin, 640 
Phorbol ester, 1036F 
Phosphatase, naming of, 384 
Phosphate, enol, 63 
Phosphate compound, high-energy, 56, 57T 
Phosphate ester of monosaccharides, 189 
Phosphate system, 41—42 
Phosphate-linked base, 342S 
Phosphatidic acid, 223G, 223S, 224 
Phosphatidylcholine, 225G, 1031 
Phosphatidylethanolamine, 225, 738, 741, 
741F 
Phosphatidylglycerol, 225G 
Phosphatidylinositol, 225G, 737, 1029 
Phosphatidylinositol-4,5-bisphosphate, 1029 
Phosphatidylserine, 741, 741F 
Phosphinothricin, 801B 
Phosphite-linked base, 342S 
3'-Phosphoadenosine-5'-phosphosulfate, 
796 
Phosphodiester, cyclic, 296, 296S 
Phosphodiester bridge, 297, 298S 
Phosphodiesterase, 308-309 
Phosphoenolpyruvate, 549, 583F 
chorismate synthesis and, 798 
formation of, 63, 63F 
free energies of hydrolysis of, 57T 
in glycolysis, 549-550, 549F, 551F 
Phosphoenolpyruvate family of amino 
acids, 781T 
Phosphoethanolamine transferase, 741 
Phosphofructokinase, 542-543, 542F, 542G, 
543B, 543F-544F 
Phosphofructokinase-2, 671 
Phosphoglucoisomerase, 541, 541F, 543B 
Phosphoglucomutase, 556 
Phosphoglucomutase reaction, 447B 
6-Phosphogluconate dehydrogenase, 684, 
686, 686F 
Phosphoglycerase kinase, 547-548, 547F 
3-Phosphoglycerate, 650, 793 
3-Phosphoglycerate dehydrogenase, 793 
Phosphoglycerate family of amino acids, 
793—796, 796F 
Phosphoglycerate kinase, 167F 


Phosphoglycerate mutase, 431T 
in glycolysis, 547F, 548, 549F 
Phosphoglycolohydroxamate, 426F 
Phosphohistidine, 576 
3-Phosphohydroxypyruvate, 793 
Phospholipase, 234B, 234F, 235, 1028 
Phospholipase Aj, 234F 
Phospholipase As, 234F 
Phospholipase C, 1029-1031 
Phospholipid, 223-227, 234B, 234F 
eukaryotes synthesizing, 741-743 
glycerophospholipid, 223-227, 223G, 
2238, 2248, 225F, 225S, 226F, 227B 
Phosphomannoisomerase, 556 
Phosphopantetheine group, 727 
Phosphoprotein phosphatase-1, 466, 679 
Phosphoramidite, 340 
Phosphoramidite derivative of nucleotides, 
3428 
Phosphorelay system, 1034B 
Phosphoribosyl transferase, 821 
Phosphoribosyl-anthranilate, 801 
Phosphoribosyl-anthranilate transferase 
reaction, 799 
Phosphoribulose kinase, 653 
Phosphoribulosylformimino-5-aminoimida- 
zole-4-carboxamide ribonu- 
cleotide, 804 
Phosphoric acid 
hydrolysis of anhydrides of, 59-60 
titration curve for, 40, 41F 
Phosphoric-carboxylic anhydride, 61, 63 
Phosphorolysis, 462 
Phosphoryl group transfer reaction, 58F 
Phosphorylase 
backbone structures of, 167F 
glycogen, 462S, 462-467, 463F, 463S, 
464F, 674, 675F 
Phosphorylase a, 466 
Phosphorylase b, 466 
Phosphorylase cascade, 681-683 
Phosphorylase reaction, starch, 196S 
Phosphorylation 
of acetyl-CoA carboxylase, 723F, 726-727 
description of, 461-462 
of glucose, 539 
in histone code, 934 
oxidative. See Oxidative phosphorylation 
of phosphofructokinase, 542-543, 542F, 
542G, 543B, 543F-544F 
in TCA cycle, 575-576 
Phosphorylation potential, 844 
Phosphorylation—dephosphorylation cycle, 
736, 1036 
3-Phosphoserine, 793 
Phosphotriesterase, 159F 
Photoautotroph, 512T 
Photochemical apparatus, 841 
Photoheterotroph, 512T 
Photolyase, 890 
Photophosphorylation, 643, 643F, 648, 
649-650, 649F 
Photoreactivating enzyme, 890 
Photorespiration, 656-659, 656D 
Photosynthesis, 630-661 
carbon dioxide in, 650-659 
limited fixation of, 656-659, 657F, 
658F 
organic molecules from, 650-656, 
651F, 651G, 653T, 654F, 655F 


chlorophyll-captured solar energy in, 
633-634, 634F, 634S, 635F, 6355S, 
636-637, 636F 
general properties of, 630-633 
light-driven ATP synthesis in, 649F, 650F 
photosynthetic reaction centers in, 
641-647, 642F, 643K, 643G, 646F 
photosystems in, 637-641, 638F, 640F 
quantum yield of, 647-648 
Photosynthetic pigment, 634, 636 
Photosynthetic reaction center, 641-647, 
642F, 643F, 643G, 646F 
Photosynthetic unit, 637, 637F 
Photosystem, 637-641, 638F, 640F 
Phototroph, 511D, 841 
Phototrophic organism, 56 
pH-rate profile, 440F 
Phylloquinone, 645 
Phytanic acid-a-hydroxylase, 716 
Phytanic acid-a-oxidase, 716 
Phytol, 230S, 633 
PI, 1029, 1031B 
PI-88, 207B 
Pigment, 634, 6358S, 636 
a-Pinene, 230S 
Ping-pong reaction, 404, 406-407 
in fatty acid synthesis, 724 
PIP, 1029 
Pit, coated, 758 
Pitch, 321 
Pizzarello, Sandra, 83B 
pK,, 38, 40, 42, 42F, 77T 
Plane, amide, 87 
Plant 
desert, 659 
isoprene emitted from, 231B 
photosynthesis in, 630-661. See also 
Photosynthesis 
Plaque 
amyloid, 988B 
atherosclerotic, 155B 
dental, 196 
Plasma, 43B 
Plasma membrane, 19, 23T, 24T, 242, 243, 
243F 
Plasmalogen, 227, 743-744, 743F 
Plasmid, 333 
chimeric, 355 
in cloning, 354-360, 354D, 355F, 356F, 
358K, 359F 
recombinant, 357 
as shuttle vector, 360 
Plasmid vector, 356F 
Plastid, 19, 631 
Plastocyanin, 641 
Plastocyanin:ferredoxin oxidoreductase, 
641 
Plastoquinone, 640, 640S 
Platelet-activating factor, 226, 227B, 1038F 
in lipid synthesis, 744, 744F 
Pleated sheet, B-, 142-144, 143B, 144F 
Plot 
Hanes—Woolf, 395, 396F 
Lineweaver—Burk double reciprocal, 395 
Ramachandran, 137, 138F 
Van’t Hoff, 50 
Plus end of microtubule, 490 
P/O ratio, 620 
Point, isoelectric, 97 
Point mutation, 891-892, 892F 


Polar bear, 223B 
Polar solute, 31 
Polarity 
of protein polypeptide, 89 
structural, 10 
of water, 30 
Poly(A) polymerase, 941 
Poly(A) tail, 303, 940 
3'-Polyadenylation of eukaryotic mRNA, 
940-941 
Polyadenylylation, 940 
Polyalanine, 142 
Polyamino acid, 142 
Polycistronic mRNA, single, 912-913 
Poly(Gly-Pro-Pro), 151F 
Polyleucine, 142 
Polylinker, 357 
Polymerase 
core, 907 
DNA, 317, 863-864, 865-870, 865T, 
866F, 867T, 868F, 868G, 869T, 
870B, 870K, 874 
poly(A), 941 
RNA 
in eukaryote, 924-925, 925G, 926T 
in prokaryote, 908, 909F, 910-911 
Polymerase chain reaction, 373-375, 374K, 
375K, 377F, 378F 
Polymerization 
initiation of, 911 
microtubule, 494B 
by open quaternary structures, 177 
Polyneuropathy, familial amyloidotic, 172B 
Polynucleotide, 297 
Polynucleotide chain, cleavage in, 309F 
Polynucleotide fragment, 316-317 
Polyol pathway, 687B 
Polypeptide, 89D 
backbone, 167D 
folding of, 96F, 96G, 146 
Hsp70 chaperones and, 989 
laboratory synthesis of, 117, 119 
polarity of, 89 
in pyrimidine synthesis, 828 
structure of, 10S 
synthesis of, 118F, 119 
Polypeptide chain, 89 
conformation of, 96-97 
in E. coli polymerase, 866 
fragmentation of, 103-105, 104F, 105T 
separation of, 101 
twist in, 166F 
Polypeptide hormone, 1010 
Polyprenol, 231 
Polyprotic acid, weak, 76-78 
Polyribosome, 976 
Polysaccharide, 181D, 194D 
in bacterial cell walls, 201-204, 202S, 
203F 
branched, glycogen and, 674 
cell surface, 204 
description of, 213 
function of, 194-195 
naming of, 194 
storage, 194 
structural, 196-201 
structure of, 10S 
Polysome 
electron micrograph of, 976F 
in protein synthesis, 976, 976F 


Polyunsaturated fatty acid, 219D 
degradation of, 714, 715F 
docosahexaenoic acid as, 736B 
in mammals, 735 

Porcine ribonuclease inhibitor, 169F 

Porin, 255, 258K, 593 

Portal vein system, 680, 680F 

Positive control system, 917-918 

Positive supercoiling, 335F 

Positron emission tomography, 555B 

Postsynaptic cell, 1046 

Post-transcriptional processing of mRNA, 

940-941 
Post-transcriptional regulation of gene 
expression, 979 
Post-translation modification of protein, 
119, 993 

Potassium 

action potential and, 1044-1046, 
1045F 
transport of, 272-274 

Potassium channels, 272, 274, 274F-275F 

Potassium ion, hydrolysis of ATP and, 64 

Potential 
action, 1044-1046, 1045F 
electrical, in neuron, 1044 
electrochemical, 271 
group transfer, 58-59 
phosphorylation, 844 
reduction, 593-596, 594F, 595T 
thermodynamic, 66 
transfer, types of, 59T 

Potentiality, thermodynamic, 382 

Power, catalytic, 383 

Power stroke, 489 

PPT, 801B 

pp60""", 1023 

Precursor 
inosinic acid as, to GMP and AMP, 814, 

815F, 816-818, 820F 
to plasmalogens, 743-744 
zymogens as, 454—455, 454F—455F, 
454T 

Pregnenolone, 238S, 763 

Prehybridization, 364B 

Preinitiation complex, 931-932, 977F, 978 

Prenyl, thioether-linked, 257 

Prenylation reaction, 259B 

Prephenate, 799 

Preproalbumin, 995 

Preprohormone, 1010 

Prepro-opiomelanocortin, 1010 

Prereplication complex, 872 

Pressure, osmotic, 34, 34F 

Presynaptic cell, 1046 

Pribnow box, 908 

Primary structure 
fundamental, 86-89, 87F, 88F, 89, 90T 
of nucleic acids, 316-320, 317F, 318F, 

321F 
of protein, 94F, 100-110. See also Amino 
acid sequencing 

Primase, 869 

Primed synthesis method, 317 

Primer, 317, 865 

Priming reaction in glycolysis, 538, 541F, 

542-543, 542F 

Prion, 893, 895 

Prion protein, 893, 895 

Pro a-2 collagen gene, 940 


Index l-25 


Probe 
heterologous, 363 
in Southern blotting, 364B 
Processive movement, 500 
Prochirality, 224S 
Product formation, 391 
Proenzyme, 434, 454, 454F 
Progesterone, 233, 2348, 763 
Proinsulin, 454, 454F 
Projection 
Fischer, 183-185 
Haworth, 185-187 
Prokaryote 
gene transcription in, 906-912, 909F, 
910F, 911F, 913F 
regulation of, 912-923, 914F, 915F, 
916T, 917B, 917T, 918F, 918G, 
919F, 921F, 922F 
metabolic diversity of, 511 
metabolic pathways in, 527 
mRNA in, 304F 
peptide chain initiation in, 966-968, 
967F 
ribosomes of, 305F, 962-963 
RNA polymerases, 906-907 
translocation of proteins in, 994 
Prokaryotic cell, 17, 20, 20T 
Prokaryotic promoter, 908, 910-911 
Prolactin, 1009F 
Proliferating cell nuclear antigen, 874 
Proline 
hydroxylated residues of, 150S 
pK, value for, 77T 
structure of, 72S, 150S 
synthesis of, 783 
Proline racemase, 423 
Proline racemase reaction, 423F 
Prolyl hydroxylases, 559 
Promoter, 357, 907 
cloning, 357 
in eukaryote, 927, 928F 
prokaryotic, 908, 910-911 
Promoter complex, 908 
Propeller twist, 323, 323F 
Propionyl-CoA, conversion of, 711F 
Prostaglandin, 737, 747, 747B 
Prostaglandin endoperoxide H synthase, 
747, 749F 
Prosthetic group, 81B, 120, 385, 598 
Protease 
aspartic, 437, 440-441, 441F, 441S 
HIV-1, 440F, 441-442 
HtrA, 1003 
naming of, 384 
serine, 434-437, 436F, 438F, 439B 
Proteasome, 947, 998, 1000 
Proteasome activator-700 kD, 1000 
Proteasome inhibitors, 425B, 1003B 
Protein. See also Protein synthesis 
acetylcholine transport, 1050 
actin-anchoring, 486B 
amino acid sequences of 
analysis of, 99-100 
databases of, 109-110 
nature of, 110-117, 110F, 113F, 114F, 
115F, 116G, 117T 
in structure, 136 
annexin, 1031 
biological functions of, 120-123 
biotin carboxyl carrier, 724 
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Protein (continued) 
calcium-binding, 1031 
cAMP receptor, 917 
catabolite activator, 916 
catalytic, 121 
chemistry of, 93 
cholesterol ester transfer, 758 
classification schemes for, 157-159 
of cloned genes, 369-372, 369F, 371F 
coiled-coil motif in, 148B 
conjugated, 120T 
definition of, 89D 
degradation of, 208-209, 998-1005, 
999F, 1000F, 1001F, 1004B, 1004F 
denaturation of, 51T, 159-161, 160F 
DNA-binding, 923 
domains of, 155-156 
EF hand, 1031, 1033 
energy in, 698T 
in eukaryotes, 995 
extrinsic, 246 
in fatty acid synthesis, 727, '727F 
folding of, 987-992 
diseases of, 172B 
of globular protein, 166-167 
Hsp70 chaperones for, 988-989, 989F 
hydrophobic interactions in, 135 
mechanism of, 162-163, 163F 
molecular chaperones for, 168 
N-linked oligosaccharides in, 209B 
polypeptide, 96F, 96G, 146 
thermodynamics, 163-164 
functions of, 120-123 
fusion, 371T 
G 
GPCR signals transduced by, 
1024-1025 
muscarinic receptors and, 1050-1052 
phospholipase C activation by, 1030 
as signal transducer, 1027-1028 
gene regulatory, 935-939, 939F 
globular, 152-170. See also Globular 
protein 
green fluorescent, 81B, 81G, 372 
GTP-binding, 466 
heat shock, 168, 988 
with helix-turn-helix motif, 936 
hybrid, 371F 
hydrophobic, 203 
indirect readout and, 937 
integral, 246 
integral membrane, 270 
intrinsic, 246 
intrinsically unstructured, 168, 1'70F 
iron-sulfur, 598 
isolation and purification of, 97-98, 
98B, 99T 
link, 213 
lipid-anchored, 256-258, 257B, 258K, 260 
location in cell, 994F, 994-998, 996F, 
997F 
membrane, 248-260, 995 
aggregation of, 272-273 
bacteriorhodopsin as, 250-251, 251F 
porins as, 255, 258F 
with single transmembrane segment, 
249-250, 249F 
as switching device, 256-258, 257B, 
258F, 260 


mitochondrial, 997 
motor, 481 
multifunctional, 725 
multimeric, 175F 
mutant, 117, 171B 
noncovalent interactions in, 134-136 
nonhistone chromosomal, 302-303, 336 
nutritional requirement for, 531 
oligomeric state of, 458-459 
oxygen-storage, myoglobin as, 468 
peripheral, 246 
post-translation modification of, 993 
primary structure of 
architectural arrangement of, 93-97, 
94F, 95F, 96G, 97F 
fundamental, 86-89, 87F, 88F, 90T, 93 
prion, 893, 895 
quaternary structure of, 173-178 
polymerization of, 178 
symmetry of, 174, 176F 
variety of, 177 
weak forces in, 174, 176 
ras, 259B, 1027, 1027G 
RecA, 881-883 
receptor, 1043 
regulatory, 913 
ribosomal, 962 
Rieske, 604 
Ruy, 880, 883-885 
scaffolding, 266-267 
secondary structure of, 136-146 
alpha-helix in, 137, 139-142, 140F, 
141F 
amide plane in, 136-137, 137F, 138F 
beta-pleated sheets in, 141F, 142-144, 
143B, 144F 
beta-turns in, 144-145, 145F 
parallel sheets in, 143 
simple, 119 
single-stranded DNA-binding, 863 
size of molecules of, 90T 
SMC, 338-339, 339F 
structure of, 169F 
targeting of molecules by, 121-122 
tertiary structure of, 146-172 
fibrous proteins and, 146-151, 147F, 
149F, 150S, 151F, 151S 
globular proteins and, 152-170. See 
also Globular protein 
marginal stability of, 164 
triage of, 1004B 
Tus, 870 
ubiquitin-carrier, 998 
unfolding of, 55, 55F 
viral transforming, 1023 
whimsical names for, 1040B 
Protein binding, 122 
Protein deposit, diseases with, 988B 
Protein interaction domains, 1011 
Protein isoform, 944 
Protein kinase 
covalent modification and, 453 
description of, 453F, 460-461 
mitogen-activated, 1034B 
Protein kinase A, 460, 460F, 1035 
Protein kinase C, 1029-1031, 1035-1036 
Protein phosphatase 2A, 693 
Protein sequence, conventions used in 
expressing, 907B 


Protein splicing, 896B 
Protein synthesis, 952-986 
aminoacyl-tRNA synthetases in, 953, 
955-958, 955T, 956F, 957G 
codon—anticodon pairing in, 958-961, 
959F, 959T, 961T 
Escherichia coli glutaminyl-tRNA syn- 
thetase in, 958, 958G 
in eukaryotic cells, 976-984, 977F, 978T, 
980B, 980F, 982T 
genetic code and, 952-953, 953F, 954B, 
954T 
G:U base pair in, 958 
inhibitors of, 981-984, 982T 
messenger RNA and, 303 
mRNA translation in, 965-976. See also 
mRNA translation in protein syn- 
thesis 
ribosomes in, 961-965, 962T, 963F, 
965T 
steps in, 965F 
Protein transfer potential, 59T 
Protein translocation, 994 
Protein tyrosine kinase pp60°"*, 1036, 
1036F 
Protein tyrosine phosphatase SHP-2, 
1036-1037, 1036G 
Protein-coding gene, 925-926 
Protein-protein interaction, 123, 372-373, 
922-923 
Proteoglycan, 209-213 
cell growth and, 211-212 
function of, 209-211 
Proteoglycan-hyaluronate aggregate, 213 
Proteolytic cleavage in post-translational 
processing, 993 
Proteolytic enzyme 
digestive, 454-455, 454F 
in polypeptide cleavage procedures, 
105T 
Proteome, 120 
Proteomics, 367B, 513 
Prothrombin, 232B 
Proton gradient 
flagella and, 503-504 
synthesis of ATP and, 611-620, 614F, 
616F-617F, 618F, 619B, 619F 
Proton pump, 283F 
Proton symport, 286 
Proton transport, 604-605 
Proto-oncogene, 1029B 
Provirus, 877 
Prozac, 1054B-1055B 
PrP, 893, 895 
PRPP in Lesch-Nyhan syndrome, 822B 
PS decarboxylase, 741 
PsaD, 645 
Pseudoknot formation, 344 
Pseudosubstrate sequence, 460, 845 
Pseudouridine, 305S, 345 
PSI complex, 637-638, 638F, 640, 645, 
646F, 647 
PSII complex, 637-638, 638F, 640-641 
D-Psicose, 183S 
Pump, proton, 283F 
Pure noncompetitive inhibition, 400 
Purification 
affinity, 97 
protein, 98, 99T 


Purine, 291-294, 292F, 293S 
catabolism of, 821, 823-826, 824F, 826F 
salvage of, 821, 821F 
synthesis of, 802, 804, 813-821 
ATP-dependent kinases in, 820-821 
inosinic acid in, 814, 815F, 816-818, 
819F 
regulation of, 819-820 
Purine nucleoside cycle, 823, 824F 
Purine nucleoside phosphorylase, 823 
Purine ring, 292F 
Puromycin, 983S 
Purple patches, 285 
Pyranose, 184 
Pyranose form 
of D-glucose, 186S 
of D-ribose, 186S 
Pyridoxine, 524T 
Pyrimidine, 291-294, 292F, 293S 
degradation of, 830, 830F 
fluoro-substituted, 835B 
synthesis of, 826-830, 827F, 828F 
Pyrimidine dimer repair, 890 
Pyrimidine ring, 292F 
Pyroglutamic acid, 75S, 76 
Pyrophosphate, 57T 
Pyrophosphomevalonate decarboxylase, 
755 
Pyrrolysine, 75S 
Pyruvate 
gluconeogenesis and, 662 
in glycolysis, 552-553, 553F 
glycolysis production of, 537F 
metabolism of, 552-553, 553F 
synthesis of, 805F 
transamination of, 851F 
in tricarboxylic acid cycle, 564F, 566 
Pyruvate carboxylase, 582, 664-665 
compartmentalized, 666 
in gluconeogenesis, 665-666, 665F 
Pyruvate decarboxylase, 553 
Pyruvate dehydrogenase complex, 566, 
567F, 568B-569B, 571F 
Pyruvate dehydrogenase E3-binding pro- 
tein, 566 
Pyruvate dehydrogenase kinase, 584-586 
Pyruvate dehydrogenase reaction, 566 
Pyruvate family of amino acids, 781T, 793, 
794F, 795F 
Pyruvate kinase, 550-551, 550F, 552F 
Pyruvate-Pi dikinase, 658 
Pyruvic acid, 191S 
PYY3_35, 855 


Q cycle, 603, 604-606, 605F 
Q,-SNARE, 268 
Qp- SNARE, 268 
Quadruplexes, 329, 330F 
Quality control, protein, 1003-1005, 
1004B 
Quantum yield of photosynthesis, 637, 
647-648 
Quaternary structure 
allosteric properties and, 467-468 
of hemoglobin, 469, 471 
of protein, 95, 173-178 


Racker, Efraim, 616 
Radiation, ultraviolet, 82 


Radical, free 
cationic, 636, 638 
in ribonucleotide reduction, 832F 
RAG recombinase, 901 
Ramachandran plot, 137, 138F 
Random coil, 153 
Random reaction, 403 
RANTES, 1042 
Ras protein, 259B, 1027, 1027G 
Rat cartilage proteoglycan, 210F 
Rat growth hormone gene, 889B 
Rate constant, 386 
Rate of enzyme catalysis, 386, 419 
Rate-limiting step, 437 
Ratio, 393 
equilibrium, ATP changing, 67B 
H*/2e, 611 
Rear/ Km 393-394 
molar, of nucleic acids, 300T 
P/O, 620 
Rational drug design, 404B 
Reactant, 386 
Reaction 
aminotransferase, 782B 
anaplerotic, 582, 584 
beta-oxidation, 704-709, 704F, 706F 
bimolecular, 387 
in Calvin—Benson cycle, 653, 653T, 655 
DNA strand exchange, 881-883 
filling up, 582, 584 
fructose-1,6-bisphosphatase, 667 
in gluconeogenesis, 665-669, 667F, 
668F, 669F 
glucose-6-phosphate dehydrogenase, 
684, 685F 
glutamine synthase, 776F 
glycogen phosphorylase, 462-467, 462S, 
463K, 463S, 464F, 674, 675F 
glycogen synthase, 676, 678F 
of glycolysis, 538T 
malate dehydrogenase, 578 
nitrogenase, 772-773, 772F 
nucleoside diphosphate kinase, 576 
PEP carboxykinase, 666F 
6-phosphogluconate dehydrogenase, 
684, 686, 686F 
phosphopentose isomerase in, 686-687, 
687F 
photosynthetic, 632-633, 632F 
priming, 538 
redox, 595-596 
ribulose-1,5-bisphosphate carboxylase, 
651, 651F 
second-order, 387 
serine dehydratase, 807B 
splicing, 942-943 
succinate dehydrogenase, 576 
thymidylate synthase, 836F 
transaldolase, 689-690, 690F 
transamination, 782B 
transketolase, 688-689, 688F 
in tricarboxylic acid cycle, 572T 
UDP-glucose pyrophosphorylase, 676, 
676F 


Reaction center, photosynthetic, 638, 
641-647, 642F, 643F, 643G 

Reaction half-cell, 594 

Reactive-center loop, 171B 

Readout, indirect, 937 
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Reagent, Edman’s, 102 
Reannealing, 331 
RecA, 499 
RecA protein, 881-883, 882G 
RecBCD enzyme, 880 
RecBCD-dependent initiation of recombi- 
nation, 881F 
Receptor 
acetylcholine, 1047, 1048F 
asialoglycoprotein, 209 
G-protein-coupled, 1013, 1015 
lambda, 255 
LDL, 760 
lymphocyte homing, 212 
muscarinic, 1050-1052 
nicotinic acetylcholine, 1050 
nitric oxide, 1024 
signal-transducing, 1012F, 1013-1024, 
1013F, 1013G, 1024B 
TMS, 1015 
Receptor protein, 1043 
cAMP, 917 
Receptor signal transduction, 1024-1034 
Receptor tyrosine kinase, 1016-1017, 1030, 
1030F 
Recognition matrix, 937 
Recognition particle, signal, 995 
Recombinant DNA, 354-381 
artificial chromosomes from, 360 
cloning and, 354—360. See also Cloning 
directed changes in heredity and, 
375-377, 377F, 378B, 378F 
DNA libraries and, 360-368, 361B, 
362F-363F, 364B, 365F, 366F, 
367B, 368F-370F, 371T 
polymerase chain reaction and, 
373-375, 374F, 375F 
Recombinant DNA molecule, 354 
Recombinant plasmid, 355 
Recombinase, 881 
Recombination, genetic, 877-886. See also 
Genetic recombination 
Recombination-activating gene, 901 
Recombination-dependent replication, 
884B, 885 
Recycling factor, ribosome, 974 
Red blood cell, sickle cell anemia and, 
476-478 
Red wine, polysaccharide in, 199B 
Redox couple, 594 
Redox reaction, 595-596 
Reducing end, 192 
Reducing sugars, 188 
Reductase 
aldose, 687B 
nitrogenase, 771, 773G 
Reduction 
of B-carbonyl group, in fatty acid synthe- 
sis, 729-730 
oxidation, transduction of light energy 
and, 636 
Reduction potential, 593-596, 594F, 595T 
concentration and, 596 
Reductive citric acid cycle, 583B 
Reductive cleavage, of disulfide bridges, 
102F 
Reentrant loops, 253 
Reference half-cell, 594 
Refsum’s disease, 717B 
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Regular structure, of protein, 94F 
Regulation 
allosteric. See Allosteric regulation 
autogenous, 920 
of carbon dioxide fixation, 655-656, 
655F 
by enzyme, 383-385 
of gene transcription, 912-923, 914F, 
915F, 916T, 917B, 917T, 918K, 
918G, 919F, 921F, 922F 
of gluconeogenesis, 669-673, 671F, 672F 
of glutamine synthetase, 776-779, 777F, 
777G, 778F 
of glycogen metabolism, 678-683 
of glycogen synthase, 676-679 
of isocitrate dehydrogenase, 586 
metabolic, 16 
negative, 915 
of nitrogen fixation, 773F 
of phosphofructokinase, 542G 
post-transcriptional, 979 
of purine synthesis, 819-820, 820F 
of pyrimidine synthesis, 830 
of tricarboxylic acid cycle, 584-586, 
585F 
Regulators of G-protein signaling, 1038 
Regulatory enzyme 
definition of, 383D 
nonlinear Lineweaver—Burk or 
Hanes—Woolf plots, 395 
Regulatory gene, 914-915 
Regulatory protein, 913 
gene, 935-939, 939F 
Relaxed DNA, 334 
Relaxed form of hemoglobin subunits, 473 
Release factor, 974 
Remodeling, bone, 282-283 
Renaturation of DNA, 331-332 
Rep helicase, 500 
Repair 
of DNA, 887-891, 890F, 892F 
of genetic deficiency, 376, 377, 378B 
Repeat, inverted, 327, 328F 
Replacement therapy, gene, 376, 377 
Replication, DNA, 862-876. See also DNA, 
replication of 
Replication activator protein, 872 
Replication factory, 870 
Replication fork, 863, 868-869 
Replication licensing factors, 872 
Replication switch, 872 
Replicator, 355 
Reporter gene, 371 
Repressible operon, 918 
Repression, 452, 914D 
Repressor 
lac, 915-916, 915F, 917B, 917T 
Trp, 920 
Repulsion, electrostatic, 60, 61F 
Reserpine, 1054B-1055B 
Residue 
amino acid, 89, 433B 
in alpha-helix, 139 
hydroxylated 
of collagen, 150F 
of proline, 150S 
invariant, 113 
sialic acid, 208S 
Resistance, drug, 283, 285 


Resonance 
competing, 60 
peptide bond, 87, 89, 136 
stabilization of hydrolysis products by, 60 
Resonance-stabilized carbanion intermedi- 
ate, 711F 
Respiratory acidosis, 44B 
Respiratory alkalosis, 44B 
Response element, 927, 928-929 
cAMP, 936B 
Restriction endonuclease, 310-313, 311T, 
312F, 313 
Restriction endonuclease EcoRI, 356F 
Restriction enzyme, 310-313 
Restriction fragment size, 312-313 
Reticulum 
endoplasmic, 20 
of animal cell, 23T 
of plant cell, 24T 
signal peptidase and, 996-997 
sarcoplasmic, 280 
Retina, docosahexaenoic acid in, 736B 
Retinal, 285S 
Retrograde translocation, 997 
Retrovirus, 876B 
Reunion of DNA strands, 877-878 
Reverse transcriptase, 363, 876-877 
Reverse transcriptase polymerase chain 
reaction, 373 
Reverse transcriptase-driven synthesis of 
cDNA, 366F 
RF, 974 
RGS proteins, 1039, 1039F 
Rhamnogalacturonan IJ, 199B 
a-L-Rhamnose, 189S 
Rho termination factor, 912 
Rhodopseudomonas viridis photosynthetic 
reaction center, 642-645, 642F, 
643F, 643G 
Rhodopsin, 252F 
Ribbon, extended, 197 
Ribbon structure of protein, 156F 
Ribitol, 189 
Ribitol phosphate, 204 
Riboflavin, 524T 
B-bD-Ribofuranose, 295 
Ribonuclease 
bovine, 152 
unfolding of, 161F 
Ribonuclease A, 94F, 152F, 152G 
size of molecule of, 90T 
Ribonucleic acid. See mRNA; RNA 
Ribonucleoprotein particle, 941 
small nuclear, 305 
Ribonucleotide, 294F, 295D 
Ribonucleotide reductase, 831-832, 832F 
D-Ribose, 186S 
Ribose zipper, 344 
Ribose-5-P, 691-692 
a-D-Ribose-5-phosphate, 189F 
Ribose-5-phosphate pyrophosphokinase, 814 
Ribosomal protein, 962 
Ribosomal RNA, 299, 303-304, 304F, 305F, 
306, 962-963, 963F 
intrastrand base pairing and, 346, 348 
Ribosomal subunit, 974, 976F 
Ribosome 
assembly of, 961-965 
catalytic power of, 970-971 


definition of, 961D 
eukaryotic, 305F, 964-965, 965F 
life cycle of, 976F 
in plant cell, 24T 
prokaryotic, 305F, 962-963 
prokaryotic cell, 20T 
spontaneous self-assembly of, 963 
structural organization of, 961-965, 
962T, 963F, 965T 
translocon and, 995 
tRNA-binding sites on, 966F 
Ribosome binding site, 967 
Ribosome nascent chain:SP complex, 
995-996 
Ribosome recycling factor, 974 
Ribosome RNA 
secondary structure of, 349F 
tertiary structure of, 349F, 349G 
Riboswitches, 350 
Ribothymidine, 305S 
Ribothymidylic acid, 304 
Ribozyme, 410-412, 411F 
Ribulose, 90T 
D-Ribulose, 183S 
Ribulose bisphosphate, 651 
Ribulose bisphosphate carboxylase, 656 
Ribulose-1,5-bisphosphate, 651, 651G 
Ribulose-1,5-bisphosphate carboxylase reac- 
tion, 651 
Ribulose-5-phosphate kinase, 653 
Rieske protein, 169F, 604 
Right-handed DNA, 323, 324F 
Ring 
furanose, 187 
purine, 292F 
pyrimidine, 292F 
Ritonavir, 4438 
RLFs, 872 
RNA 
biological significance of, 307 
of cloned genes, 369-372, 369F, 370F 
definition of, 297D 
double-stranded, 375 
eukaryotic transcript, 305 
hydrolysis of, 307 
messenger. See mRNA entries 
micro, 306 
noncoding, 306 
ribosomal, 303-304, 304F, 305F, 306 
intrastrand base pairing and, 346, 348 
operons of, 963F 
ribozymes, 410-412, 411F 
secondary structure of, 341-350, 345F, 
345S, 346S, 347S, 348F, 349F 
small, 305, 306 
small interfering, 375 
splicing of, 944 
transfer, 304-305 
unusual bases of, 305S 
vincinal -OH groups of, 308F 
RNA editing, 945—946 
RNA enzyme, 411-412, 412F 
RNA expression, 369 
RNA genome, replication of, 876-877, 
876B 
RNA interference, 375 
RNA polymerase 
in eukaryotes, 924-925, 925G, 926T, 930 
I, 924 


II, 924, 925G, 926T 
MI, 924 
in prokaryotes, 908, 909F, 910-911 
RNA polymerase holoenzyme, 906-907 
RNA-directed DNA polymerase activity, 876 
RNA-induced silencing complex, 375 
RNase, 309 
RNase H, 876 
RNase PH, 947 
RNC-SRP, 995-996 
Robinson, Sir Robert, 753B 
Rosanoff, M.A., 82B 
Rosanoff convention, 82B 
Rossman fold, 501 
Rotating molecular motor, 481 
Rotation 
of flagella, 503-504 
optical, 80 
Rotation axis, twofold, 174 
Rotenone, 617S 
Rough endoplasmic reticulum, 21F 
Rous sarcoma virus, 1023 
RRF, 974 
rRNA, 962-963, 963F 
(R,S) system, 81-82, 84B 
R-SNARE, 269 
RT-PCR, 373 
Rubisco, 651 
Rubisco activase, 652 
Rule, Chargaff’s, 300T 
RuvA, 880, 883-885 
RuvB, 880, 883-885 
RuvC, 880, 883-885 
RuvC resolvase, 885 


70S complex, 974 

48S initiation complex, 979F 

80S initiation complex, 979 

43S preinitiation complex, 977F, 978 

20S proteasome, 1000 

26S proteasome, 1000, 1000G 

19S regulator, 1000 

70S ribosome:RF1:RF3-GTP:termination 
signal complex, 974 

Saccharide, 204, 205S, 206F 

Saccharopine, 787 

Saccharopine pathway, 809 

SAGA complex, 933 

SAICAR synthase, 815F 

Salivary a-amylase, 196 

Salmonella, 204 

Salt, bile, 700 

Salt bridge between subunits of hemoglo- 
bin, 473F 

Salting-in of protein, 98 

Salvage of purines, 821, 821F 

SAM, 790, 997-998 

Sample half-cell, 594 

Sanger, F., 316 

Sanger’s chain termination, 317-320 

Saponification, 222 

Saquinavir, 425B, 443S 

Sarcolemma, 483 

Sarcoma virus, Rous, 1023 

Sarcomeres, 483 

Sarcoplasmic reticulum, 280, 483 

Sarin, 1051S, 1052 

Saturated fatty acid, 219D 

Saturation behavior, 271 


Saturation curve, 122, 122F 
oxygen, 474F 
substrate, 389 
Saturation effect, 389 
Saxitoxin, 1049B 
Scaffolding proteins, 266-267 
Schiff base linkage, 285S 
Schizosaccharomyces pombe, 845 
Schonbein, Christian, 201B 
Schultze, Major E., 201B 
SCID, 378B, 822B 
SCOP, 158 
Scramblases, 262 
Screening of DNA library, 362, 362F, 370 
Scurvy, 155B 
SDS-PAGE, 131 
SDS-polyacrylamide gel electrophoresis, 
130-131 
Seawater, 5T 
Sec61 complex, 996 
SECIS element, 954B 
Second genetic code, 953, 955 
Second messenger 
calcium as, 1031 
cyclic AMP as, 466, 1025-1026, 1026F, 
1026G, 1026T 
intracellular, 1026T 
phosphatidylcholine generating, 1031 
released by phospholipase, 1028 
Secondary active transport, 286-287 
Secondary structure 
of DNA, 320-333. See also DNA, sec- 
ondary structure of 
of proteins, 136-146. See also Protein, 
secondary structure of 
of RNA, 341-350, 345F, 345S, 346S, 
347S, 348F, 349F 
Second-order reaction, 387 
Secreted polypeptide hormone, 1010 
Secretion 
insulin, 680 
of pancreatic juice, 699F 
retrograde translocation preventing, 
997 
Secretory protein, 995 
Sedimentation coefficient, 132-133, 303 
Sedoheptulose-1,7-bisphosphatase, 653 
Selectable marker, 355 
Selectins, 213T, 214, 214F 
Selectivity filter, 272, 274 
Selenocysteine, 75S, 954B 
Self-replication, 3-4 
Self-splicing, 411 
Sensor, 923 
Separation, protein, 97-98, 98B 
Sequence 
consensus, 908D, 928T 
DNA 
cloning of, 355-356, 356F 
gene regulatory proteins and, 
935-939, 939F 
insertion, 886 
intervening, 303 
nucleic acid, 907B 
signal, 994 
stop-transfer, 997 
upstream activation, 927 
Sequence tag, expressed, 366 
Sequence-specific transcription factor, 937 
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Sequencing 
amino acid, 100-110. See also Amino 
acid sequencing 
DNA, 318F 
automated, 318-320 
genome, 17-18 
Sequencing gel, dideoxy, 318-320 
Sequential reaction, 403 
Serglycin, 210F 
Serine 
catabolism of, 804 
ethanolamine and, 741 
in green fluorescent protein, 81B 
in lipid synthesis, 741 
pK, value for, 77T 
sphingolipid synthesis and, 744-745 
structure of, 72S 
synthesis of, 793, 796F 
Serine acetyltransferase, 795 
Serine dehydratase reaction, 807B 
Serine hydroxymethyltransferase, 816B 
Serine phosphatase, 793 
Serine protease, 434, 436F, 437, 438K, 439B 
Serotonin, 75S, 76 
Serpin, 171B 
Serum albumin, 90T 
Serum cholesterol, 761 
Seryl tRNA synthetase, 148B 
Sesquiterpene, 229, 230S, 231 
Severe combined immunodeficiency syn- 
drome, 378B, 822B 
Sex hormone, 763 
SH modification in cleavage of disulfide 
bridges, 102F 
Sheet 
B. See Beta sheet 
in globular proteins, 152-153, 152G, 
154G 
parallel, in proteins, 143 
Shift 
chemical, 83-84, 85F 
hyperchromic, 330 
keto-enol tautomeric, 293 
Shikimate pathway, 793, 799F 
Shine-Dalgarno sequence, 967, 967F 
Short-chain acyl-CoA dehydrogenase, 705 
Shuttle, malate—aspartate, 621 
Shuttle vector, 360, 360F 
Sialic acid, 1918, 208S, 228 
Sickle cell anemia, 476-478 
Side chain, amino acid, 78—79 
Sigma factors, 907 
Sigmoid curve, 456 
Signal 
GPCR, 1024-1025 
polyadenylation, 940 
Signal peptidase, 994, 996-997 
Signal peptide, 996 
Signal receptor, 994 
Signal recognition particle, 994, 995 
Signal sequence, 994 
Signal transduction pathway, 257, 992-993, 
1010-1013, 1012F 
Signal transduction unit, 1025 
Signalsome, 1011 
Signal-transducing receptor, 1012F-1013F, 
1013-1024, 1013G, 1024B 
Silencer, 927 
Silencing, gene, 306-307 
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Silk, spider, 144 
Silk fibroin, 149F 
Simple protein, 119 
Simple sugar, 181D 
Singer, S. J., 245 
Single polycistronic mRNA, 912-913 
Single transmembrane segment, 249-250, 
249F 
Single-displacement bisubstrate mecha- 
nism, 403F 
Single-displacement reaction, 403, 403F, 
405-406 
Single-strand assimilation, 882-883 
Single-stranded DNA 
damage, 887 
description of, 331 
Single-stranded DNA-binding protein, 863 
siRNA, 306 
Siroheme, 770 
Site 
active, 389, 398 
catalytic, 178 
cloning, 355 
substrate-binding, 398 
Sitosterol, 236S 
Size exclusion chromatography, 130-131 
Skeletal muscle, 481-490. See also Muscle 
contraction 
energy metabolism of, 848T 
metabolic role of, 849-851, 850B 
structure of, 482F 
Skeletal muscle troponin T isoform, 945 
Sliding filament, 486, 487F, 488-490 
Slow-twitch muscles, 708B 
Small interfering RNA, 306, 375 
Small intestine, 699F 
Small nuclear ribonucleoprotein particle, 
306 
Small RNA, 306 
SMC proteins, 338-339, 339F 
SMC1/SMC3 heterodimers, 339 
SMC2/SMC4 heterodimers, 339 
Smooth endoplasmic reticulum, 21F 
Smooth muscle effectors, 482B 
Snake venom, 235B, 308F 
SNARE, 268-269, 269F 
snRNP, 305, 943 
Soap, 222 
Sodium, action potential and, 1044-1046, 
1045F, 1046F 
Sodium channel toxin, 1049B 
Sodium ion 
hydrolysis of ATP and, 64 
secondary active transport and, 286 
Solar energy. See Light energy 
Solid phase oligonucleotide synthesis, 341 F 
Solid-ordered state, 262 
Solid-phase synthesis, of peptide, 118F 
Solubility, of globular protein, 98F 
Soluble protein, unfolding of, 55, 55F 
Solute 
colligative properties and, 33-34 
polar, 31 
water properties and, 33 
Solution, double helix in, 327 
Solvent, water as, 30-34, 44 
Soma, 1044 
Somatic recombination, 877 
Somatostatin, 1009F 
Sorbitol, 189 


bD-Sorbose, 183S 
Sorting and assembly complex, 997-998 
Southern, E. M., 364B 
Southern blotting, 364B 
Southern hybridization, 362-363 
Space 
intermembrane, 592 
thylakoid, 631 
Specialization, organ, 847, 848F, 848T, 
849-853, 851B, 851F 
Specific acid-base catalysis, 430 
Specific activity, 393 
Specific linking difference, 335 
Specificity 
definition of, 383 
enzyme, 383 
Specificity site of ribonucleotide reductase, 
832-833 
Spectrin, 265 
Spectrometry 
amino acid properties, 82-85 
mass, 105-109, 106T, 107F-108F, 109F 
Spectroscopy, nuclear magnetic resonance, 
526-527, 527F 
Spectrum, absorption, of nucleic acid 
bases, 293-294, 293F 
Spermaceti, 229B 
Sphingolipid, 227-229, 228F, 737, 744—745, 
745F 
Sphingomyelin, 227, 746F, 1031 
Sphingomyelinase, 1031 
Sphingosine, 227 
Sphingosine-1-phosphate, 235 
Spider web, 144 
Spirogyra, 632F 
Spliceosome, 943, 943F 
Splicing, 411F, 939-940 
nuclear pre-mRNA, 940-941 
protein, 896B 
Splicing reaction, 942-943 
Split gene, 939-940 
Squalene, 230S, 231, 752, 753, 754F 
Squalene monooxygenase, 757 
SRP, 995 
SRs, 994 
SSB protein, 863 
S-shaped curve, 456 
Stability of quaternary structure, 177 
Stabilization 
of hydrolysis products, 60 
transition-state, in serine protease, 439B 
Stacking, coaxial, 344 
Standard reduction potential, 59T 
Standard state, 50 
Standard-state free energy, 54-55 
Standard-state free energy change, 53 
Stanozolol, 764, 764F 
Staphvax, 207B 
Staphylococcal protease, 103 
Starch, 195-196, 196S 
catabolism of, 673-674, 673F 
Starch phosphorylase, 195 
State function, 49 
Statins, 424B, 755B 
Statistical thermodynamics, 51 
Stator, 612 
Steady state, 2 
Steady-state assumption, 390 
Stearic acid, 6F, 219, 220T, 221G, 2215S, 
222G, 222S 


Stearoyl-CoA desaturase, 734 
l-Stearoyl-2-oleoyl-phosphatidylcholine, 
226, 226F, 226G 
Steitz, Thomas, 870B 
Stem-loop structure, 342, 912 
Stereochemistry 
of amino acid, 79-82 
of monosaccharides, 183-184 
Stereoisomer, 83B 
Stereospecificity in NAD*-dependent dehy- 
drogenase, 579B 
Steroid 
anabolic, 764, 764F 
cardiotonic, 280, 280G, 280S, 282B 
structure and function of, 233, 234F, 
234S, 236B, 238S 
Steroid hormone 
action of, 1008-1009 
definition of, 1008D 
synthesis of, 762—764 
Stigmastanol, 236S 
Stoeckenius, Walther, 616 
Stoichiometry, 841-843, 841D 
Stomata of desert plant, 659 
Stop-transfer sequence, 997 
Stop-transfer signal, 994 
Storage, long-term memory, 936B 
Storage granule, 20T 
Storage polysaccharide, 194 
Stored metabolic fuel, 698T 
Strain, structural, 422 
B-Strand, 95F 
Strand exchange, 879 
Streptococcus lividans, 274, 274F 
Streptomycin, 194F, 983S 
Stroma, 631 
magnesium ions moving into, 656 
Strong electrolyte, 46 
Structural complementarity, 14, 14G 
Structural gene, 914-915 
Structural genomics, 172B 
Structural polarity, 10 
Structural polysaccharide, 196-201 
Structural strain, 422 
Strychnine, 1053 
Substrate 
binding of, 389 
definition of, 383 
of gluconeogenesis, 662, 672-673 
in glycolysis, 554-558, 556F, 558B 
leading, 405 
obligatory, 405 
suicide, 401-403 
Substrate saturation curve, 389 
Substrate specificity site, 831, 832-833 
Substrate-binding site, 398 
Substrate-level control mechanism in gluco- 
neogenesis, 669-671, 671F 
Substrate-level phosphorylation, 575-576 
Subunit 
cytochrome, 642F 
of E. coli polymerase IIT homoenzyme, 
867T 
identical, dimer of, 441 
of prokaryotic ribosomes, 961-962 
protein, 95, 173 
Succinate 
oxidation of, 601-603, 602F 
structure of, 150S 
in TCA cycle, 576 


Succinate dehydrogenase 
competitive inhibition and, 399-400, 
400F 
as FAD-dependent, 576 
Succinate dehydrogenase reaction, 576 
Succinate thiokinase, 575 
Succinate-coenzyme Q reductase, 598 
N-Succinyl-5-aminoimidazole-4-carboxamide 
ribonucleotide, 815F 
Succinyl-CoA 
amino acid catabolism and, 807 
oxidation of, 711F, 712 
Succinyl-CoA synthetase, in TCA cycle, 
575-576 
Sucrose, 192F, 192G, 1928S, 193, 408 
Sugar 
amino, 190 
deoxy, 189 
simple, 181D 
structure of, 10S 
Sugar acid, 187-188, 188S 
Sugar alcohol, 189, 189S 
Sugar code 
description of, 213 
lectin translation of, 213 
Sugar ester, 189, 189S 
Sugar nucleotide, 675, 675G, 675S 
Suicide substrate, 401-403 
Sulfate 
assimilation of, 797F 
sulfide synthesis from, 796-797 
Sulfatide, 228 
Sulfide synthesis, 796-797, 797F 
Sulfite oxidase, wild-type, 396B 
Sulfmeturon methyl, 801B 
Sumoylation, 933 
Supercoil, 333-336, 335F, 336F, 911F 
Superfamily, 158, 499, 499T, 500F 
Superhelix density, 335 
Suppressor tRNA, 960-961 
Supramolecular complex, 6, 8F 
membrane as, 9 
membrane-spanning, 638 
virus as, 21-22 
Surface tension, of water, 45 
Switch 
membrane protein as, 256-258, 257B, 
258F, 260 
replication, 872 
Symmetry, 174 
Symmetry model, 457-459 
Symport, 286 
Synapse, 1044, 1046-1047 
Synapsis, chromosome, 879, 879F 
Synaptic bulb, 1044 
Synaptic cleft, 1044, 1048, 1050 
Synaptic junction, 1052 
Synaptic knob, 1044 
Synaptic terminal, 1044 
Synaptotagmin, 269 
Synchrony, 204 
Syndecan, 209-210, 210F 
Synechococcus elongatus, 644—645, 644F 
Synonymous codon, 953 
Synthesis 
amino acid, 779-810. See also Amino 
acid synthesis 
of ATP, 611-620, 614F, 616F-617F, 618F, 
619B, 619F 
of cDNA, 366F 


cholesterol, 750-757 
dehydration, 191 
DNA, deoxyribonucleotide formation in, 
830-833, 831K, 833F 
of eicosanoids, 747-750 
fatty acid, 722-737. See also Fatty acid 
synthesis 
of fructose-2,6-bisphosphatase, 672F 
of gene, 340 
glycogen, 675-677, 675G, 675S, 676F, 
680-683 
of hexose, 647—648, 653, 655 
of hybrid protein, 371F 
lethal, 834B 
macromolecular, 839-840 
of nucleic acid, 339-340 
polypeptide, 118F, 119 
protein, 952-986. See also Protein 
synthesis 
of pyrimidines, 826-830, 827F, 828F 
of secretory proteins, 995 
solid phase oligonucletotide, 341F 
of thymine nucleotide, 833-836, 834B, 
835B, 836F 
Synthesis method, primed, 317 
Synthetase 
Acyl-CoA, 702 
arginosuccinate, 784 
glutamine, 774, 775F 
regulation of, 776-779, 777F, 777G, 
778F 
Synthetic combinatorial library, 361B 
Synvinolin, 755B 
Szent-Györgyi, Albert, 488 


T lymphocyte, 897 
Tabun, 1051S, 1052 
TAF, 930 
Tag, expressed sequence, 366 
p-Tagatose, 183S 
Tail, myosin, 484 
p-Talose, 182S 
Tamiflu, 425B 
Tandem enzyme, 671 
Tandem mass spectrometry, 107-108, 
109F 
Tangle, neurofibrillary, 988B 
TATA box, 927, 927F 
Taurocholic acid, 761F 
Taut form of hemoglobin subunits, 473 
Tautomeric shift, keto-enol, 293 
Taxol, 494B 
TBP, 930 
Teichoic acid, 204 
Telomerase, 875 
Telomere, 299D, 340B, 874, 875B, 875F 
Temperature 
enzymatic activity and, 397, 397F 
melting, 331 
of water, 44-45 
Template, 317 
DNA, 862, 862D 
Template DNA, 908, 910-911 
Template strand, 907B 
Tense form of hemoglobin subunits, 
473 
Ter region, 869-870 
Terminal, synaptic, 1044 
Terminal chain segment, glycogen, 
677-678, 678F 
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Termination, 966 
peptide chain, 974, 975F 
in eukaryotes, 981 
in protein synthesis, 974, 975F 
Sanger’s chain, 317-320 
Termination codon, 953 
Termination site, 912 
Terminator, 922 
Terpene, 229-232, 230S, 231B, 231S, 232B 
Tertiary structure 
of DNA. See also DNA, tertiary structure 
of 
of protein, 95, 146-172. See also Protein, 
tertiary structure of 
of rRNA, 348, 349G 
of transfer RNA, 345, 346F, 348 
Testosterone, 233, 234S, 238S, 1009F 
Tetracycline, 983S 
Tetrahedral oxyanion transition state, 439B 
Tetrahydrofolate, 816B 
Tetraloop, 343 
Tetrodotoxin, 1049B 
Tetrulose, 183 
TF, 988 
TFII, 930, 933 
TFUI, 925 
Therapy, gene, 376, 377, 378B 
Thermal cycler, 373 
Thermal denaturation, of DNA, 330-331, 
333F 
Thermal properties of water, 44—45 
Thermoacidophile, 19 
Thermodynamic parameters for protein 
denaturation, 51T 
Thermodynamic potential, 66, 382 
Thermodynamics, 48-69 
ATP hydrolysis and, 64 
coupled processes and, 66 
electron transport and, 611 
enthalpy and, 49-51 
first law of, 48-49 
of glycolysis, 538T 
high-energy biomolecules and, 56, 57T 
parameters of, 55-56 
pH and, 54-55 
requirement for ATP and, 66 
second law of, 51-52 
third law of, 52 
Thermogenin, 619B, 852 
Thiamine, 524T 
Thiamine pyrophosphate, 350F, 568B 
Thick filament, 484—485 
Thin filament, 483F, 484 
Thioester-linked fatty acyl, 257 
Thioether-linked prenyl, 257 
Thiokinase, fatty acid, 702 
Thiolase, 707, 752-753 
Thiolase reaction, 707F 
Thioredoxin, 831-832 
Thioredoxin reductase, 831, 832 
4-Thiouridine, 305S 
Third-base degeneracy, 953 
Threonine, 790, 791F 
pK, value for, 77T 
stereoisomers of, 83B 
structure of, 73S 
synthesis of, 790, 791F 
Threonine deaminase, 793 
Threonine synthase, 790 
b-Threose, 182S 
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Thrombin, 113 
Thromboxane, 747 
Thylakoid lumen, 631 
Thylakoid membrane, 631 
Thylakoid space, 631 
Thylakoid vesicle, 631, 656 
Thymidine, 294 
Thymidylate synthase, 169F, 835, 836F 
Thymidylate synthase reaction, 836F 
Thymine, 292, 292F, 301S 
Thymine nucleotide synthesis, 833-836, 
834B, 835B, 836F 
Tight turn in protein, 145 
TIM, 998 
TIM barrel, 159 
Titration, 78F, 79F 
Titration curve, 39 
7-TMS protein, 1015 
TMS receptor, 1015 
Toluene, 56 
TOM, 997 
Tomography, positron emission, 555B 
Top-of-power stroke, 489 
Topoisomerase, 334 
Topology, supercoil, 333-334 
Total adenine nucleotide pool, 843-844 
Toxic shock syndrome, 227B 
Toxin 
diphtheria, peptide chain termination 
in, 980B 
tetrodotoxin, 1049B 
TPP-dependent transketolase reaction, 689F 
Tracer, isotopic, 525-526, 525T, 526F 
TRAM, 996 
Transaldolase, 689-690, 690F 
Transamination of pyruvate to alanine, 
851F 
Transamination reaction, 782B 
Transcarboxylase, 724 
trans-complex, 269 
Transcriptase, reverse, 363 
Transcription, 906-951 
DNA, 291, 292F 
eukaryotic, 305, 924-925, 925G, 926T 
initiation of, 931F, 931G 
Mediator effects on, 932 
in prokaryotes, 906-912, 909F, 910B, 
911F, 913F 
regulation of, 912-923, 914F, 915F, 
916T, 917B, 918F, 918G, 919F, 
921F, 922F 
steroid hormones and, 763 
supercoiling vs., 911F 
Transcription activator, 934 
Transcription attenuation, 920 
Transcription factor, 924 
CREB-type, 936B 
general, 929 
heat shock, 929 
sequence-specific, 937 
Transcription start site, 908 
Transcriptional activator, 923 
Transcriptome, 367D 
Transcriptomics, 513 
Transduction 
energy, 522, 636 
of GPCR signals, 1024-1025 
Transduction pathway, 1031F 
signal, 257, 1010-1013, 1012F 


Transfer, peptidyl, 969 
Transfer potential 
group, 58-59 
types of, 59T 
Transfer protein, cholesterol ester, 758 
Transfer RNA. See tRNA 
Transferase 
adenylyl, 777 
peptidyl, 964, 969 
phosphoethanolamine, 741 
Transformation experiment, bacterial, 359F 
Transforming growth factor B, 212 
Transgenic animal, 889B 
Transhydration of fumarase, 577 
Transient receptor potential, 275 
Transition state, 387, 419 
tetrahedral oxyanion, 439B 
Transition-state analogs, 423, 424B—425B 
Transition-state stabilization, in serine pro- 
tease, 439B 
Transketolase, 688-689, 688F 
Translation, 291, 292F 
Translocase 
acylcarnitine and, 702 
ATP-ADP, 618, 619F 
Translocases, 498 
Translocation, 964, 969 
protein, 994 
retrograde, 997 
Translocon, 996 
Transmembrane segment, single, 249-250, 
249F 
7-Transmembrane segment protein, 1015 
Transpeptidation, 969 
Transport 
electron, 592-629. See also Electron 
transport 
membrane, 260, 269-287. See also 
Membrane, transport across 
Transposable element, 886 
Transposition, 877, 886 
Transposon, 886, 886F 
Transverse asymmetry, 260-261 
Transverse tubules, 483 
Tranylcypromine, 1055B 
Treadmilling, 490 
Trehalose, 193B 
Triacylglycerol, 737, 757-758 
energy stored as, 697 
fatty acids liberated from, 698F 
structure and function of, 222, 222G, 
2225S, 223 
Triacylglycerol lipase, 700 
Triage, protein, 1004B 
Triantennary oligosaccharide, 209 
Tricarboxylic acid cycle, 532B, 563-591 
aconitase reaction in, 573, 573K, 573G 
chemical logic of, 564 
citrate synthase in, 571-574, 571F, 572G 
diagram of, 565F 
energetic consequences of, 578-581 
enzymes and reactions of, 572T 
fluoroacetate blocking, 573-574 
intermediates in, 581-582 
isocitrate dehydrogenase in, 574, 574F, 
574G 
isocitrate formation in, 572-574 
a-ketoglutarate dehydrogenase in, 575 
overview of, 563-564, 564F 


oxaloactetate and, 575-578 
oxidation of succinyl CoA and, 712 
pyruvate and, 566 
pyruvate dehydrogenase complex in, 
71F 
regulation of, 584-586, 585F 
Trigger factor, 988 
Trioleoylglycerol, 222 
Triose, 183 
Triose kinase, 555 
Triose phosphate isomerase, 159F, 394T 
backbone structures of, 167F 
in glycolysis, 544, 545F, 545G 
Triple helix of collagen, 150-151, 151F, 151S 
Triplet code, 952-953 
Tristearin, 222G, 222S 
Tristearoylglycerol, 222 
Triterpene, 230S, 231 
Triton X-100, 245S 
tRNA, 299, 304-305 
aminoacyl-tRNA synthetase recognizing, 
956-958 
cognate, 955 
intrastrand base pairing and, 344—345 
structure of, 9538S 
general, 953F 
secondary, 345 
tertiary, 345, 346F, 348 
suppressor, nonsense codons and, 
960-961 
tRNA*“", 958 
tRNA-binding site, 966F 
tRNAM*, 977 
Trojan horse substrate, 402 
Tropocollagen, 150 
Tropomyosin, 483D 
Troponin, 483D 
Troponin C, 849, 1031 
Troponin T isoform, muscle, 945 
trp operon, 914, 920 
trp repressor, 920 
Trypsin 
naming of, 384 
in polypeptide fragmentation, 103, 104F 
as serine protease, 113 
trypsinogen, 434, 435F 
Tryptophan 
pK, value for, 77T 
structure of, 73S 
synthesis of, 799-802, 800F, 802G 
ultraviolet light absorbed by, 82 
Tryptophan synthase, 802, 802F, 802G 
d-Tubocurarine, 1050 
Tubulin, 490 
Tumor. See also Cancer 
positron emission tomography of, 555B 
telomeres and, 340B 
Tumor-suppressor gene, 1029B 
Tunneling, 433B, 802 
Turnover number, 393, 393T 
Tus protein, 870 
Twist, 334 
helical, 323F 
propeller, 323, 323F 
Two-dimensional gel electrophoresis, 
131-132, 131F 
Twofold rotation axis, 174 
Two-state transition, 160 
Tyk2, 1037 


Type 1 diabetes, 717B 
Type 2 diabetes, 717B 
Tyrosine 
catabolism of, 809-810, 809F 
catecholamine transmitters derived 
from, 1056 
in green fluorescent protein, 81B 
pK, value for, 77T 
structure of, 73S 
synthesis of, 798-799, 800F, 801F 
ultraviolet light absorbed by, 82 
Tyrosine hydroxylase, 1056 
Tyrosine kinase, 1015 
nonreceptor, 1023-1024 
phospholipase C activation by, 1030 
Tyrosine kinase pp60°"*, 1036, 1036F 
Tyrosine kinase pp60"**, 1023 
Tyrosine kinases, 461 


Ubiquinone, 598 
Ubiquitin, 998 
Ubiquitin-activating enzyme, 998 
Ubiquitination, 998 
Ubiquitin-carrier protein, 998 
Ubiquitin-protein ligase, 998 
UCP1, 619B 
UDP-glucose pyrophosphorylase, 556, 
557F, 676, 676F 
UDP-glucose-4-epimerase, 556 
Ultracentrifugation, 132-133 
Ultrafiltration, 129 
Ultraviolet light 
amino acid absorbing, 82 
denaturation of DNA and, 330 
pyrimidine dimer repair and, 890 
pyrimidines and purines and, 293, 293F 
MP synthase, 828 
ncharged amino acid, 74 
ncompetitive inhibition, 401 
ncoupler, 618, 619B 
ncoupling protein-1, 619B, 852 
ndecaprenyl alcohol, 231S 
nfolded protein, natively, 168 
nfolding of protein, 55 
nilamellar vesicle, 244F, 245 
nsaturated fatty acid, 219D, 733-734 
oxidation of, 713-714 
nstructured protein, 168 
nwinding of helix, 863 
pstream, 908 
pstream activation sequence, 927 
ptake, single-strand, 883 
racil, 292, 292F, 307S 
rate oxidase, 825 
rea, 810 
rea cycle, 783, 785, 786F, 787, '787B 
rease, 420T, 826 
rease reaction, 419 
reido group, 784 
reotelic, 810 
ric acid, 293S, 810, 813 
gout caused by, 824-825 
purine catabolism and, 823 
structure of, 814S 
Uricotelic organism, 810 
Uridine, 294, 295S 
Uridine 5’-diphosphate, 296 
Uridine diphosphate glucose, 675G, 6755S, 
676 
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Uridine diphosphoglucose, 57T 
Uridine 5'-monophosphate, 296S 
Uridylic acid, 295 

Uridylylation, 461T 

Uroguanylin, 1021 

Uronic acid, 188 

UTP, UMP synthesis and, 829 
U-turn, 343 


Vacuolar ATPase, 282 
Vacuole, 20, 24T 
Vacuum blotting, 364B 
Valine 
catabolism of, 807, 808F 
pX, value for, 77T 
structure of, 72S 
synthesis of, 793, 794F 
van der Waals contact distance, 12 
van der Waals forces, 12, 12F 
van der Waals interaction 
description of, 11T 
DNA and, 321-322 
ubiquitousness of, 136 
van der Waals interaction energy, 12F 
Vancocin, 207B 
Van’t Hoff plot, 50 
Vaporization, 44 
Variable loop, 345 
Vasoactive intestinal peptide, 1056 
V(D)J recombinase, 900-902 
Vector 
cloning, 355, 355F 
expression, 369, 369F, 370F 
expression-cloning, 370F 
plasmid, 356F 
shuttle, 360, 360F 
Vein, portal, 680, 680F 
Velocity 
of enzyme-catalyzed reaction, 386 
in Michaelis-Menten equation, 390-391 
Venom, snake, 308F 
Versican, 209, 210F 
Very long-chain acyl-CoA dehydrogenase, 
704-705, 705F 
Very-low-density lipoprotein, 757, 758 
Vesicle 
coated, 758 
multilamellar, 244F, 245 
thylakoid, 631, 656 
unilamellar, 244F, 245 
Vesicle-mediated transport, 268F 
Vevesca, 207B 
VH gene, 900 
Viagra, 404B, 404S 
Viceroy butterfly, 282B 
Vicinal XOH groups of RNA, 308F 
Vinblastine, 284F, 494B 
Vinca alkaloids, 494B 
Vincristine, 284F, 494B 
Viracept, 443S 
Viral neuraminidase inhibitor, 425B 
Viral transforming protein, 1023 
Virus, 876B 
in gene delivery system, 377, 378F 
Rous sarcoma, 1023 
structure and function of, 21-22, 24F-25F 
Vitamin By» 
activation of, 712B 
catalysis of, 711-712 
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Vitamin C deficiency, 155B 

Vitamin K epoxide reductase, 232B 

Vitamin K reductase, 232B 

Vitamins, 523, 524T 

Vitellogenin gene, 940 

VLDL, 757, 758 

Voltage-dependent anion channel, 
624 

Voltage-gated ion channel, 1044 

von Gierke disease, 681B 

V-type ATPase, 1050 


Walker, John, 613 
Wall, cell, 19, 20T 
bacterial, 201-204, 202S, 203F 
of plant cell, 24T 
Warburg, Otto, 535 
Warfarin, 232B 
Water, 28-47 
buffers and, 41—44 
environmental role of, 44—45 
lipid structures formed in, 244—245, 
244F, 245S 
metabolic, 710 
pH and, 35-40, 36T 
properties of, 28-35 
colligative, 33-34 
forming ions, 34-35 
hydrogen bonding as, 29-30 
solvent, 30-34 
unusual, 28 
structure of, 29S 
Water:plastoquinone oxidoreductase, 
638 
Watson, J. D., 862 
Watson-Crick base pair, 321 
Watson-Crick postulate of double helix, 
301, 301F 
Wavelength, 634 
Wax, 229, 229B, 230S 
WD40 repeat domain, 625 
Weak acid, polyprotic, 76-78 
Weak electrolyte, 37 
Weak force 
chemical, 11, 11T 
biomolecular recognition and, 
14-15 
environmental conditions and, 15 
quaternary protein structure and, 174, 
176 
Web, spider, 144 
Western blotting, 364B 
Wheel, helical, 153, 1033, 1033F 
Whole genome sequencing, 17-18 
Wild-type human sulfite oxidase, 396B 
Wobble hypothesis of codon-anticodon 
pairing, 959 
Woodward, R. B., 753B 
Work, mechanical, 49 
Writhe, 334 
Wyman, Jeffries, 457 
Waa, 256 


Xanthine, 293S 

Xanthine oxidase, 823-824, 825F 
Xylitol, 189 

a-Xylosyl-threonine, 205S 
D-Xylulose, 183S 
Xyluose-5-phosphate, 692-693, 692F 
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YAC, 360 

Yeast 
enolase, 550F 
hexokinase, 540F 
phenylalanine tRNA, 346F, 347F, 347G 
RNA polymerase II subunits of, 926T 
rRNA of, 348F 

Yeast artificial chromosome, 360 

Yeast mating factor, 260 

Yeast two-hybrid system, 372-373, 372F 


Z line, 483 

Z scheme, 638, 639F, 640 

Z-DNA, 323-324, 324F, 325T, 326 
cytosine methylation and, 326 

Zero-order kinetics, 389 

Zinc finger, 936, 937F 

Zipper, ribose, 344 

Zipper motif, 938 

Zn finger motif, 937-938, 937F 

Zn-finger, 936, 937F 


Z-variant of protein, 171B 
Zwitterion, 70D, 76F 
Zymogen, 434 
as enzyme precursor, 454—455, 
454F-455F, 454T 
gastric and pancreatic, 454—455, 454T 
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ADH 
ADP 
AIDS 
AMP 
ALA 
ATCase 
atm 
ATP 
BChl 
bp 
BPG 
BPheo 
C 


CMP 
CoA or CoASH 
CoQ 
cpm 
CTP 


Fag 
FAD 
FADH, 
FBP 
FBPase 
Fd 
fMet 
FMN 
F-1-P 


Common Abbreviations Used by Biochemists 


adenine or the amino acid alanine 
antibody 

antigen 

acetyl-coenzyme A 

acetylcholine 

acyl carrier protein 

alcohol dehydrogenase 

adenosine diphosphate 

acquired immunodeficiency syndrome 
adenosine monophosphate 
6-aminolevulinic acid 

aspartate transcarbamoylase 
atmosphere 

adenosine triphosphate 
bacteriochlorophyll 

base pair 

bisphosphoglycerate 
bacteriopheophytin 

cytosine or the amino acid cysteine 
calorie 

calmodulin 

cyclic 3',5'-adenosine monophosphate 
catabolite activator protein 
complementary DNA 

cytidine diphosphate 
complementarity-determining region 
chlorophyll 

carboxymethyl 

cytidine monophosphate 
coenzyme A 

coenzyme Q 

counts per minute 

cytidine triphosphate 

dalton or the amino acid aspartate 
deoxy 

dideoxy 

diacylglycerol 

diethylaminoethyl 
dihydroxyacetone phosphate 
dihydrofolate 

dihydrofolate reductase 
dinitrophenol 
dihydroxyphenylalanine 
deoxyribonucleic acid 
double-stranded 

reduction potential 
erythrose-4-phosphate 

elongation factor 

epidermal growth factor 

electron paramagnetic resonance 
endoplasmic reticulum 
electrospray ionization mass spectrometry 
Faraday’s constant 

antibody molecule fragment that binds antigen 
flavin adenine dinucleotide 
reduced flavin adenine dinucleotide 
fructose-1,6-bisphosphate 

fructose bisphosphatase 
ferredoxin 

N-formyl-methionine 

flavin mononucleotide 
fructose-1-phosphate 


F-6-P 


Hb 

HDL 
HGPRT 
HIV 
HMG-CoA 
hnRNA 
HPLC 


fructose-6-phosphate 

guanine or the amino acid glycine 

Gibbs free energy 

y-aminobutyric acid 

galactose 

guanosine diphosphate 

green fluorescent protein 

gas-liquid chromatography 

glucose 

guanosine monophosphate 

glucose-1-phosphate 

glyceraldehyde-3-phosphate 

glucose-6-phosphate 

glucose-6-phosphate dehydrogenase 

glutamine synthetase 

glutathione (reduced glutathione) 

glutathione disulfide (oxidized glutathione) 

guanosine triphosphate 

hour 

Planck’s constant 

hemoglobin 

high-density lipoprotein 

hypoxanthine-guanine phosphoribosyltransferase 

human immunodeficiency virus 

hydroxymethylglutaryl-coenzyme A 

heterogeneous nuclear RNA 

high-pressure (or high-performance) liquid 
chromatography 

hypoxanthine 

hydroxylysine 

hydroxyproline 

inosine or the amino acid isoleucine 

intermediate density lipoprotein 

initiation factor 

immunoglobulin G 

inosine monophosphate 

inositol-1, 4, 5-trisphosphate 

isopropylthiogalactoside 

infrared 

inosine triphosphate 

joule 

kilo (10°) 

rate constant 

Michaelis constant 

kilobases 

kilodaltons 

liter 

lactate dehydrogenase 

low density lipoprotein 

lysine 

molar 

milli (107%) 

milliliter 

millimeter 

mannose 

myoglobin 

micro RNA 

mole 

messenger RNA 

mass spectrometry 

millivolt 

Avogadro’s number 





PEP 
PEPCK 
PFK 
PG 
2-PG 
3-PG 
PDGF 
PGI 
PGK 
PGM 
Pheo 
PIP. 
PK 
PKU 
PLP 
Pol 
PP; 


nano (107°) 

nicotinamide adenine dinucleotide 

reduced nicotinamide adenine dinucleotide 

nicotinamide adenine dinucleotide phosphate 

reduced nicotinamide adenine dinucleotide 
phosphate 

N-acetylglucosamine 

N-acetylmuramic acid 

N-acetylneuraminic acid 

noncoding RNA 

nanometer 

nuclear magnetic resonance 

orotidine 

phosphate or the amino acid proline 

pico (107!*) 

inorganic phosphate 

polyacrylamide gel electrophoresis 

porphobilinogen 

plastocyanin or the phospholipid 
phosphatidylcholine 

polymerase chain reaction 

phosphatidylethanolamine 

phosphoenolypyruvate 

PEP carboxykinase 

phosphofructokinase 

prostaglandin 

2-phosphoglycerate 

3-phosphoglycerate 

platelet-derived growth factor 

phosphoglucoisomerase 

phosphoglycerate kinase 

phosphoglucomutase 

pheophytin 

phosphatidylinositol-4,5-bisphosphate 

pyruvate kinase 

phenylketonuria 

pyridoxal-5-phosphate 

polymerase 

pyrophosphate ion 

plastoquinone 

5-phosphoribosyl-1-pyrophosphate 

photosystem or the phospholipid 
phosphatidylserine 

phenylthiohydandoin 

coenzyme Q, ubiquinone or the amino acid 
glutamine 

reduced coenzyme Q (ubiquinol) 


Abbreviations for Amino Acids 


gas constant or the amino acid arginine 
revolution 

rough endoplasmic reticulum 
release factor 

restriction fragment length polymorphism 
ribonucleic acid 

RNA interference 

ribosomal RNA 
ribose-5-phosphate 
ribulose-1,5-bisphosphate 
ribulose-5-phosphate 

Svedberg unit or the amino acid serine 
second 

sedimentation coefficient 
S-adenosylmethionine 

sodium dodecylsulfate 

small interfering RNA 
sphingomyelin 

situation normal, all fouled up 
small nuclear RNA 

small nucleolar RNA 

small RNAs 

small nuclear ribonucleoprotein 
signal recognition particle 
thymine or the amino acid threonine 
absolute temperature 
tricarboxylic acid cycle 
thymidine diphosphate 
tetrahydrofolate 

thymidine monophosphate 
tobacco mosaic virus 

thiamine pyrophosphate 
thymidine triphosphate 

uracil 

uridine diphosphate 
UDP-glucose 

uridine monophosphate 
uridine triphosphate 

ultraviolet 

volt or the amino acid valine 
maximal velocity 

very low density lipoprotein 
xanthine 

xanthine monophosphate 
xylulose-5-phosphate 

year 

yeast artificial chromosome 





Alanine 
Arginine 
Asparagine 
Aspartate 
Cysteine 
Glutamate 
Glutamine 
Glycine 
Histidine 


Isoleucine 


Q 
z 
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Leucine Leu 
Lysine Lys 
Methionine Met 
Phenylalanine Phe 
Proline Pro 
Serine Ser 
Threonine Thr 
Tryptophan Trp 
Tyrosine Tyr 
Valine Val 
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Physical Constants 








Name Symbol SI Units cgs Units 

Avogadro’s number N 6.022137 X 1073/mol 6.022137 xX 107>/mol 
Boltzmann constant k 1.38066 x 107° J/K 1.38066 x 107!° erg/K 
Curie Ci 3.7 X 10! d/s 3.7 X 10! d/s 

Electron charge e 1.602177 X 107" coulombt} 4.80321 X 107" esu 
Faraday constant F 96485 J/V - mol 9.6485 X 10!!! erg/V - mol 
Gas constant* R 8.31451 J/K : mol 8.31451 X 107 erg/K : mol 
Gravity acceleration g 9.80665 m/s? 980.665 cm/s? 

Light speed (vacuum) c 2.99792 X 108 m/s 2.99792 X 10!° cm/s 
Planck’s constant h 6.626075 X 10-4] -s 6.626075 X 107” erg > s 





*Other values of R: 1.9872 cal/K - mol = 0.082 liter - atm/K - mol. 


tl coulomb = 1 J/V. 


Conversion Factors 





Energy: 1 Joule = 10” ergs = 0.239 cal 
1 cal = 4.184 Joule 
Length: 1 nm = 10 A=1X~" cm 
Mass: 1 kg = 1000 g = 2.2 Ib 
1 lb = 453.6 g 


Pressure: 1 atm = 760 torr = 14.696 psi 
l torr = 1 mm Hg 
Temperature: K = °C + 273 
C = (5/9) (°F — 32) 
Volume: 1 liter = 1 X 1073 më = 1000 cm? 





Useful Equations 





The Henderson-Hasselbalch Equation 
pH = pK, + log([A"]/[HA]) 

The Michaelis-Menten Equation 
v= Vinax[S]/ (Kn + [S]) 


Temperature Dependence of the Equilibrium Constant 


AH?” = —Rd(In K.q)/d(1/T) 


Free Energy Change under Non-Standard-State Conditions 


AG=AG° + RT In ([C][D]/[AJ[B]) 


Free Energy Change and Standard Reduction Potential 
AG” = —nF¥AE: 
Reduction Potentials in a Redox Reaction 
A€: = AE‘(acceptor) — €‘(donor) 
The Proton-Motive Force 
Ap = AY — (2.3 RT/F#)ApH 
Passive Diffusion of a Charged Species 
AG= G — G = RT In(Cy/C,) + ZFAW 





The Standard Genetic Code 





AAA Lysine CAA 
AAC Asparagine CAC 
AAG Lysine CAG 
AAU Asparagine CAU 
ACA Threonine CCA 
ACC Threonine CCC 
ACG Threonine CCG 
ACU Threonine CCU 
AGA Arginine CGA 
AGC Serine CGC 
AGG Arginine CGG 
AGU Serine CGU 
AUA Isoleucine CUA 
AUC Isoleucine CUC 
AUG Methionine* CUG 
AUU Isoleucine CUU 


Glutamine 
Histidine 
Glutamine 
Histidine 
Proline 
Proline 
Proline 
Proline 
Arginine 
Arginine 
Arginine 
Arginine 
Leucine 
Leucine 
Leucine 


Leucine 





GAA Glutamate UAA stop 

GAC Aspartate UAC Tyrosine 

GAG Glutamate UAG stop 

GAU Aspartate UAU Tyrosine 

GCA Alanine UCA Serine 

GCC Alanine UCC Serine 

GCG Alanine UCG Serine 

GCU Alanine UCU Serine 

GGA Glycine UGA stop 

GGC Glycine UGC Cysteine 

GGG Glycine UGG Tryptophan 
GGU Glycine UGU Cysteine 

GUA Valine UUA Leucine 

GUC Valine UUC Phenylalanine 
GUG Valine UUG Leucine 

GUU Valine UUU Phenylalanine 





*AUG also serves as the principal initiation codon. 


Icons and Colors in Illustrations 


The following symbols and colors are used in this text to help in illustrating structures, 
reactions, and biochemical principles. 


Elements: 
| | = Oxygen | | = Nitrogen |] = Phosphorus | = Sulfur = Carbon 


Inorganic phosphate and pyrophosphate are sometimes symbolized by the following icons: 


® PP 


Inorganic phosphate (P;) Pyrophosphate (PP;) 


Sugars: 
Glucose Galactose Mannose Fructose Ribose 
Nucleotides: 
= Guanine = Cytosine = Adenine = Thymine = Uracil 
Amino acids: 
| | = Non-polar/hydrophobic = Polar/uncharged = Acidic = Basic 
Enzymes: 


+ = Enzyme activation 


O = Enzyme inhibition or inactivation 


© = Enzyme = Enzyme Enzyme names are printed in red. 


In reactions, blocks of color over parts of molecular structures are used so that discrete parts of the 
reaction can be easily followed from one intermediate to another, making it easy to see where the 
reactants originate and how the products are produced. 


Some examples: 





O 
OS } E- —OH — COO 
d 
Phosphoryl group Hydroxyl group Amino group Carboxyl group 


Red arrows are used to indicate nucleophilic attack. f .. ka 


These colors are internally consistent within reactions and are generally consistent 
within the scope of a chapter or treatment of a particular topic. 
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